ID NC003070_1 HYPOTHETICAL; PRT; 490 AA. AC NC003070_1; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[264...327, 3641...3913, 3996...4276, DE 4486...4605, 4706...5095, 5174...5326, 5439...5630]; Length: 1473. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 21 FIRST EXON; p-value: NaN. FT GENSCAN 22 22 AA on splice site: t/at -> Y. FT GENSCAN 23 112 INTERNAL EXON; p-value: NaN. FT GENSCAN 113 113 AA on splice site: t/tc -> F. FT GENSCAN 114 206 INTERNAL EXON; p-value: NaN. FT GENSCAN 207 246 INTERNAL EXON; p-value: NaN. FT GENSCAN 247 376 INTERNAL EXON; p-value: NaN. FT GENSCAN 377 427 INTERNAL EXON; p-value: NaN. FT GENSCAN 428 490 LAST EXON; p-value: NaN. SQ SEQUENCE 490 AA; 56714 MW; E32E7DCF02E52434 CRC64; MDGLSSFVIL DTSFATIYIW EYIPNQRKQI HNRRNTDYRE RERSTAKLFT RKPLKSDGLV KMEDQVGFGF RPNDEELVGH YLRNKIEGNT SRDVEVAISE VNICSYDPWN LRFQSKYKSR DAMWYFFSRR ENNKGNRQSR TTVSGKWKLT GESVEVKDQW GFCSEGFRGK IGHKRVLVFL DGRYPDKTKS DWVIHEFHYD LLPEHQRTYV ICRLEYKGDD ADILSAYAID PTPAFVPNMT SSAGSVVNQS RQRNSGSYNT YSEYDSANHG QQFNENSNIM QQQPLQGSFN PLLEYDFANH GGQWLSDYID LQQQVPYLAP YENESEMIWK HVIEENFEFL VDERTSMQQH YSDHRPKKPV SGVLPDDSSD TETGSMIFED TSSSTDSVGS SDEPGHTRID DIPSLNIIEP LHNYKAQEQP KQQSKEKVIS SQKSECEWKM AEDSIKIPPS TNTVKQSWIV LENAQWNYLK NMIIGVLLFI SVISWIILVG // ID NC003070_2 HYPOTHETICAL; PRT; 140 AA. AC NC003070_2; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[8666...8625, 8464...8417, 8325...8236, DE 7232...7157, 6594...6428]; Length: 423. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 14 FIRST EXON; p-value: NaN. FT GENSCAN 15 30 INTERNAL EXON; p-value: NaN. FT GENSCAN 31 60 INTERNAL EXON; p-value: NaN. FT GENSCAN 61 85 INTERNAL EXON; p-value: NaN. FT GENSCAN 86 86 AA on splice site: g/at -> D. FT GENSCAN 87 140 LAST EXON; p-value: NaN. SQ SEQUENCE 140 AA; 15954 MW; 64F6555BAC4AEAE8 CRC64; MAASEHRCVG CGFRGNCKEV ADEYIECERM IIFIDLILHR PKVYRHVLYN AINPATVNIQ VWEFPMSVIF FVDILLLTSN SMALKDMWDS PMNVGHQHIV ATPSECHRKD EETVEAEVEA PLDSYPENAT TMARKQPKAS // ID NC003070_3 HYPOTHETICAL; PRT; 358 AA. AC NC003070_3; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[12940...11864]; Length: 1077. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 358 SINGLE EXON; p-value: NaN. SQ SEQUENCE 358 AA; 40288 MW; DCF366CC2D388DAC CRC64; MDLSLAPTTT TSSDQEQDRD QELTSNIGAS SSSGPSGNNN NLPMMMIPPP EKEHMFDKVV TPSDVGKLNR LVIPKQHAER YFPLDSSNNQ NGTLLNFQDR NGKMWRFRYS YWNSSQSYVM TKGWSRFVKE KKLDAGDIVS FQRGIGDESE RSKLYIDWRH RPDMSLVQAH QFGNFGFNFN FPTTSQYSNR FHPLPEYNSV PIHRGLNIGN HQRSYYNTQR QEFVGYGYGN LAGRCYYTGS PLDHRNIVGS EPLVIDSVPV VPGRLTPVML PPLPPPPSTA GKRLRLFGVN MECGNDYNQQ EESWLVPRGE IGASSSSSSA LRLNLSTDHD DDNDDGDDGD DDQFAKKGKS SLSLNFNP // ID NC003070_4 HYPOTHETICAL; PRT; 1763 AA. AC NC003070_4; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[23525...24451, 24752...24962, 25041...25435, DE 25524...25743, 25825...25997, 26081...26203, 26292...26452, DE 26543...26776, 26862...27012, 27099...27281, 27618...27713, DE 27803...28431, 28612...28727, 28890...29080, 29160...30065, DE 30147...30311, 30410...30820]; Length: 5292. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 309 FIRST EXON; p-value: NaN. FT GENSCAN 310 379 INTERNAL EXON; p-value: NaN. FT GENSCAN 380 380 AA on splice site: g/gt -> G. FT GENSCAN 381 511 INTERNAL EXON; p-value: NaN. FT GENSCAN 512 584 INTERNAL EXON; p-value: NaN. FT GENSCAN 585 585 AA on splice site: g/tg -> V. FT GENSCAN 586 642 INTERNAL EXON; p-value: NaN. FT GENSCAN 643 683 INTERNAL EXON; p-value: NaN. FT GENSCAN 684 736 INTERNAL EXON; p-value: NaN. FT GENSCAN 737 737 AA on splice site: ag/a -> R. FT GENSCAN 738 814 INTERNAL EXON; p-value: NaN. FT GENSCAN 815 815 AA on splice site: ag/g -> R. FT GENSCAN 816 865 INTERNAL EXON; p-value: NaN. FT GENSCAN 866 926 INTERNAL EXON; p-value: NaN. FT GENSCAN 927 958 INTERNAL EXON; p-value: NaN. FT GENSCAN 959 1167 INTERNAL EXON; p-value: NaN. FT GENSCAN 1168 1168 AA on splice site: aa/g -> K. FT GENSCAN 1169 1206 INTERNAL EXON; p-value: NaN. FT GENSCAN 1207 1207 AA on splice site: t/gt -> C. FT GENSCAN 1208 1270 INTERNAL EXON; p-value: NaN. FT GENSCAN 1271 1572 INTERNAL EXON; p-value: NaN. FT GENSCAN 1573 1627 INTERNAL EXON; p-value: NaN. FT GENSCAN 1628 1763 LAST EXON; p-value: NaN. SQ SEQUENCE 1763 AA; 197445 MW; 26368BCE82153341 CRC64; MEDEPREATI KPSYWLDACE DISCDLIDDL VSEFDPSSVA VNESTDENGV INDFFGGIDH ILDSIKNGGG LPNNGVSDTN SQINEVTVTP QVIAKETVKE NGLQKNGGKR DEFSKEEGDK DRKRARVCSY QSERSNLSGR GHVNNSREGD RFMNRKRTRN WDEAGNNKKK RECNNYRRDG RDREVRGYWE RDKVGSNELV YRSGTWEADH ERDVKKVSGG NRECDVKAEE NKSKPEERKE KVVEEQARRY QLDVLEQAKA KNTIAFLETG AGKTLIAILL IKSVHKDLMS QNRKMLSVFL VPKVPLVYQV LVMTAQILLN ILRHSIIRME TIDLLILDEC HHAVKKHPYS LVMSEFYHTT PKDKRPAIFG MTASPVNLKG VSSQVDCAIK IRNLETKLDS TVCTIKDRKE LEKHVPMPSE IVVEYDKAAT MWSLHETIKQ MIAAVEEAAQ ASSRKSKWQF MGARDAGAKD ELRQVYGVSE RTESDGAANL IHKLRAINYT LAELGQWCAY KVGQSFLSAL QSDERVNFQV DVKFQESYLS EVVSLLQCEL LEGAAAEKVA AEVGKPENGN AHDEMEEGEL PDDPVVSGGE HVDEVIGAAV ADGKVTPKVQ SLIKLLLKYQ HTADFRAIVF VERVVAALVL PKVFAELPSL SFIRCASMIG HNNSQEMKSS QMQDTISKFR DGHVTLLVAT SVAEEGLDIR QCNVVMRFDL AKTVLAYIQS RGRARKPGSD YILMVERGNV SHAAFLRNAR NSEETLRKEA IERTDLSHLK DTSRLISIDA VPGTVYKVEA TGAMVSLNSA VGLVHFYCSQ LPGDRYAILR PEFSMEKHEK PGGHTEYSCR LQLPCNAPFE ILEGPVCSSM RLAQQAVCLA ACKKLHEMGA FTDMLLPDKG SGQDAEKADQ DDEGEPVPGT ARHREFYPEG VADVLKVLSM SMDLYVARAM ITKASLAFKG SLDITENQLS SLKKFHVRLM SIVLDVDVEP STTPWDPAKA YLFVPVTDNT SMEPIKGINW ELVEKITKTT AWDNPLQRAR PDVYLGTNER TLGGDRREYG FGKLRHNIVF GQKSHPTYGI RGAVASFDVV RASGLLPVRD AFEKEVEEDL SKGKLMMADG CMVAEDLIGK IVTAAHSGKR FYVDSICYDM SAETSFPRKE GYLGPLEYNT YADYYKQKIY VVQDRLFFYF LHNLRLLRLY KSSSIMLFIR YGVDLNCESE TVLDKTYYVF LPPELCVVHP LSGSLIRGAQ RLPSIMRRVE SMLLAVQLKN LISYPIPTSK ILEALTAASC QETFCYERAE LLGDAYLKWV VSRFLFLKYP QKHEGQLTRM RQQMVSNMVL YQFALVKGLQ SYIQADRFAP SRWSAPGVPP VFDEDTKDGG SSFFDEEQKP VSEENSDVFE DGEMEDGELE GDLSSYRVLS SKTLADVVEA LIGVYYVEGG KIAANHLMKW IGIHVEDDPD EVDGTLKNVN VPESVLKSID FVGLERALKY EFKEKGLLVE AITHASRPSS GVSCYQRLEF VGDAVLDHLI TRHLFFTYTS LPPGRLTDLR AAAVNNENFA RVAVKHKLHL YLRHGSSALE KQIREFVKEV QTESSKPGFN SFGLGDCKAP KVLGDIVESI AGAIFLDSGK DTTAAWKVFQ PLLQPMVTPE TLPMHPVREL QERCQQQAEG LEYKASRSGN TATVEVFIDG VQVGVAQNPQ KKMAQKLAAR NALAALKEKE IAESKEKHIN NGNAGEDQGE NENGNKKNGH QPFTRQTLND ICLRKNWPMP SYR // ID NC003070_5 HYPOTHETICAL; PRT; 1869 AA. AC NC003070_5; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[50954...50883, 50631...50419, 50254...50229, DE 49026...48936, 48852...48075, 47982...47746, 46611...46373, DE 46145...46044, 45954...45757, 45559...45514, 40885...40675, DE 40329...40213, 39814...39678, 39566...39409, 39287...39136, DE 39054...38904, 37100...37023, 36921...36810, 36685...36624, DE 35999...35730, 35647...35567, 35471...34401, 34327...34034, DE 33088...33029, 32670...32547, 32459...32431, 32347...32282, DE 32195...32088, 31998...31933, 31813...31639, 31602...31517]; Length: DE 5610. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 24 FIRST EXON; p-value: NaN. FT GENSCAN 25 95 INTERNAL EXON; p-value: NaN. FT GENSCAN 96 103 INTERNAL EXON; p-value: NaN. FT GENSCAN 104 104 AA on splice site: tc/a -> S. FT GENSCAN 105 134 INTERNAL EXON; p-value: NaN. FT GENSCAN 135 393 INTERNAL EXON; p-value: NaN. FT GENSCAN 394 394 AA on splice site: g/ct -> A. FT GENSCAN 395 472 INTERNAL EXON; p-value: NaN. FT GENSCAN 473 473 AA on splice site: t/ct -> S. FT GENSCAN 474 552 INTERNAL EXON; p-value: NaN. FT GENSCAN 553 586 INTERNAL EXON; p-value: NaN. FT GENSCAN 587 652 INTERNAL EXON; p-value: NaN. FT GENSCAN 653 667 INTERNAL EXON; p-value: NaN. FT GENSCAN 668 668 AA on splice site: a/ag -> K. FT GENSCAN 669 737 INTERNAL EXON; p-value: NaN. FT GENSCAN 738 738 AA on splice site: ag/g -> R. FT GENSCAN 739 776 INTERNAL EXON; p-value: NaN. FT GENSCAN 777 777 AA on splice site: ag/g -> R. FT GENSCAN 778 822 INTERNAL EXON; p-value: NaN. FT GENSCAN 823 823 AA on splice site: g/gg -> G. FT GENSCAN 824 875 INTERNAL EXON; p-value: NaN. FT GENSCAN 876 925 INTERNAL EXON; p-value: NaN. FT GENSCAN 926 926 AA on splice site: ag/t -> S. FT GENSCAN 927 976 INTERNAL EXON; p-value: NaN. FT GENSCAN 977 1002 INTERNAL EXON; p-value: NaN. FT GENSCAN 1003 1039 INTERNAL EXON; p-value: NaN. FT GENSCAN 1040 1040 AA on splice site: g/aa -> E. FT GENSCAN 1041 1060 INTERNAL EXON; p-value: NaN. FT GENSCAN 1061 1150 INTERNAL EXON; p-value: NaN. FT GENSCAN 1151 1177 INTERNAL EXON; p-value: NaN. FT GENSCAN 1178 1534 INTERNAL EXON; p-value: NaN. FT GENSCAN 1535 1632 INTERNAL EXON; p-value: NaN. FT GENSCAN 1633 1652 INTERNAL EXON; p-value: NaN. FT GENSCAN 1653 1693 INTERNAL EXON; p-value: NaN. FT GENSCAN 1694 1694 AA on splice site: g/ga -> G. FT GENSCAN 1695 1703 INTERNAL EXON; p-value: NaN. FT GENSCAN 1704 1725 INTERNAL EXON; p-value: NaN. FT GENSCAN 1726 1761 INTERNAL EXON; p-value: NaN. FT GENSCAN 1762 1783 INTERNAL EXON; p-value: NaN. FT GENSCAN 1784 1841 INTERNAL EXON; p-value: NaN. FT GENSCAN 1842 1842 AA on splice site: a/ac -> N. FT GENSCAN 1843 1869 LAST EXON; p-value: NaN. SQ SEQUENCE 1869 AA; 205326 MW; C596BF64E2446E99 CRC64; MSTVGELACS YAVMILEDEG IAITADKIAT LVKAAGVSIE SYWPMLFAKM AEKRNVTDLI MNVGAGGGGG APVAAAAPAA GGGAAAAPAA EEKKKLDIGD YILSLNHSNA TRRSPVVSVQ EVVKEKQSTN NTSLLITKEE GLELYEDMIL GRSFEDMCAQ MYYRGKMFGF VHLYNGQEAV STGFIKLLTK SDSVVSTYRD HVHALSKGVS ARAVMSELFG KVTGCCRGQG GSMHMFSKEH NMLGGFAFIG EGIPVATGAA FSSKYRREVL KQDCDDVTVA FFGDGTCNNG QFFECLNMAA LYKLPIIFVV ENNLWAIGMS HLRATSDPEI WKKGPAFGMP GVHVDGMDVL KVREVAKEAV TRARRGEGPT LVECETYRFR GHSLADPDEL RDAAEKAKYA ARDPIAALKK YLIENKLAKE AELKSIEKKI DELVEEAVEF ADASPQPGRS QLLENVFADP KGFGIGPDGR YRSQPLQIKV SSSELSVLDE EKEEEVVKGE AEPNKDSVVS KAEPVKKPRP CELYVCNIPR SYDIAQLLDM FQPFGTVISV EVVSRNPQTG ESRGSGYVTM GSINSAKIAI ASLDGTEVGG REMRVRYSVD MNPGTRRNPE VLNSTPKKIL MYESQHKVYV GNLPWFTQPD GLRNHFSKFG TIQYEGRRII VREGIEKKRD MAGDMQGVRV VEKYSPVIVM VMSNVAMGSV NALVKKALDV GVNHMVIGAY RMAISALILV PFAYVLERAS LMQFFFLLGL SYTSATVSCA LVSMLPAITF ALALIFRTEN VKILKTKAGM LKVIGTLICI SGALFLTFYK GPQISNSHSH SHGTLSIKYP CKYSSTCLMS IFAAFQCALL SLYKSRDVND WIIDDRFVIT VIIYAGVVGQ AMTTVATTWG IKKLGAVFAS AFFPLTLISA TLFDFLILHT PLYLGSVIGS LVTITGLYMF LWGKNKETES STALSSGMDN EAQYTTPNKD NDSKSPRGFE AANSCTGPVM DTNTSGEELL AKARKPYTIT KQRERWTEDE HERFLEALRL YGRAWQRIEE HIGTKTAVQI RSHAQKFFTK FGKAHSFWFT FQLEKEAEVK GIPVCQALDI EIPPPRPKRK PNTPYPRKPG NNGTSSSQVS SAKDAKLVSS ASSSQLNQAF LDLEKMPFSE KTSTGKENQD ENCSGVSTVN KYPLPTKVSG DIETSKTSTV DNAVQDVPKK NKDKDGNDGT TVHSMQNYPW HFHADIVNGN IAKCPQNHPS GMVSQDFMFH PMREETHGHA NLQATTASAT TTASHQAFPA CHSQDDYRSF LQISSTFSNL IMSTLLQNPA AHAAATFAAS VWPYASVGNS GDSSTPMSSS PPSITAIAAA TVAAATAWWA SHGLLPVCAP APITCVPFST VAVPTPAMTE MDTVENTQPF EKQNTALQDQ NLASKSPASS SDDSDETGVT KLNADSKTND DKIEEVVVTA AVHDSNTAQK KNLVDRSSCG SNTPSGSDAE TDALDKMEKD KEDVKETDEN QPDVIELNNR KIKMRDNNSN NNATTDSWKE VSEEGRIAFQ ALFARERLPQ SFSPPQVAEN VNRKQSDTSM PLAPNFKSQD SCAADQEGVV MIGVGTCKSL KTRQTGFKPY KRCSMEVKES QVGNINNQSD EKFLHHPCDK ENNNRLFRFD PKMSEETKDN QRLQRPAPRL NERILSSLSR RSVAAHPWHD LEIGPGAPQI FNVVVEITKG SKVKYELDKK TGLIKVDRIL YSSVVYPHNY GFVPRTLCED NDPIDVLVIM QEPVLPGCFL RARAIGLMPM IDQGEKDDKI IAVCVDDPEY KHYTDIKELP PHRLSEIRRF FEDCILFLQC SSLFISIDLS TNKKNENKEV AVNDFLPSES AVEAIQYSM // ID NC003070_6 HYPOTHETICAL; PRT; 527 AA. AC NC003070_6; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[52239...52346, 52434...52730, 52938...53183, DE 53484...53624, 53703...54494]; Length: 1584. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 36 FIRST EXON; p-value: NaN. FT GENSCAN 37 135 INTERNAL EXON; p-value: NaN. FT GENSCAN 136 217 INTERNAL EXON; p-value: NaN. FT GENSCAN 218 264 INTERNAL EXON; p-value: NaN. FT GENSCAN 265 527 LAST EXON; p-value: NaN. SQ SEQUENCE 527 AA; 59173 MW; 058E9B7B2A2EAE04 CRC64; MGKKNGSSSW LTAVKRAFRS PTKKDHSNDV EEDEEKKREK RRWFRKPATQ ESPVKSSGIS PPAPQEDSLN VNSKPSPETA PSYATTTPPS NAGKPPSAVV PIATSASKTL APRRIYYARE NYAAVVIQTS FRGYLARRAL RALKGLVKLQ ALVRGHNVRK QAKMTLRCMQ ALVRVQSRVL DQRKRLSHDG SRKSAFSDSH AVFESRYLQD LSDRQSMSRE GSSAAEDWDD RPHTIDAVKV MLQRRRDTAL RHDKTNLSQA FSQKMWRTVG NQSTEGHHEV ELEEERPKWL DRWMATRPWD KRASSRASVD QRVSVKTVEI DTSQPYSRTG AGSPSRGQRP SSPSRTSHHY QSRNNFSATP SPAKSRPILI RSASPRCQRD PREDRDRAAY SYTSNTPSLR SNYSFTARSG CSISTTMVNN ASLLPNYMAS TESAKARIRS HSAPRQRPST PERDRAGLVK KRLSYPVPPP AEYEDNNSLR SPSFKSVAGS HFGGMLEQQS NYSSCCTESN GVEISPASTS DFRNWLR // ID NC003070_7 HYPOTHETICAL; PRT; 511 AA. AC NC003070_7; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[58927...57392]; Length: 1536. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 511 SINGLE EXON; p-value: NaN. SQ SEQUENCE 511 AA; 57214 MW; 1086DDE1F73D6492 CRC64; MAFRDSSSAV IRIRRRLPDL LTSVKLKYVK LGLHNSCNVT TILFFLIILP LTGTVLVQLT GLTFDTFSEL WSNQAVQLDT ATRLTCLVFL SFVLTLYVAN RSKPVYLVDF SCYKPEDERK ISVDSFLTMT EENGSFTDDT VQFQQRISNR AGLGDETYLP RGITSTPPKL NMSEARAEAE AVMFGALDSL FEKTGIKPAE VGILIVNCSL FNPTPSLSAM IVNHYKMRED IKSYNLGGMG CSAGLISIDL ANNLLKANPN SYAVVVSTEN ITLNWYFGND RSMLLCNCIF RMGGAAILLS NRRQDRKKSK YSLVNVVRTH KGSDDKNYNC VYQKEDERGT IGVSLARELM SVAGDALKTN ITTLGPMVLP LSEQLMFLIS LVKRKMFKLK VKPYIPDFKL AFEHFCIHAG GRAVLDEVQK NLDLKDWHME PSRMTLHRFG NTSSSSLWYE MAYTEAKGRV KAGDRLWQIA FGSGFKCNSA VWKALRPVST EEMTGNAWAG SIDQYPVKVV Q // ID NC003070_8 HYPOTHETICAL; PRT; 630 AA. AC NC003070_8; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[67512...67324, 66897...66835, 66749...66678, DE 66557...66450, 66342...66262, 66160...66107, 65864...65739, DE 65652...65563, 65498...65331, 65217...65110, 65017...64901, DE 64807...64751, 64656...64582, 63820...63431, 62124...62050, DE 59925...59806]; Length: 1893. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 63 FIRST EXON; p-value: NaN. FT GENSCAN 64 84 INTERNAL EXON; p-value: NaN. FT GENSCAN 85 108 INTERNAL EXON; p-value: NaN. FT GENSCAN 109 144 INTERNAL EXON; p-value: NaN. FT GENSCAN 145 171 INTERNAL EXON; p-value: NaN. FT GENSCAN 172 189 INTERNAL EXON; p-value: NaN. FT GENSCAN 190 231 INTERNAL EXON; p-value: NaN. FT GENSCAN 232 261 INTERNAL EXON; p-value: NaN. FT GENSCAN 262 317 INTERNAL EXON; p-value: NaN. FT GENSCAN 318 353 INTERNAL EXON; p-value: NaN. FT GENSCAN 354 392 INTERNAL EXON; p-value: NaN. FT GENSCAN 393 411 INTERNAL EXON; p-value: NaN. FT GENSCAN 412 436 INTERNAL EXON; p-value: NaN. FT GENSCAN 437 566 INTERNAL EXON; p-value: NaN. FT GENSCAN 567 591 INTERNAL EXON; p-value: NaN. FT GENSCAN 592 630 LAST EXON; p-value: NaN. SQ SEQUENCE 630 AA; 71521 MW; B67DC360B3537DCB CRC64; MSGSRRKATP ASRTRVGNYE MGRTLGEGSF AKVKYAKNTV TGDQAAIKIL DREKVFRHKM VEQLKREIST MKLIKHPNVV EIIEVMASKT KIYIVLELVN GGELFDKIAQ QGRLKEDEAR RYFQQLINAV DYCHSRGVYH RDLKPENLIL DANGVLKVSD FGLSAFSRQV REDGLLHTAC GTPNYVAPEV LSDKGYDGAA ADVWSCGVIL FVLMAGYLPF DEPNLMTLYK RICKAEFSCP PWFSQGAKRV IKRILEPNPI TIVQELKYVL ISEMQRISIA ELLEDEWFKK GYKPPSFDQD DEDITIDDVD AAFSNSKECL VTEKKEKPVS MNAFELISSS SEFSLENLFE KQAQLVKKET RFTSQRSASE IMSKMEETAK PLGFNVRKDN YKIKMKGDKS GRKGQLSVAT EVFEVAPSLH VVELRKTGGD TLEFHKFLRM EKRSDSESVE ILGDWDSPPP EERIVMVSVP TSPESDYARS NQPKEIESRV SDKETASASG EVAARRVLPP WMDPSYEWGG GKWKVDGRKN KNKKEKEKEK EEIIPFKEII EALLGNSGDK VQQDNKVFEV APSLHVVELR KTGDDTLEFH KEFVCMWDTH LYKEITNLNI WDTLSSTLVL AIWTVNASHE // ID NC003070_9 HYPOTHETICAL; PRT; 352 AA. AC NC003070_9; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[71998...71942, 71721...71041, 70968...70840, DE 70285...70139, 68863...68819]; Length: 1059. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: NaN. FT GENSCAN 20 246 INTERNAL EXON; p-value: NaN. FT GENSCAN 247 289 INTERNAL EXON; p-value: NaN. FT GENSCAN 290 338 INTERNAL EXON; p-value: NaN. FT GENSCAN 339 352 LAST EXON; p-value: NaN. SQ SEQUENCE 352 AA; 40043 MW; 1D0CECD50690284D CRC64; MVSDLPLDED DIALLKSPYI GEIVEEIGFV REKRIAHCIV QCDDGGDEDV NSAPNIFTYD NVPLKKRHYL GTSDTFRSFE PLNEHACIVC DIADDGVVPC SGNECPLAVH RKCVELDCED PATFYCPYCW FKEQATRSTA LRTRGVAAAK TLVQYGCSEL RSGDIVMTRE NSQLENGSDN SLPMQLHENL HQLQELVKHL KARNSQLDES TDQFIDMEKS CGEAYAVVND QPKRVLWTVN EEKMLREGVE KFSDTINKNM PWKKILEMGK GIFHTTRNSS DLKDKWRNMN YLDLVEVEID IIIVVAEIGD QNGEGKKRRS DEEDESNRNH PKIRICGMDS DVRSTRTIYK PS // ID NC003070_10 HYPOTHETICAL; PRT; 154 AA. AC NC003070_10; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[72583...72669, 73087...73163, 73287...73395, DE 73611...73740, 73822...73883]; Length: 465. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 29 FIRST EXON; p-value: NaN. FT GENSCAN 30 54 INTERNAL EXON; p-value: NaN. FT GENSCAN 55 55 AA on splice site: ca/g -> Q. FT GENSCAN 56 91 INTERNAL EXON; p-value: NaN. FT GENSCAN 92 134 INTERNAL EXON; p-value: NaN. FT GENSCAN 135 135 AA on splice site: g/tg -> V. FT GENSCAN 136 154 LAST EXON; p-value: NaN. SQ SEQUENCE 154 AA; 16617 MW; BB33DA930EA8E465 CRC64; MQQQQSPQMF PMVPSIPPAN NITTEQIQKY LDENKKLIMA IMENQNLGKL AECAQYQALL QKNLMYLAAI ADAQPPPPTP GPSPSTAVAA QDPQQQQQIH QQAMQGHMGI RPMGMTNNGM QHAMQQPETG LGGNVGLRGG KQDGADGQGK DDGK // ID NC003070_11 HYPOTHETICAL; PRT; 307 AA. AC NC003070_11; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[75633...76556]; Length: 924. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 307 SINGLE EXON; p-value: NaN. SQ SEQUENCE 307 AA; 34704 MW; 2C77A1381F252722 CRC64; MKEDRRLPHK RDAFQFLKTK AAYVIVIVLT YAFGYFSAYH YHQPLQQQLP PSTTAVETTK PQVCSIDNFR VTTPCGNLVP PELIRQTVID RIFNGTSPYI DFPPPHAKKF LRPKRIKGWG SYGAVFENLI RRVKPKTIVE VGSFLGASAI HMANLTRRLG LEETQILCVD DFRGWPGFRD RFKDMALVNG DVLLMYQFMQ NVVISDFSGS ILPVPFSTGS ALEKLCEWGV TADLVEIDAG HDFNSAWADI NRAVRILRPG GVIFGHDYFT AADNRGVRRA VNLFAEINRL KVKTDGQHWV IDSVKVT // ID NC003070_12 HYPOTHETICAL; PRT; 46 AA. AC NC003070_12; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[78321...78181]; Length: 141. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 46 SINGLE EXON; p-value: NaN. SQ SEQUENCE 46 AA; 5192 MW; 87F07E3B913D0C17 CRC64; MTWPIDCIIV LYAECKIIIV MGFILVVAVR AECNCVHDNG EKPEKI // ID NC003070_13 HYPOTHETICAL; PRT; 528 AA. AC NC003070_13; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[84864...84099, 84062...83884, 83671...83072, DE 79976...79935]; Length: 1587. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 255 FIRST EXON; p-value: NaN. FT GENSCAN 256 256 AA on splice site: g/cc -> A. FT GENSCAN 257 315 INTERNAL EXON; p-value: NaN. FT GENSCAN 316 515 INTERNAL EXON; p-value: NaN. FT GENSCAN 516 528 LAST EXON; p-value: NaN. SQ SEQUENCE 528 AA; 59433 MW; 8804EF91B6A53398 CRC64; MRTEIESLWV FALASKFNIY MQQHFASLLV AIAITWFTIT IVFWSTPGGP AWGKYFFTRR FISLDYNRKY KNLIPGPRGF PLVGSMSLRS SHVAHQRIAS VAEMSNAKRL MAFSLGDTKV VVTCHPAVAK EILNSSVFAD RPVDETAYGL MFNRAMGFAP NGTYWRTLRR LGSNHLFNPK QIKQSEDQRR VIATQMVNAF ARNPKSACAV RDLLKTASLC NMMGLVFGRE YELESNNNLE SECLKGLVEE GYDLLAGLDF QQIRFRCSQL VPKVNLLLSR IIHEQRAATG NFLDMLLSLQ GSEKLSESDM VAVLWEMIFR GTDTVAVLVE WVLARIVMHP KVQLTVHDEL DRVVGRSRTV DESDLPSLTY LTAMIKEVLR LHPPGPLLSW ARLSITDTSV DGYHVPAGTT AMVNMWAIAR DPHVWEDPLE FKPERFVAKE GEAEFSVFGS DLRLAPFGSG KRVCPGKNLG LTTVSFWVAT LLHEFEWLPS VEANPPDLSE VLRLSCEMAC PLIVNKRKID GHQFKSLA // ID NC003070_14 HYPOTHETICAL; PRT; 237 AA. AC NC003070_14; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[88145...87880, 87162...86715]; Length: 714. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 88 FIRST EXON; p-value: NaN. FT GENSCAN 89 89 AA on splice site: ag/a -> R. FT GENSCAN 90 237 LAST EXON; p-value: NaN. SQ SEQUENCE 237 AA; 26250 MW; 5C3A5AFE61E105FC CRC64; MNEEMSGESP ENNKHVKKPT MPEKIDYVFK VVVIGDSAVG KTQLLSRFTH NEFCYDSKST IGVEFQTRTI TLRGKLVKAQ IWDTAGQERY RAVTSAYYRG ALGAMVVYDI TKRLSFDHVA RWVEELRAHA DDSAVIMLVG NKADLSVGKR AVPTEDAVEF AETQRLFFSE VSALSGGNVD EAFFRLLEEI FSRVVVSRKA MESDGGATVK LDGSRIDVIS GSDLETSNIK EQASCSC // ID NC003070_15 HYPOTHETICAL; PRT; 1653 AA. AC NC003070_15; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[88977...89081, 89173...89263, 89405...89501, DE 91743...92070, 92270...92501, 92569...92933, 93045...93171, DE 93271...94281, 94357...95075, 95160...95430, 95743...95872, DE 95979...96157, 96554...97238, 97580...97805, 98457...98605, DE 98908...99013, 99783...99923]; Length: 4962. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 35 FIRST EXON; p-value: NaN. FT GENSCAN 36 65 INTERNAL EXON; p-value: NaN. FT GENSCAN 66 66 AA on splice site: g/cc -> A. FT GENSCAN 67 97 INTERNAL EXON; p-value: NaN. FT GENSCAN 98 98 AA on splice site: aa/c -> N. FT GENSCAN 99 207 INTERNAL EXON; p-value: NaN. FT GENSCAN 208 284 INTERNAL EXON; p-value: NaN. FT GENSCAN 285 285 AA on splice site: g/gt -> G. FT GENSCAN 286 406 INTERNAL EXON; p-value: NaN. FT GENSCAN 407 448 INTERNAL EXON; p-value: NaN. FT GENSCAN 449 449 AA on splice site: t/at -> Y. FT GENSCAN 450 785 INTERNAL EXON; p-value: NaN. FT GENSCAN 786 786 AA on splice site: g/aa -> E. FT GENSCAN 787 1025 INTERNAL EXON; p-value: NaN. FT GENSCAN 1026 1115 INTERNAL EXON; p-value: NaN. FT GENSCAN 1116 1116 AA on splice site: g/tt -> V. FT GENSCAN 1117 1158 INTERNAL EXON; p-value: NaN. FT GENSCAN 1159 1159 AA on splice site: gg/a -> G. FT GENSCAN 1160 1218 INTERNAL EXON; p-value: NaN. FT GENSCAN 1219 1219 AA on splice site: g/ga -> G. FT GENSCAN 1220 1446 INTERNAL EXON; p-value: NaN. FT GENSCAN 1447 1447 AA on splice site: ag/a -> R. FT GENSCAN 1448 1522 INTERNAL EXON; p-value: NaN. FT GENSCAN 1523 1571 INTERNAL EXON; p-value: NaN. FT GENSCAN 1572 1572 AA on splice site: ct/g -> L. FT GENSCAN 1573 1607 INTERNAL EXON; p-value: NaN. FT GENSCAN 1608 1653 LAST EXON; p-value: NaN. SQ SEQUENCE 1653 AA; 183445 MW; 4A75D99758A0D37F CRC64; MEFCPTCGNL LRYEGGGNSR FFCSTCPYVA YIQRQVEIKK KQLLVKKSIE AVVTKDDIPT AAETEAPCPR CGHDKAYFKS MQIRSADEPE SRFYRCLNKK MSKQRKKADL ATVLRKSWYH LRLSVRHPTR VPTWDAIVLT AASPEQAELY DWQLRRAKRM GRIASSTVTL AVPDPDGKRI GSGAATLNAI YALARHYEKL GFDLGPEMEV ANGACKWVRF ISAKHVLMLH AGGDSKRVPW ANPMGKVFLP LPYLAADDPD GPVPLLFDHI LAIASCARQA FQDQGGLFIM TGDVLPCFDA FKMTLPEDAA SIVTVPITLD IASNHGVIVT SKSESLAESY TVSLVNDLLQ KPTVEDLVKK DAILHDGRTL LDTGIISARG RAWSDLVALG CSCQPMILEL IGSKKEMSLY EDLVAAWVPS RHDWLRTRPL GELLVNSLGR QKMYSYCTYD LQFLHFGTSS EVLDHLSGDA SGIVGRRHLC SIPATTVSDI AASSVILSSE IAPGVSIGED SLIYDSTVSG AVQIGSQSIV VGIHIPSEDL GTPESFRFML PDRHCLWEVP LVGHKGRVIV YCGLHDNPKN SIHKDGTFCG KPLEKVLFDL GIEESDLWSS YVAQDRCLWN AKLFPILTYS EMLKLASWLM GLDDSRNKEK IKLWRSSQRV SLEELHGSIN FPEMCNGSSN HQADLAGGIA KACMNYGMLG RNLSQLCHEI LQKESLGLEI CKNFLDQCPK FQEQNSKILP KSRAYQVEVD LLRACGDEAK AIELEHKVWG AVAEETASAV RYGFREHLLE SSGKSHSENH ISHPDRVFQP RRTKVELPVR VDFVGGWSDT PPWSLERAGY VLNMAITLEG SLPIGTIIET TNQMGISIQD DAGNELHIED PISIKTPFEV NDPFRLVKSA LLVTGIVQEN FVDSTGLAIK TWANVPRGSG LGTSSILAAA VVKGLLQISN GDESNENIAR LVLVLEQLMG TGGGWQDQIG GLYPGIKFTS SFPGIPMRLQ VVPLLASPQL ISELEQRLLV VFTGQVRLAH QVLHKVVTRY LQRDNLLISS IKRLTELAKS GREALMNCEV DEVGDIMSEA WRLHQELDPY CSNEFVDKLF EFSQPYSSGF KLVGAVQRPP SCEVTVLPLP GIKVKRPRKI STLVAFGFGD NAVKRLCNGA YTCNAVSCIF HKQGKKKLCA ERERRRKMGL LTNKIEREEL KPGDHIYTYR AIFAYSHHGI FVGGSKVVHF RPEHNPMDSS TSSISSSSSE DICSIFPDCG FRQPDSGVVL SCLDCFLKNG SLYCFEYGVS PSVFLTKVRG GTCTTAQSDT TDSVIHRAMY LLQNGFGNYD IFKNNCEDFA LYCKTGLLIM DKLGVGRSGQ ASSIVGAPLA ALLSSPFKLL IPSPIGVATV TAGMYCMSRY ATDIGVRSDV IKVSVEDLAL NLDVKTIEQG EEEEEDEEED SDTDYVRRRD SIRSDLIEEE MANLYVKAVP PPDMNRNTEW FMYPGVWTTY MLILFFGWLV VLSVSGCSPG MAWTVVNLAH FVVTYHSFHW MKGTPFADDQ GIYNGLTWWE QMDNGQQLTR NRKFLTLVPV VLYLIASHTT DYRHPWLFLN TLAVMVLVVA KFPNMHKPTR VVNPAVSDAY QAKQVIETVS FCVPNSEAFT EETVNLVPWG QRV // ID NC003070_16 HYPOTHETICAL; PRT; 300 AA. AC NC003070_16; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[100683...101577, 101679...101686]; Length: DE 903. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 298 FIRST EXON; p-value: NaN. FT GENSCAN 299 299 AA on splice site: g/aa -> E. FT GENSCAN 300 300 LAST EXON; p-value: NaN. SQ SEQUENCE 300 AA; 34200 MW; 8AD99D448F1C5B67 CRC64; MGAAEARALW QRTASRCFVV HEDAKMAPRL ACCQHQQSSS GNTEKNSFSS GSFGDSSDFS CDTKWWLKGS TGFDEEVTNS FLEDTKCKKL HEFVDLIGIR EEEDYSFISK KADATTPWWR STTDKDELAL MVATKSVDHN IQNCDLPPPQ KLHKSIHSSS GEKGFKTAVK SPWKQGVWKD RFERSLSYNG STESKNTSPM SSPRSDDLSK GQLLEALRHS QTRAREAERA AREACAEKDR VITILLKQAS QMLAYKQWLK LLEMEALYLQ MKKEEEQEEQ VKGMNLKKRK QRGEKKKKES // ID NC003070_17 HYPOTHETICAL; PRT; 187 AA. AC NC003070_17; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[105294...104731]; Length: 564. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 187 SINGLE EXON; p-value: NaN. SQ SEQUENCE 187 AA; 20452 MW; 81E6CEBB321444D1 CRC64; MKLSSPPVTN NEPTATASAV KSCGGGGKET SSSTTRHPVY HGVRKRRWGK WVSEIREPRK KSRIWLGSFP VPEMAAKAYD VAAFCLKGRK AQLNFPEEIE DLPRPSTCTP RDIQVAAAKA ANAVKIIKMG DDDVAGIDDG DDFWEGIELP ELMMSGGGWS PEPFVAGDDA TWLVDGDLYQ YQFMACL // ID NC003070_18 HYPOTHETICAL; PRT; 1093 AA. AC NC003070_18; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[109595...111359, 112306...113195, DE 113279...113905]; Length: 3282. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 588 FIRST EXON; p-value: NaN. FT GENSCAN 589 589 AA on splice site: g/cc -> A. FT GENSCAN 590 885 INTERNAL EXON; p-value: NaN. FT GENSCAN 886 1093 LAST EXON; p-value: NaN. SQ SEQUENCE 1093 AA; 123057 MW; AC19F93FDE4D43F1 CRC64; MNIGRLVWNE DDKAIVASLL GKRALDYLLS NSVSNANLLM TLGSDENLQN KLSDLVERPN ASNFSWNYAI FWQISRSKAG DLVLCWGDGY CREPKEGEKS EIVRILSMGR EEETHQTMRK RVLQKLHDLF GGSEEENCAL GLDRVTDTEM FLLSSMYFSF PRGEGGPGKC FASAKPVWLS DVVNSGSDYC VRSFLAKSAG IQTVVLVPTD LGVVELGSTS CLPESEDSIL SIRSLFTSSL PPVRAVALPV TVAEKIDDNR TKIFGKDLHN SGFLQHHQHH QQQQQQPPQQ QQHRQFREKL TVRKMDDRAP KRLDAYPNNG NRFMFSNPGT NNNTLLSPTW VQPENYTRPI NVKEVPSTDE FKFLPLQQSS QRLLPPAQMQ IDFSAASSRA SENNSDGEGG GEWADAVGAD ESGNNRPRKR GRRPANGRAE ALNHVEAERQ RREKLNQRFY ALRSVVPNIS KMDKASLLGD AVSYINELHA KLKVMEAERE RLGYSSNPPI SLDSDINVQT SGEDVTVRIN CPLESHPASR IFHAFEESKV EVINSNLEVS QDTVLHTFVV KSEELTKEKL ISALSREQTN SVQSRTSSAS LFAVLILNVL LWRWLKASAC KAQRLPPGPP RLPILGNLLQ LGPLPHRDLA SLCDKYGPLV YLRLGNVDAI TTNDPDTIRE ILLRQDDVFS SRPKTLAAVH LAYGCGDVAL APMGPHWKRM RRICMEHLLT TKRLESFTTQ RAEEARYLIR DVFKRSETGK PINLKEVLGA FSMNNVTRML LGKQFFGPGS LVSPKEAQEF LHITHKLFWL LGVIYLGDYL PFWRWVDPSG CEKEMRDVEK RVDEFHTKII DEHRRAKLED EDKNGDMDFV DVLLSLPGEN GKAHMEDVEI KALIQDMIAA ATDTSAVTNE WAMAEAIKQP RVMRKIQEEL DNVVGSNRMV DESDLVHLNY LRCVVRETFR MHPAGPFLIP HESVRATTIN GYYIPAKTRV FINTHGLGRN TKIWDDVEDF RPERHWPVEG SGRVEISHGP DFKILPFSAG KRKCPGAPLG VTMVLMALAR LFHCFEWSSP GNIDTVEVYG MTMPKAKPLR AIAKPRLAAH LYT // ID NC003070_19 HYPOTHETICAL; PRT; 658 AA. AC NC003070_19; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[114622...115242, 115961...116027, DE 117234...118522]; Length: 1977. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 207 FIRST EXON; p-value: NaN. FT GENSCAN 208 229 INTERNAL EXON; p-value: NaN. FT GENSCAN 230 230 AA on splice site: a/aa -> K. FT GENSCAN 231 658 LAST EXON; p-value: NaN. SQ SEQUENCE 658 AA; 70726 MW; 107C6978127E0E67 CRC64; MQSIFGQEPS PDGPGTMDFS ELKSSKIEPL RSKNIDFRQQ IEYHKSTHSS KNDSQAIEQY AKVASDMSKL THVGIAGEAQ MVDVSSKDNS KRTALACCKV ILGKRVFDLV LANQMGKGDV LGVAKIAGIN GAKQTSSLIP LCHNIALTHV RVDLRLNPED FSVDIEGEAS CTGKTGVEME AMTAVSVAGL TVYDMCKAAS KDISITDHYG LSEPVKPAEK RSCQRTNRSK SEFESGSDSE SSSSITLNLD HIDALSSNKT PDELFSSRLQ RDSRRVKSIA TLAAQIPGRN VTHAPRPGGF SSSVVSGLSQ GSGEYFTRLG VGTPARYVYM VLDTGSDIVW LQCAPCRRCY SQSDPIFDPR KSKTYATIPC SSPHCRRLDS AGCNTRRKTC LYQVSYGDGS FTVGDFSTET LTFRRNRVKG VALGCGHDNE GLFVGAAGLL GLGKGKLSFP GQTGHRFNQK FSYCLVDRSA SSKPSSVVFG NAAVSRIARF TPLLSNPKLD TFYYVGLLGI SVGGTRVPGV TASLFKLDQI GNGGVIIDSG TSVTRLIRPA YIAMRDAFRV GAKTLKRAPD FSLFDTCFDL SNMNEVKVPT VVLHFRGADV SLPATNYLIP VDTNGKFCFA FAGTMGGLSI IGNIQQQGFR VVYDLASSRV GFAPGGCA // ID NC003070_20 HYPOTHETICAL; PRT; 283 AA. AC NC003070_20; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[119429...119626, 120293...120946]; Length: DE 852. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 66 FIRST EXON; p-value: NaN. FT GENSCAN 67 283 LAST EXON; p-value: NaN. SQ SEQUENCE 283 AA; 31471 MW; D93CA19623625088 CRC64; MGKPRSVSPT IFLLVILLVT AGEKTEAKPW WKKVGDSIGN IAGGIRNAVG NYKPVEMGLS LKGGFCLWSG TAQINHEDSS TKPSVKNSPS ATSTRLLLSP PSFTGNRFSF RFRWRRIRRR NRVNRASREF LIAHNLVRAR VGEPPFQWDG RLAAYARTWA NQRVGDCRLV HSNGPYGENI FWAGKNNWSP RDIVNVWADE DKFYDVKGNT CEPQHMCGHY TQIVWRDSTK VGCASVDCSN GGVYAICVYN PPGNYEGENP FGSYDDQIGL ARDDPPAVIG GMA // ID NC003070_21 HYPOTHETICAL; PRT; 1767 AA. AC NC003070_21; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[130099...130039, 129914...129853, DE 129060...128968, 128669...128600, 128562...128479, 128312...128028, DE 127935...127868, 127782...127651, 127573...127532, 127453...126935, DE 126840...126686, 126589...126076, 125843...125742, 125571...125329, DE 125226...125134, 125022...124921, 124818...124714, 124632...124499, DE 124394...124211, 124123...123992, 123897...123785, 123669...123579, DE 123501...121582]; Length: 5304. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 20 FIRST EXON; p-value: NaN. FT GENSCAN 21 21 AA on splice site: g/tt -> V. FT GENSCAN 22 41 INTERNAL EXON; p-value: NaN. FT GENSCAN 42 72 INTERNAL EXON; p-value: NaN. FT GENSCAN 73 95 INTERNAL EXON; p-value: NaN. FT GENSCAN 96 96 AA on splice site: g/at -> D. FT GENSCAN 97 123 INTERNAL EXON; p-value: NaN. FT GENSCAN 124 124 AA on splice site: g/ag -> E. FT GENSCAN 125 218 INTERNAL EXON; p-value: NaN. FT GENSCAN 219 219 AA on splice site: t/at -> Y. FT GENSCAN 220 241 INTERNAL EXON; p-value: NaN. FT GENSCAN 242 285 INTERNAL EXON; p-value: NaN. FT GENSCAN 286 299 INTERNAL EXON; p-value: NaN. FT GENSCAN 300 472 INTERNAL EXON; p-value: NaN. FT GENSCAN 473 523 INTERNAL EXON; p-value: NaN. FT GENSCAN 524 524 AA on splice site: ag/t -> S. FT GENSCAN 525 695 INTERNAL EXON; p-value: NaN. FT GENSCAN 696 729 INTERNAL EXON; p-value: NaN. FT GENSCAN 730 810 INTERNAL EXON; p-value: NaN. FT GENSCAN 811 841 INTERNAL EXON; p-value: NaN. FT GENSCAN 842 875 INTERNAL EXON; p-value: NaN. FT GENSCAN 876 910 INTERNAL EXON; p-value: NaN. FT GENSCAN 911 954 INTERNAL EXON; p-value: NaN. FT GENSCAN 955 955 AA on splice site: aa/g -> K. FT GENSCAN 956 1016 INTERNAL EXON; p-value: NaN. FT GENSCAN 1017 1060 INTERNAL EXON; p-value: NaN. FT GENSCAN 1061 1097 INTERNAL EXON; p-value: NaN. FT GENSCAN 1098 1098 AA on splice site: ag/t -> S. FT GENSCAN 1099 1128 INTERNAL EXON; p-value: NaN. FT GENSCAN 1129 1767 LAST EXON; p-value: NaN. SQ SEQUENCE 1767 AA; 195378 MW; EDCC3A5DB3AEC667 CRC64; MAPKNNRGKT KGDKKKKEEK VLPVIVDVIV NLPDETEAIL KGISTDRIID VRRLLSVNFD TCHVTNYSLS HEITDILPEL PYSKSSHKEY KKIDCDWLQI RGSRLKDTVD VSALKPCVLT LTEEDYNEGT AVAHVRRLLD IVACTTCFGP SPEKSDSVKS AQVKGGGKNS KQSDTSPPPS PASKDTVVDE AGETSHSFPK LGSFYEFFSL AHLTPPLQYI RLATKRETED IAKEDHLLSI DVKLCNGKLV HIEGCRKGFY SIGKQRIICH NLVDLLRQIS RAFDNAYSDL LKAFSERNKF GNLPYGFRAN TWLIPPTAAQ SPAAFPPLPV EDERWGGDGG GQGRDGSYDL VPWSNEFAFI ASMPCKTAEE RQVRDRKVFL LHNLFVDVAT FRAIKAVQKV MAEPVLAEED SEVLYSETVR DLTVTVTRDT SNASSKVDTK IDGIQATGLD KKKLMERNLL KGLTADENTA AHDVATLGTI SLKYCGYIAV VKLEKESEEL SPPSQIVDLL EQPEGGANAL NINSLRFLLH KSSPEQNKKT PQQHDDELTS SREFVSKMLE ESIAKLEGEE IDRDSIMRWE LGACWIQHLQ DQKNTEKDKK QTGEKSKNEL KVEGLGKPLK SLNSSKKKTD VSSPKTPQTA LSSQVDAVSS EADTAASLQS DAEKNAQENV LILKNLLSDA AFTRLKESDT GLHHKVADFG SLELSPVDGR TLTDFMHTRG LRMRSLGYVA VISAVATDTD KIAIKVAAAL NMMLGIPENV AATPHNPWNV HPLIFRWLEK FLKKRYDYDL NAFSYKDLRK FAILRGLCHK VGIELIPRDF DMDSPAPFRK TDVVSLVPVH KQAACSSADG RQLLESSKTA LDKGKLEDAV TYGTKALAKL VAVCGPYHRM TAGAYSLLAV VLYHTGDFNQ ATIYQQKALD INERELGLDH PDTMKSYGDL AVFYYRLQHT ELALKYVKRA LYLLHLTCGP SHPNTAATYI NVAMMEEGLG NVHVALRYLH KALKCNQRLL GPDHIQTAAS YHAIAIALSL MEAYHLSVQH EQTTLRILRA KLGPDDLRTQ DAAAWLEYFE SKAFEQQEAA RNGTPKPDAS IASKGHLSVS DLLDYINPSH NAKGKESVAA KRKNYILKLK EKSKQSNVSE HLVEIPREKQ KEMSEEDTEE TGSEEGKSSE ENHETILAPV EEPPSPPVIE DATMDNSNPI TSSDVSTEPQ HPDGSEDGWQ PVQRPRSAGS YGRRMKQRRA SIGKVYTYQK KNVEADIDNP LFQNATQQND KYYILKKRTA SYSSYADHHS PGLTTQGTKF GRKIVKTLAY RVKSTQPSSG NAKTAGETSE EDGLKTDASS VEPPTLSSTV QSEAYHTKNS VVSLGKSPSY KEVALAPPGS IAKYQVWVPQ AEVSDKQEDD EMEKKTEQGT SMELTRDEQM ITGLEEEVKK EISADPESNI TQGEEEIKVE LQPSEGVLGG SHINENDESG GGIQVEEQVE VELINDGVTD MIHSTREQQV IDQLAADSED LKAKLSISTT DSGDASRGLL PNKKLSASAA PFNPSSPPSI IRPTPIGMNI GPSWPVNMTL HHGPPPPYPS PPTTPNLMQP MSFVYPPPYS QSVPTSTYPV TSGPFHPNQF PWQLNVSDFV PRTVWPGCHP VEFPPPHMIT EPIAATVLEP TVILPTDIDT SGVEETKEGT QDVAVADEVM DSVNHVNNAV ARSETENGNR KSEEGEKTFS ILLRGRRNRK QTLRMPISLL NRPYDSQPFK LGYSRVIRDS EAPKSVA // ID NC003070_22 HYPOTHETICAL; PRT; 64 AA. AC NC003070_22; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[131594...131656, 131820...131951]; Length: DE 195. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 21 FIRST EXON; p-value: NaN. FT GENSCAN 22 64 LAST EXON; p-value: NaN. SQ SEQUENCE 64 AA; 7766 MW; A6684804BF050E7E CRC64; MKWINRFFRK QVNLTIMKSD KERDLKKGNT ENRHTKETLI LYRTKPKGNI GWFALISQGR RENE // ID NC003070_23 HYPOTHETICAL; PRT; 766 AA. AC NC003070_23; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[136266...136206, 135866...135841, DE 135807...135708, 135251...135188, 135104...134582, 134501...134289, DE 134206...133911, 133752...133641, 133539...133303, 133218...132881, DE 132744...132414]; Length: 2301. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 20 FIRST EXON; p-value: NaN. FT GENSCAN 21 21 AA on splice site: g/ga -> G. FT GENSCAN 22 29 INTERNAL EXON; p-value: NaN. FT GENSCAN 30 62 INTERNAL EXON; p-value: NaN. FT GENSCAN 63 63 AA on splice site: g/cg -> A. FT GENSCAN 64 83 INTERNAL EXON; p-value: NaN. FT GENSCAN 84 84 AA on splice site: ag/g -> R. FT GENSCAN 85 258 INTERNAL EXON; p-value: NaN. FT GENSCAN 259 329 INTERNAL EXON; p-value: NaN. FT GENSCAN 330 427 INTERNAL EXON; p-value: NaN. FT GENSCAN 428 428 AA on splice site: ag/t -> S. FT GENSCAN 429 465 INTERNAL EXON; p-value: NaN. FT GENSCAN 466 544 INTERNAL EXON; p-value: NaN. FT GENSCAN 545 656 INTERNAL EXON; p-value: NaN. FT GENSCAN 657 657 AA on splice site: aa/g -> K. FT GENSCAN 658 766 LAST EXON; p-value: NaN. SQ SEQUENCE 766 AA; 88579 MW; 01A9C2D941FF597D CRC64; MEIKHKEEYV TLKKNLKIDA GNNIQREIQR RRDQLLELKK TPFTDFRLLA ESYYLAVTYF VSAKDQLKYT RVWLMAFSHD NRVRFKDEGK PLSSEYGYGR KARPSLDRVF KNVKWGFKKP LSFPSHKDPD HKETSSVTRK NIINPQDSFL QNWNKIFLFA CVVALAIDPL FFYIPIVDSA RHCLTLDSKL EIAASLLRTL IDAFYIIHIV FQFRTAYIAP SSRVFGRGEL VDDAKAIALK YLSSYFIIDL LSILPLPQIV VLAVIPSVNQ PVSLLTKDYL KFSIIAQYVP RILRMYPLYT EVTRTSGIVT ETAWAGAAWN LSLYMLASHV FGALWYLISV EREDRCWQEA CEKTKGCNMK FLYCENDRNV SNNFLTTSCP FLDPGDITNS TIFNFGIFTD ALKSGVVESH DFWKKFFYCF WWGLRNLSAL GQNLQTSKFV GEIIFAISIC ISGLVLFALL IGNMQKYLES TTVREEEMRV RKRDAEQWMS HRMLPEDLRK RIRRYEQYRW QETRGVEEET LLRNLPKDLR RDIKRHLCLD LLKKVPLFEI MDEQLLDAVC DRLRPVLYTE NSYVIREGDP VGEMLFVMRG RLVSATTNGG RSGFFNAVNL KASDFCGEDL LPWALDPQSS SHFPISTRTV QALTEVEAFA LTAEDLKFYS VQWRTWSVSF IQAAWRRYCR RKLAKSLRDE EDRLREALAS QDKEHNAATV SSSLSLGGAL YASRFASNAL HNLRHNISNL PPRYTLPLLP QKPTEPDFTA NHTTDP // ID NC003070_24 HYPOTHETICAL; PRT; 451 AA. AC NC003070_24; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[136732...136816, 136935...137758, DE 138494...138541, 138958...139114, 139185...139345, 139488...139568]; DE Length: 1356. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 28 FIRST EXON; p-value: NaN. FT GENSCAN 29 29 AA on splice site: g/tt -> V. FT GENSCAN 30 303 INTERNAL EXON; p-value: NaN. FT GENSCAN 304 319 INTERNAL EXON; p-value: NaN. FT GENSCAN 320 371 INTERNAL EXON; p-value: NaN. FT GENSCAN 372 372 AA on splice site: g/tt -> V. FT GENSCAN 373 425 INTERNAL EXON; p-value: NaN. FT GENSCAN 426 451 LAST EXON; p-value: NaN. SQ SEQUENCE 451 AA; 50548 MW; 3551F5526DB2951D CRC64; MSDSGEPKPS QQEEPLPQPA AQETQSQQVC TFFKKPTKSK NIRKRTIDAD EEDGDSKSES SILQNLKKVA KPDSKLYFSS GPSKSSTTTS GAPERSVFHY DSSKEIQVQN DSGATATLET ETDFNQDARA IRERVLKKAD EALKGNKKKA SDEKLYKGIH GYTDHKAGFR REQTISSEKA GGSHGPLRAS AHIRVSARFD YQPDICKDYK ETGYCGYGDS CKFLHDRGDY KPGWQIEKEW EEAEKVRKRN KAMGVEDEDD EADKDSDEDE NALPFACFIC REPFVDPVVT KCKHYFCEHC ALKIFQVLND EEGEGCGLWH IYLTTLDMLG RIGYNTVRSI LPDDPQQAAS SASPSTGSFL WESLLASLPA AVISTDADDM DSGAPQEDKC GEMGEPALLC EQCRFTVQGF ENFSTHLKSE EHAHESQYYR NEDDETDGDD YVRDSEDDEE E // ID NC003070_25 HYPOTHETICAL; PRT; 359 AA. AC NC003070_25; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[142141...142281, 142424...142639, DE 142711...142804, 143659...143801, 143912...143966, 144002...144109, DE 144447...144559, 144927...145004, 145132...145207, 145345...145400]; DE Length: 1080. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 47 FIRST EXON; p-value: NaN. FT GENSCAN 48 119 INTERNAL EXON; p-value: NaN. FT GENSCAN 120 150 INTERNAL EXON; p-value: NaN. FT GENSCAN 151 151 AA on splice site: g/cg -> A. FT GENSCAN 152 198 INTERNAL EXON; p-value: NaN. FT GENSCAN 199 216 INTERNAL EXON; p-value: NaN. FT GENSCAN 217 217 AA on splice site: g/gg -> G. FT GENSCAN 218 252 INTERNAL EXON; p-value: NaN. FT GENSCAN 253 253 AA on splice site: a/gc -> S. FT GENSCAN 254 290 INTERNAL EXON; p-value: NaN. FT GENSCAN 291 316 INTERNAL EXON; p-value: NaN. FT GENSCAN 317 341 INTERNAL EXON; p-value: NaN. FT GENSCAN 342 342 AA on splice site: a/tg -> M. FT GENSCAN 343 359 LAST EXON; p-value: NaN. SQ SEQUENCE 359 AA; 40146 MW; 8F0D87F3B157818A CRC64; MDGVEGGTAM YGGLETVQYV RTHHQHLCRE NQCTSALVKH IKAPLHLVWS LVRRFDQPQK YKPFVSRCTV IGDPEIGSLR EVNVKSGLPA TTSTERLELL DDEEHILGIK IIGGDHRLKN YSSILTVHPE IIEGRAGTMV IESFVVDVPQ AGNFSAGDFH NRSFRRFSFP ILRFLSLSHG AISPAVKING ENQASRYQIS NLKFVDAAGA SSSQAAGFPM FLPFIVMIKF VYLSKLKTPT RRGGEGGDNT QQSSQKKSYR YRPGTVALKE IRHFQKQTNL LIPAASFIRE VRSITHMLAP PQINRWTAEA LVALQEAAED YLVGLFSDSM LCAIHARRVT LMRKDFELAR RLGGKGRPW // ID NC003070_26 HYPOTHETICAL; PRT; 469 AA. AC NC003070_26; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[149728...148319]; Length: 1410. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 469 SINGLE EXON; p-value: NaN. SQ SEQUENCE 469 AA; 51524 MW; 6C453D51BF357B2E CRC64; MPSPGMGHLI PFVELAKRLV QHDCFTVTMI ISGETSPSKA QRSVLNSLPS SIASVFLPPA DLSDVPSTAR IETRAMLTMT RSNPALRELF GSLSTKKSLP AVLVVDMFGA DAFDVAVDFH VSPYIFYASN ANVLSFFLHL PKLDKTVSCE FRYLTEPLKI PGCVPITGKD FLDTVQDRND DAYKLLLHNT KRYKEAKGIL VNSFVDLESN AIKALQEPAP DKPTVYPIGP LVNTSSSNVN LEDKFGCLSW LDNQPFGSVL YISFGSGGTL TCEQFNELAI GLAESGKRFI WVIRSPSEIV SSSYFNPHSE TDPFSFLPIG FLDRTKEKGL VVPSWAPQVQ ILAHPSTCGF LTHCGWNSTL ESIVNGVPLI AWPLFAEQKM NTLLLVEDVG AALRIHAGED GIVRREEVVR VVKALMEGEE GKAIGNKVKE LKEGVVRVLG DDGLSSKSFG EVLLKWKTHQ RDINQETSH // ID NC003070_27 HYPOTHETICAL; PRT; 228 AA. AC NC003070_27; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[152149...152086, 152002...151932, DE 151731...151393, 150400...150188]; Length: 687. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 21 FIRST EXON; p-value: NaN. FT GENSCAN 22 22 AA on splice site: g/aa -> E. FT GENSCAN 23 45 INTERNAL EXON; p-value: NaN. FT GENSCAN 46 158 INTERNAL EXON; p-value: NaN. FT GENSCAN 159 228 LAST EXON; p-value: NaN. SQ SEQUENCE 228 AA; 25152 MW; 866A61CB14AF76C3 CRC64; MEREEPIARN EVDHETPTKP DESASASSPR RSEETLNQST QINQTGPRSV LHLQSQESHL SNSTTTHASS PSVNVTPKPD LSSFSNTTAT QEPSPSAGLQ IRTIEKDRSP GSAFPRHKRL PVFDEVCSGA VFPWYGPQEP DLPSSSGIGT MSANSHDDLE KLMDLFEGNF VKIFYWASTT NHYVDSTTRC GEDSHQVRSS PQNYGHNNQR TQLWSKPSPS SHKAFAQG // ID NC003070_28 HYPOTHETICAL; PRT; 361 AA. AC NC003070_28; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[153113...154198]; Length: 1086. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 361 SINGLE EXON; p-value: NaN. SQ SEQUENCE 361 AA; 41256 MW; 7D85508941A3ADDC CRC64; MTHERGYDLA SQVLNICNVH HKKLFCHITY RHLLVLSSDD NGCVILKKVI TIADDFLKDE FLDLIAQHAH SLSMHDLGIS LIQHVLELDF TKKTTQDDKR LHELMAEFDE VLSTSVTADV DKLHKLASKL MLDSDLFFEF VITRRGSLMI QIILGKSEEV DQVILAGVKQ RFIDVTTNFY GYRIMIQTIK VFKKRGDLKV YDQILRLIGV HALYLTKDPD MGNKTFQHAI NLHHQDCTTF IACGLQSHYI ELSFLKHGSK IVEMLIDDRI SMVPLVLLMM EIVKCDEDTL VRLATDEYGN NILKKFLALA KEHKEDFFGD LVDKLNPLLD SLRGTLGENI VAIIDSETEM VKDRIVSQGN N // ID NC003070_29 HYPOTHETICAL; PRT; 481 AA. AC NC003070_29; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[156011...154566]; Length: 1446. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 481 SINGLE EXON; p-value: NaN. SQ SEQUENCE 481 AA; 52785 MW; CA775646EF05364B CRC64; MADGNTPHVA IIPSPGIGHL IPLVELAKRL LDNHGFTVTF IIPGDSPPSK AQRSVLNSLP SSIASVFLPP ADLSDVPSTA RIETRISLTV TRSNPALREL FGSLSAEKRL PAVLVVDLFG TDAFDVAAEF HVSPYIFYAS NANVLTFLLH LPKLDETVSC EFRELTEPVI IPGCVPITGK DFVDPCQDRK DESYKWLLHN VKRFKEAEGI LVNSFVDLEP NTIKIVQEPA PDKPPVYLIG PLVNSGSHDA DVNDEYKCLN WLDNQPFGSV LYVSFGSGGT LTFEQFIELA LGLAESGKRF LWVIRSPSGI ASSSYFNPQS RNDPFSFLPQ GFLDRTKEKG LVVGSWAPQA QILTHTSIGG FLTHCGWNSS LESIVNGVPL IAWPLYAEQK MNALLLVDVG AALRARLGED GVVGREEVAR VVKGLIEGEE GNAVRKKMKE LKEGSVRVLR DDGFSTKSLN EVSLKWKAHQ RKIDQEQESF L // ID NC003070_30 HYPOTHETICAL; PRT; 939 AA. AC NC003070_30; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[162219...160557, 160292...160079, DE 158283...158254, 157937...157829, 157756...156953]; Length: 2820. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 554 FIRST EXON; p-value: NaN. FT GENSCAN 555 555 AA on splice site: g/ag -> E. FT GENSCAN 556 625 INTERNAL EXON; p-value: NaN. FT GENSCAN 626 626 AA on splice site: ac/c -> T. FT GENSCAN 627 635 INTERNAL EXON; p-value: NaN. FT GENSCAN 636 636 AA on splice site: tg/g -> W. FT GENSCAN 637 672 INTERNAL EXON; p-value: NaN. FT GENSCAN 673 939 LAST EXON; p-value: NaN. SQ SEQUENCE 939 AA; 106755 MW; D9BB8F9D6D00B75B CRC64; MRTSGDVYGA DTGSSRSDED RGFKEDLNES ATSPMRNRLD DSNSRPGSQR FVKSSRKEEE TDSDSSSSKN TTTRNNPIQY TDKQQAELLR KLDSIKDHLL RGGGNNATVV DQPPMGFHAH HGPPPSYYNP YPEPFPYGMY PTASNQPHVP AYRDPYGFPV HRIPQNFYQG PSHYPNQMPP RPPYPQGQYV DIGSDILESQ LQDPRFFPGT PSRYGDVPFS PALHHGEKVG PFSPHGGVHT RWPSEIDSEM GGAFARGYVQ QAVSDTDSRR CHPLAGGAPF IACHSCFELL YVPKKKLLGQ ERQQKMQCGA CSEVITFRVV DKKLVFSSSA LGETTNRVSV EVEDRSSPIP VVDDYPLNDE EPRIHQETKI VHAVSPSDHS NDEDRSSISS EPRKQVVKSV RSRAQGAKVP PPPPPEKSNL LELFEYSNVN RAAITYGMAQ LGYYKQESYT KQDSLKSESV ATETDVSYNE YYTNTEESED SRISKASKEG RRPRNRKQSS EHSFAEVTNN ISSNDQNNEQ LEVWVNGYLI PEDLVISAEK QAGPVQAGKY WYVWEEFSRP MPDNCGAGNT SVFVNGRELH ERDLELLSSR GLPRGKNRSY IIDIAGRVLD GDSGEELKSL GRLAPTKHHN HCFTEWFNPE QFLDNMRNKW LAFIGDSISR NHVQSLLCIL SQVEEVEDIF HDKEYKSRIW RFPSYNFTLS VIWSPFLVKA ETFENGVPFS DIRVHLDKLD QKWTDQYINF DYVVISGGKW FLKTTIFHEN NTVTGCHYCQ GKNNMTELGY LYSYRKVLHL VLDFVAEPNH KAQVLFRTTT PDHFENGEWD SGGFCNRTMP FTEGSEGEMK SEDVSMRDIE LEEFYKTTTT QQEGSNSNIV LLDTTSMSLL RPDGHPGPYR YPNPFAGLKN KELNQVQNDC LHWCLPGPID SWNDLMVEVM LNRERQRRE // ID NC003070_31 HYPOTHETICAL; PRT; 965 AA. AC NC003070_31; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[167798...166941, 166470...166416, DE 166017...164203, 164104...163935]; Length: 2898. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 286 FIRST EXON; p-value: NaN. FT GENSCAN 287 304 INTERNAL EXON; p-value: NaN. FT GENSCAN 305 305 AA on splice site: t/cc -> S. FT GENSCAN 306 909 INTERNAL EXON; p-value: NaN. FT GENSCAN 910 910 AA on splice site: g/cc -> A. FT GENSCAN 911 965 LAST EXON; p-value: NaN. SQ SEQUENCE 965 AA; 109065 MW; B05D44140C56FD46 CRC64; MDQKRPTIAG DEEMAAEKPL QPALQKPPGF RDQQNQPSAP PSGTATLPRR RPRPIHPADK KRRCSFCRVF CCCVCILFAV ILLLILIAVA VFFLWYSPKL PVVRLASFKI SNFNFSDGKS DDGWSFLSAD TTSVLDFRNP NGKLTFYYGD TDVAVILGEK DFETNLESTK VKGFIEKPGN RTAVIVPTTV RKRQVDDPTA KRLQVELKSK KLLVTVTAKT KVGLAVGSRK IVTVGVSLRC GGVILQTLDS KMAQCTIKML KWYVETTWSK LDDLVDSKLN KSYYKKVGEA VSKPVALNAT AFVSSYESIS VSMRSNLRFK EKNTKWKILE QPLRELLWVV REGEAYVRMS LEPKLGFWAK AIVLHSNRDC TELHIHNLLS CLPIIVEAIE TASEVSGWDE EEMSKKRLVH SNKYMKQWND SQMFTWKFGR EYLVTEDFCN RFESAWTEDR WILIKELQEK KQSGSSKHER KMADFLLKHL GDGNESPKLF PSSLLDNTKD YQVKKRLGNG SQYKEITWLG ESFALRHFFG DIDALLPQIT PLLSLSHPNI VYYLCGFTDE EKKECFLVME LMRKTLGMHI KEVCGPRKKN TLSLPVAVDL MLQIALGMEY LHSKRIYHGE LNPSNILVKP RSNQSGDGYL LGKIFGFGLN SVKGFSSKSA SLTSQNENFP FIWYSPEVLE EQEQSGTAGS LKYSDKSDVY SFGMVSFELL TGKVPFEDSH LQGDKMSRNI RAGERPLFPF NSPKFITNLT KRCWHADPNQ RPTFSSISRI LRYIKRFLAL NPECYSSSQQ DPSIAPTVDY CEIETKLLQK LSWESTELTK VSQVPFQMFA YRVVERAKTC EKDNLREPSE SGSEWASCSE DEGGAGSDEQ LSYAKERRLS CSSNDVGMSK KQVSNLLKRA SSLKPIQKPA FGFCSGTTPR GRSRHPPLSP CGGQSMRANS ESQLILISPR IRRSNSGHAS DSELS // ID NC003070_32 HYPOTHETICAL; PRT; 381 AA. AC NC003070_32; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[169115...169257, 169347...169509, DE 169600...169710, 169806...169887, 170037...170162, 170266...170313, DE 170622...170760, 170851...171010, 171618...171718, 171819...171891]; DE Length: 1146. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 47 FIRST EXON; p-value: NaN. FT GENSCAN 48 48 AA on splice site: ag/a -> R. FT GENSCAN 49 102 INTERNAL EXON; p-value: NaN. FT GENSCAN 103 139 INTERNAL EXON; p-value: NaN. FT GENSCAN 140 166 INTERNAL EXON; p-value: NaN. FT GENSCAN 167 167 AA on splice site: g/ag -> E. FT GENSCAN 168 208 INTERNAL EXON; p-value: NaN. FT GENSCAN 209 209 AA on splice site: g/gt -> G. FT GENSCAN 210 224 INTERNAL EXON; p-value: NaN. FT GENSCAN 225 225 AA on splice site: g/at -> D. FT GENSCAN 226 270 INTERNAL EXON; p-value: NaN. FT GENSCAN 271 271 AA on splice site: ag/c -> S. FT GENSCAN 272 324 INTERNAL EXON; p-value: NaN. FT GENSCAN 325 357 INTERNAL EXON; p-value: NaN. FT GENSCAN 358 358 AA on splice site: ag/c -> S. FT GENSCAN 359 381 LAST EXON; p-value: NaN. SQ SEQUENCE 381 AA; 43372 MW; 7DCA535B5BC284BC CRC64; MFTREITAKD VKATEKNRIR YSSKHIKHLP PGTITEFEWK DYCPLGFRLI QELEDINHDE YMKSICNDET LRKLSTSKVG NMFLLSKDDR FLIKILRKSE IKVILEMLPG YFRHIHKYRS TLLSKNYGAH SVKPIGGVKT YFVVMSNILQ SDVFMNKVYD LKGSSQENDL DVNFQLCRQT KLDCELLEDE GIMDYSLMLG LQVKGSCHGS IDELIPVYDS FTSRDSVDSE NSVNIQSVAS ISPSPAQTNA SDSPYESLVS KTNLTNIFQN SSSTNFGMKI PGRARRVGRG ESGSVVGKQS REGGEEWYDV ILYLGIIDIF QDYGNISADY GFCNFKTLIL DIAPDEYYQT YHQIDKRSSQ AGRGNDRRNV SKLESNHKTK N // ID NC003070_33 HYPOTHETICAL; PRT; 151 AA. AC NC003070_33; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[172826...172621, 172544...172295]; Length: DE 456. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 68 FIRST EXON; p-value: NaN. FT GENSCAN 69 69 AA on splice site: ag/g -> R. FT GENSCAN 70 151 LAST EXON; p-value: NaN. SQ SEQUENCE 151 AA; 16544 MW; D50543B647D58011 CRC64; MASLLDKAKD FVADKLTAIP KPEGSVTDVD LKDVNRDSVE YLAKVSVTNP YSHSIPICEI SFTFHSAGRE IGKGKIPDPG SLKAKDMTAL DIPVVVPYSI LFNLARDVGV DWDIDYELQI GLTIDLPVVG EFTIPISSKG EIKLPTFKDF F // ID NC003070_34 HYPOTHETICAL; PRT; 39 AA. AC NC003070_34; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[174270...174265, 173591...173478]; Length: DE 120. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 2 FIRST EXON; p-value: NaN. FT GENSCAN 3 39 LAST EXON; p-value: NaN. SQ SEQUENCE 39 AA; 4641 MW; 8EFC23313037C0AB CRC64; MQLSTNWTRF FQSLSTNHPS TRDMPTVHDF TVIMSKFCF // ID NC003070_35 HYPOTHETICAL; PRT; 496 AA. AC NC003070_35; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[175862...176032, 176207...176338, DE 176592...176752, 177025...178051]; Length: 1491. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 57 FIRST EXON; p-value: NaN. FT GENSCAN 58 101 INTERNAL EXON; p-value: NaN. FT GENSCAN 102 154 INTERNAL EXON; p-value: NaN. FT GENSCAN 155 155 AA on splice site: gc/a -> A. FT GENSCAN 156 496 LAST EXON; p-value: NaN. SQ SEQUENCE 496 AA; 55533 MW; 766318AE9B5F1566 CRC64; MGLPGKNKGA VLSKIATNNQ HGENSEYFDG WKAYDKDPFH LSRNPHGIIQ MGLAENQLCL DLIKDWVKEN PEASICTLEG IHQFSDIANF QDYHGLKKFR QAIAHFMGKA RGGRVTFDPE RVVMSGGATG ANETIMFCLA DPGDVFLIPS PYYAAFDRDL RWRTGVEIIP VPCSSSDNFK LTVDAAEWAY KKAQESNKKV KGLILTNPSN PLGTMLDKDT LTNLVRFVTR KNIHLVVDEI YAATVFAGGD FVSVAEVVND VDISEVNVDL IHIVYSLSKD MGLPGFRVGI VYSFNDSVVS CARKMSSFGL VSSQTQLMLA SMLSDDQFVD NFLMESSRRL GIRHKVFTTG IKKADIACLT SNAGLFAWMD LRHLLRDRNS FESEIELWHI IIDRVKLNVS PGSSFRCTEP GWFRICFANM DDDTLHVALG RIQDFVSKNK NKIVEKASEN DQVIQNKSAK KLKWTQTNLR LSFRRLYEDG LSSPGIMSPH SPLLRA // ID NC003070_36 HYPOTHETICAL; PRT; 178 AA. AC NC003070_36; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[183623...183615, 181422...181347, DE 180852...180401]; Length: 537. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 3 FIRST EXON; p-value: NaN. FT GENSCAN 4 28 INTERNAL EXON; p-value: NaN. FT GENSCAN 29 29 AA on splice site: g/ga -> G. FT GENSCAN 30 178 LAST EXON; p-value: NaN. SQ SEQUENCE 178 AA; 19941 MW; 087172B7452CC0F7 CRC64; MRMKIVLKLD LHDDRAKQKA LKTVSTLPGI DSIAMDMKEK KLTVIGTVDP VNVVSKLRKY WPMTDIVLVG PAKEPEKEKK EEPKKEGGGE PPKKEGEAPK EEGKKEGEAP KKEEEKKEGG DKKEGEKKDQ PQAQPQPVVP PPDHVLELVK AYKAYNPHLT TYYYAQSIEE NPNACVIC // ID NC003070_37 HYPOTHETICAL; PRT; 327 AA. AC NC003070_37; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[185260...186009, 186340...186573]; Length: DE 984. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 250 FIRST EXON; p-value: NaN. FT GENSCAN 251 327 LAST EXON; p-value: NaN. SQ SEQUENCE 327 AA; 35927 MW; B7960DBE23A71205 CRC64; MISKDHLHHL DPLGTTKSYH MNTSTVSPPS PASSISLSQS AWLEVRLFYV RIAPCVVENV PDFLTLRHPR RETGASLEVN GVRVPSSQTA SLKLRRDRVD RESSEVTYVS TETVRVTGCV DFEVYDNEDM VLCGNLDRIE GAWNNGTVSD PKTGWGMDCY IAMGNGHVSG PSASVFFQPK FGVSSPSVEV YIAGCCGGVP VILTKTIQAS PRRKVARHVT LDAIPEDEEV GKEQDIGTIG DELARQSKVQ MMESEVDEYD DSDMKMAQRY YPEGMYVDED GQLSWFNAGV RVGVGIGLGM CLGVGIGVGL LMRSYQATTS NLRRRFL // ID NC003070_38 HYPOTHETICAL; PRT; 706 AA. AC NC003070_38; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[187235...187822, 188011...188176, DE 188284...188414, 188529...189320, 189372...189498, 189696...189861, DE 190941...190998, 191967...192059]; Length: 2121. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 196 FIRST EXON; p-value: NaN. FT GENSCAN 197 251 INTERNAL EXON; p-value: NaN. FT GENSCAN 252 252 AA on splice site: g/gg -> G. FT GENSCAN 253 295 INTERNAL EXON; p-value: NaN. FT GENSCAN 296 559 INTERNAL EXON; p-value: NaN. FT GENSCAN 560 601 INTERNAL EXON; p-value: NaN. FT GENSCAN 602 602 AA on splice site: g/aa -> E. FT GENSCAN 603 656 INTERNAL EXON; p-value: NaN. FT GENSCAN 657 657 AA on splice site: tt/a -> L. FT GENSCAN 658 676 INTERNAL EXON; p-value: NaN. FT GENSCAN 677 706 LAST EXON; p-value: NaN. SQ SEQUENCE 706 AA; 78103 MW; 4A09BB648C1BEAB7 CRC64; MSKIRSSATM PHRDQPSPAS PHVVTLNCIE DCALEQDSLA GVAGVEYVPL SRIADGKIES ATAVLLHSLA YLPRAAQRRL RPHQLILCLG SADRAVDSTL AADLGLRLVH VDTSRAEEIA DTVMALILGL LRRTHLLSRH ALSASGWLGS LQPLCRGMRR CRGMVLGIVG RSVSARYLAS RSLAFKMSVL YFDVPEGDEE RIRPSRFPRA ARRMDTLNDL LAASDVISLH CALTNDTVQI LNAECLQHIK PGAFLVNTGS CQLLDDCAVK QLLIDGTIAG CALDGAEGPQ WMEAWVKEMP NVLILPRSAD YSEEVWMEIR EKAISILHSF FLDGVIPSNT VSDEEVEESE ASEEEEQSPS KHEKLAIVES TSRQQGESTL TSTEIVRREA SELKESLSPG QQHVSQNTAV KPEGRRSRSG KKAKKRHSQQ KYMQKTDGSS GLNEESTSRR DDIAMSDTEE VLSSSSRCAS PEDSRSRKTP LEVMQESSPN QLVMSSKKFI GKSSELLKDG YVVALYAKDL SGLHVSRQRT KNGGWFLDTL SNVSKRDPAA QFIIAYRNKI DHYKLVLVHE LVFQDTVGLR SFAAGGKLLQ VALTLESVTS IENGPKLKNP TSWKTEIICV MNVAGGVGRS CGDIGNGRRR WYHTLDRLKR KTESLHLFDR DWKKIKAFVG SKTVIQVLLL MKNLSVNLTS PEFDEQVRAS RLFVTF // ID NC003070_39 HYPOTHETICAL; PRT; 247 AA. AC NC003070_39; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[193662...193351, 193071...192640]; Length: DE 744. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 104 FIRST EXON; p-value: NaN. FT GENSCAN 105 247 LAST EXON; p-value: NaN. SQ SEQUENCE 247 AA; 28030 MW; 13B58391A7FD3631 CRC64; MARKNLGRRK IELVKMTNES NLQVTFSKRR SGLFKKGSEL CTLCDAEIAI IVFSPSGKAY SFGHPNVNKL LDHSLGRVIR HNNTNFAESR TKLRIQMLNE SLTEVMAEKE KEQETKQSIV QNERENKDAE KWWRNSPTEL NLAQSTSMKC DLEALKKEVD EKVAQLHHRN LNFYVGSSSN VAAPAAVSGG NISTNHGFFD QNGNSTSAPT LPFGFNVMNR TPAGYNSYQL QNQEVKQVHP QYWARYY // ID NC003070_40 HYPOTHETICAL; PRT; 472 AA. AC NC003070_40; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[195980...196542, 197115...197219, DE 197300...197422, 197501...197671, 197775...197904, 197974...198183, DE 198267...198383]; Length: 1419. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 187 FIRST EXON; p-value: NaN. FT GENSCAN 188 188 AA on splice site: ag/g -> R. FT GENSCAN 189 222 INTERNAL EXON; p-value: NaN. FT GENSCAN 223 223 AA on splice site: ag/g -> R. FT GENSCAN 224 263 INTERNAL EXON; p-value: NaN. FT GENSCAN 264 264 AA on splice site: gg/a -> G. FT GENSCAN 265 320 INTERNAL EXON; p-value: NaN. FT GENSCAN 321 321 AA on splice site: gg/t -> G. FT GENSCAN 322 364 INTERNAL EXON; p-value: NaN. FT GENSCAN 365 434 INTERNAL EXON; p-value: NaN. FT GENSCAN 435 472 LAST EXON; p-value: NaN. SQ SEQUENCE 472 AA; 52326 MW; FA5E5AED90C5862D CRC64; MSVYDAAFLN TELSKPTSIF GLRLWVVIGI LLGSLIVIAL FLLSLCLTSR RKNRKPRADF ASAAIATPPI SKEIKEIVPA QNQSVPAEIQ VDIGKIEHRV VFSDRVSSGE SRGTASASET ASYSGSGNCG PEVSHLGWGR WYTLRELEAA TNGLCEENVI GEGGYGIVYR GILTDGTKVA VKNLLNNRGQ AEKEFKVEVE VIGRVRHKNL VRLLGYCVEG AYRMLVYDFV DNGNLEQWIH GDVGDVSPLT WDIRMNIILG MAKGLAYLHE GLEPKVVHRD IKSSNILLDR QWNAKVSDFG LAKLLGSESS YVTTRVMGTF GYVAPEYACT GMLNEKSDIY SFGILIMEII TGRNPVDYSR PQGETNLVDW LKSMVGNRRS EEVVDPKIPE PPSSKALKRV LLVALRCVDP DANKRPKMGH IIHMLEAEDL LYRDERRTTR DHGSRERQET AVVAAGSESG ESGSRHHQQK QR // ID NC003070_41 HYPOTHETICAL; PRT; 349 AA. AC NC003070_41; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[200526...201575]; Length: 1050. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 349 SINGLE EXON; p-value: NaN. SQ SEQUENCE 349 AA; 38584 MW; 7C0FF28E4BD05284 CRC64; MARPQDPPRG FFPFGNPFKN LSSKNSVLSS KLLPLLNNFE TNLASSISKL VPKEKSDILT VSWMKQAMES LCETHNGIKT LITDLELPVS DWEDKWVDVY LDISVKLLDL CNAFSSELTR LNQGHLLLQF ALHNLEANSP QNLSKAQSSL DSWKQHIVSK NPRIENCRAI LSSLVQTLNL PKVKNSAKGK VLMRALYGVK VKTLYISGVF AAAFSGSSQN LMYLTVSNEL PWAQSFMEVQ NTMNAEIKNI FLSDGLTVLK ELEAVASGVK KLYPAIQQGS IDPISLQPLK DSVTELSNGI DLVSKEVDCF FKILLSGRDT LLENLRSMGA STLQATSPKK AAGKNYRGF // ID NC003070_42 HYPOTHETICAL; PRT; 872 AA. AC NC003070_42; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[202345...202508, 202782...202911, DE 202991...203128, 203267...203599, 203760...203943, 204991...205949, DE 206644...207243, 207325...207435]; Length: 2619. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 54 FIRST EXON; p-value: NaN. FT GENSCAN 55 55 AA on splice site: tg/t -> C. FT GENSCAN 56 98 INTERNAL EXON; p-value: NaN. FT GENSCAN 99 144 INTERNAL EXON; p-value: NaN. FT GENSCAN 145 255 INTERNAL EXON; p-value: NaN. FT GENSCAN 256 316 INTERNAL EXON; p-value: NaN. FT GENSCAN 317 317 AA on splice site: g/tg -> V. FT GENSCAN 318 636 INTERNAL EXON; p-value: NaN. FT GENSCAN 637 836 INTERNAL EXON; p-value: NaN. FT GENSCAN 837 872 LAST EXON; p-value: NaN. SQ SEQUENCE 872 AA; 100564 MW; 7717CCA8FF612A00 CRC64; MSIEKPFFGD DSNRGVSING GRYVQYNVYG NLFEVSKKYV PPLRPIGRGA SGIVCAAWNS ETGEEVAIKK IGNAFGNIID AKRTLREIKL LKHMDHDNVI AIIDIIRPPQ PDNFNDVHIV YELMDTDLHH IIRSNQPLTD DHSRFFLYQL LRGLKYVHSA NVLHRDLKPS NLLLNANCDL KIGDFGLART KSETDFMTEY VVTRWYRAPE LLLNCSEYTA AIDIWSVGCI LGEIMTREPL FPGRDYVQQL RLITELIGSP DDSSLGFLRS DNARRYVRQL PQYPRQNFAA RFPNMSVNAV DLLQKMLVFD PNRRITVTHH HDKKIKKRRS FGYFVVVFLI SPEELSAMGD DKKDQKQLSS YLKAKNFSCD SHFSFYWLMS RLIFLILAIL FSLQFVFYPL NFISSSSQPL IKFSVSPVVS GSGSVHEPDQ TELKHVVFGI AASAKFWKHR KDYVKLWWKP NGEMNGVVWL DQHINQNDNV SKTLPPIRIS SDTSRFQYRY PKGLRSAIRI TRIVSETVRL LNGTELEKNV RWIVMGDDDT VFFPENLVKV LRKYDHNQFY YIGSSSESHI QNLKFSYGMA YGGGGFAISY PLAKALEKMQ DRCIQRYSEL YGSDDRIHAC MSELGVPLTK EVGFHQKIIL DNYCFDFVLL YQIDLYGKLL GLLSAHPLAP LVSIHHLDLV DPVFPNMGRV NAMRRFMVPA KLDSPSLAQQ SICYDADHRW TVSVSWGYTV QIIRGVLSAR EMVIPTRTFI DWYKQADERS YAFNTRPIAK SACQRPRVYY LSNALPDLAL RRTASEYVRW YDMWEPECDW DMSDPSEFER VIVYKKPDPD RWNKHRAPRR DCCRVLPTTK NGTMVIDVGA CKDDEFAEFL VK // ID NC003070_43 HYPOTHETICAL; PRT; 220 AA. AC NC003070_43; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[209395...209608, 210357...210619, DE 210690...210875]; Length: 663. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 71 FIRST EXON; p-value: NaN. FT GENSCAN 72 72 AA on splice site: g/tt -> V. FT GENSCAN 73 159 INTERNAL EXON; p-value: NaN. FT GENSCAN 160 220 LAST EXON; p-value: NaN. SQ SEQUENCE 220 AA; 24694 MW; ECDA8FD2CAA20C50 CRC64; MEIEKSNNGG SNPSAGEEFK DMIKGVTKFL MMVIFLGTIM LWIMMPTLTY RTKWLPHLRI KFGTSTYFGA TVNRLITYEL RWQAKLESAA LRLGLIGNIC LAFLFLPVAR GSSLLPAMGL TSESSIKYHI WLGHMVMALF TVHGLCYIIY WASMHEISQS TSGPYEKVND FKVVTLNGAA SIPRNAVKFI ILGLRMAYKC SVDIAINFVA PKRKLSNHAF // ID NC003070_44 HYPOTHETICAL; PRT; 480 AA. AC NC003070_44; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[211058...211379, 211474...211717, DE 211855...212573, 213107...213163, 213546...213646]; Length: 1443. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 107 FIRST EXON; p-value: NaN. FT GENSCAN 108 108 AA on splice site: a/ag -> K. FT GENSCAN 109 188 INTERNAL EXON; p-value: NaN. FT GENSCAN 189 189 AA on splice site: ag/g -> R. FT GENSCAN 190 428 INTERNAL EXON; p-value: NaN. FT GENSCAN 429 429 AA on splice site: a/tt -> I. FT GENSCAN 430 447 INTERNAL EXON; p-value: NaN. FT GENSCAN 448 448 AA on splice site: t/ta -> L. FT GENSCAN 449 480 LAST EXON; p-value: NaN. SQ SEQUENCE 480 AA; 54496 MW; 845D53873F0D86AB CRC64; MWDTKGVSNL AGEIALAAGL VMWATTYPKI RRRFFEVFFY THYLYIVFML FFVLHVGISF SFIALPGFYI FLVDRFLRFL QSRENVRLLA ARILPSDTME LTFSKNSKLV YSPTSIMFVN IPSISKLQWH PFTITSSSKL EPEKLSIVIK KEGKWSTKLH QRLSSSDQID RLAVSVEGPY GPASADFLRH EALVMVCGGS GITPFISVIR DLIATSQKET CKIPKITLIC AFKKSSEISM LDLVLPLSGL ETELSSDINI KIEAFITRDN DAGDEAKAGK IKTLWFKPSL SDQSISSILG PNSWLWLGAI LASSFLIFMI IIGIITRYYI YPIDHNTNKI YSLTSKTIIY ILVISVSIMA TCSAAMLWNK KKYGKVESKQ VQNVDRPSPT SSPTSSWGYN SLREIESTPQ ESLVQRTNLH FGERPNLKIK HTYEKVNLLE TSWVIHRLQG IKSKKKKKRK DNRNIRLHAS AYKLNDSAFR // ID NC003070_45 HYPOTHETICAL; PRT; 1047 AA. AC NC003070_45; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[214229...214406, 214480...214582, DE 214697...214897, 215761...215960, 216044...216293, 216414...217090, DE 219330...219621, 219752...220994]; Length: 3144. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 59 FIRST EXON; p-value: NaN. FT GENSCAN 60 60 AA on splice site: g/ga -> G. FT GENSCAN 61 93 INTERNAL EXON; p-value: NaN. FT GENSCAN 94 94 AA on splice site: ag/c -> S. FT GENSCAN 95 160 INTERNAL EXON; p-value: NaN. FT GENSCAN 161 161 AA on splice site: ct/t -> L. FT GENSCAN 162 227 INTERNAL EXON; p-value: NaN. FT GENSCAN 228 228 AA on splice site: a/tg -> M. FT GENSCAN 229 310 INTERNAL EXON; p-value: NaN. FT GENSCAN 311 311 AA on splice site: cg/a -> R. FT GENSCAN 312 536 INTERNAL EXON; p-value: NaN. FT GENSCAN 537 537 AA on splice site: a/ag -> K. FT GENSCAN 538 633 INTERNAL EXON; p-value: NaN. FT GENSCAN 634 634 AA on splice site: ag/g -> R. FT GENSCAN 635 1047 LAST EXON; p-value: NaN. SQ SEQUENCE 1047 AA; 118326 MW; AA0B352B34E8DB63 CRC64; MGVGEMNKEV IDKVIKFLMM VILMGTIVIW IMMPTSTYKE IWLTSMRAKL GKSIYYGRPG VNLLVYMFPM ILLAFLGCIY LHLKKSTTVN QFNSGVEKKR AKFGALRRPM LVNGPLGIVT VTEVMFLTMF MALLLWSLAN YMYRTFVNVT SESAATDGNN LHYLYIVFML FFVFHVGISH ALIPLPGFYI FLVDRFLRFL QSRNNVKLVS ARVLPCDTVE LNFSKNPMLM YSPTSTMFVN IPSISKLQWH PFTIISSSKL EPETLSVMIK SQGKWSTKLY DMLSSSSSDQ INRLAVSVEG PYGPSSTDFL RHESLVMVSG GSGITPFISI VRDLFYMSST HKCKIPKMTL ICAFKNSSDL SMLDLILPTS GLTTDMASFV DIQIKAFVTR EEKTSVKEST HNRNIIKTRH FKPNVSDQPI SPILGPNSWL CLAAILSSSF MIFIVIIAII TRYHIHPIDQ NSEKYTWAYK SLIYLVSISI TVVTTSTAAM LWNKKKYYAK NDQYVDNLSP VIIESSPQQL ISQSTDIHYG ERPNLNKQRD RMHEWITENL RACGGTYQTC IFAVPFLAKK QGLVTVTCDP KNLEHMLKTR FDNYPKGPTW QSVFHDLLGQ GIFNSDGDTW LFQRKTAALE FTTRTLRQAM GRWVNRGIKL RFCPILATAQ DNAEPVDLQD LILRLTFDNI CGLAFGKDTR TCAPGLPENG FASAFDRATE ASLQRFIIPK FMWKLKKWLG LGLEVSLSRS LGEIDEYLAA VINTRKQELM SQQESGTHQR HDDLLSRFMM KKTESYSDTF LQHVALNFIL AGRDTSSVAL SWFFWLITMH PTVEDKIVRE ICSVLIETRG TDDVASWTEE PLGFDEIDRL VYLKAAISET LRLYPSVPED SKHVENDDVL PDGTFVPAGS SVTYSIYAAG RMKSTWGEDC LEFNPERWIS PIDGKFINHD QYRFVAFNAG PRICLGKDLA YLQMKTIAAA VLLRHRLTVV PGHKVEQKMS LTLFMKNGLL VNLYKRDLQG IIKSLVVKKS DGVSNGQCNG VIGEGVAVYL NTGVAVV // ID NC003070_46 HYPOTHETICAL; PRT; 503 AA. AC NC003070_46; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[224255...223945, 223866...223551, DE 223096...222622, 222359...221950]; Length: 1512. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 103 FIRST EXON; p-value: NaN. FT GENSCAN 104 104 AA on splice site: cg/g -> R. FT GENSCAN 105 209 INTERNAL EXON; p-value: NaN. FT GENSCAN 210 367 INTERNAL EXON; p-value: NaN. FT GENSCAN 368 368 AA on splice site: g/gt -> G. FT GENSCAN 369 503 LAST EXON; p-value: NaN. SQ SEQUENCE 503 AA; 56263 MW; 827EEB7EBE337766 CRC64; MSPAKKSRSF PPISECKSRE YDSIAADLDG TLLLSRSSFP YFMLVAIEAG SLFRGLILLL SLPIVIIAYL FVSESLGIQI LIFISFAGIK IKNIELVSRA VLTRFYAADV RKDSFEVFDK CKKRKVVVTA NPIVMVEPFV KDYLGGDKVL GTEIEVNPKT MKATGFVKKP GVLVGDLKRL AILKEFGDDS PDLGLGDRTS DHDFMSICKE GYMVHETKSA TTVPIESLKN RIIFHDGRLV QRPTPLNALI IYLWLPFGFM LSVFRVYFNL PLPERFVRYT YEILGIHLTI RGHRPPPPSP GKPGNLYVLN HRTALDPIII AIALGRKITC VTYSVSRLSL MLSPIPAVAL TRDRVADAAR MRQLLEKGDL VICPEGTTCR EPYLLRFSAL FAELSDRIVP VAMNCKQGMF NGTTVRGVKF WDPYFFFMNP RPSYEATFLD RLPEEMTVNG GGKTPFEVAN YVQKVIGGVL GFECTELTRK DKYLLLGGND GKVESINKTK SME // ID NC003070_47 HYPOTHETICAL; PRT; 265 AA. AC NC003070_47; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[227176...226849, 226691...226396, DE 226311...226171, 225381...225349]; Length: 798. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 109 FIRST EXON; p-value: NaN. FT GENSCAN 110 110 AA on splice site: g/gt -> G. FT GENSCAN 111 208 INTERNAL EXON; p-value: NaN. FT GENSCAN 209 255 INTERNAL EXON; p-value: NaN. FT GENSCAN 256 265 LAST EXON; p-value: NaN. SQ SEQUENCE 265 AA; 28293 MW; 15E07542B9858D23 CRC64; MEGKEEDVRV GANKFPERQP IGTSAQTDKD YKEPPPAPFF EPGELSSWSF YRAGIAEFIA TFLFLYITVL TVMGVKRAPN MCASVGIQGI AWAFGGMIFA LVYCTAGISG GHINPAVTFG LFLARKLSLT RAVFYIVMQC LGAICGAGVV KGFQPNPYQT LGGGANTVAH GYTKGSGLGA EIIGTFVLVY TVFSATDAKR SARDSHVPIL APLPIGFAVF LVHLATIPIT GTGINPARSL GAAIIYNKDH AWDDHFLLFS FISSK // ID NC003070_48 HYPOTHETICAL; PRT; 255 AA. AC NC003070_48; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[229206...229343, 229429...229724, DE 229829...229934, 230349...230459, 230559...230675]; Length: 768. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 46 FIRST EXON; p-value: NaN. FT GENSCAN 47 144 INTERNAL EXON; p-value: NaN. FT GENSCAN 145 145 AA on splice site: ag/a -> R. FT GENSCAN 146 180 INTERNAL EXON; p-value: NaN. FT GENSCAN 181 217 INTERNAL EXON; p-value: NaN. FT GENSCAN 218 255 LAST EXON; p-value: NaN. SQ SEQUENCE 255 AA; 29217 MW; 390A903CD431B13C CRC64; MENKETKQEP AAAAEQKTVP LIEDEIERSK VGIMRALCDR QDPETKEVDD LMIRRFLRAR DLDIEKASTM FLNYLTWKRS MLPKGHIPEA EIANDLSHNK MCMQGHDKMG RPIAVAIGNR HNPSKGNPDE FKRFVVYTLE KICARMPRGQ EKFVAIGDLQ GWGYSNCDIR GYLAALSTLQ DCYPERLGKL YIVHAPYIFM TAWKVIYPFI DANTKKKIVF VENKKLTPTL LEDIDESQLP DIYGGKLPLV PIQET // ID NC003070_49 HYPOTHETICAL; PRT; 627 AA. AC NC003070_49; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[238860...238750, 237023...236865, DE 236784...236673, 236548...236429, 236320...236145, 236027...235845, DE 234988...234935, 234621...234529, 234392...234250, 233695...233619, DE 233386...233298, 231858...231784, 231655...231164]; Length: 1884. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 37 FIRST EXON; p-value: NaN. FT GENSCAN 38 90 INTERNAL EXON; p-value: NaN. FT GENSCAN 91 127 INTERNAL EXON; p-value: NaN. FT GENSCAN 128 128 AA on splice site: g/at -> D. FT GENSCAN 129 167 INTERNAL EXON; p-value: NaN. FT GENSCAN 168 168 AA on splice site: g/tt -> V. FT GENSCAN 169 226 INTERNAL EXON; p-value: NaN. FT GENSCAN 227 287 INTERNAL EXON; p-value: NaN. FT GENSCAN 288 305 INTERNAL EXON; p-value: NaN. FT GENSCAN 306 336 INTERNAL EXON; p-value: NaN. FT GENSCAN 337 383 INTERNAL EXON; p-value: NaN. FT GENSCAN 384 384 AA on splice site: ag/t -> S. FT GENSCAN 385 409 INTERNAL EXON; p-value: NaN. FT GENSCAN 410 410 AA on splice site: g/gt -> G. FT GENSCAN 411 439 INTERNAL EXON; p-value: NaN. FT GENSCAN 440 464 INTERNAL EXON; p-value: NaN. FT GENSCAN 465 627 LAST EXON; p-value: NaN. SQ SEQUENCE 627 AA; 70120 MW; B67E143AE3CA628A CRC64; MRDNGDTELS RSRSDVAQIQ PWEKRNKPSR EEAKVLKVKV PTRVNGSEYT EYVGVGARFG PTLESKEKHA TLIKLAIADP PDCCSTPKNK LTGEVILVHR GKCSFTTKTK VAEAAGASAI LIINNSTDLF KMVCEKGENV LDITIPVVML PVDAGRSLEN IVKSNAIVGV DYICSVTLQL YSPKRPAVDV AEVFLWLMAV GTILCASYWS AWTVREEAIE QDKLLKDGSD ELLQLSTTSS RGVVEVTVIS AILFVVVASC FLIMLYKLMS FWFIEVLVVL FCIGGVEGIS LIITVLQIVR VPNLKVGFVL LSCAFMYDIF WVFVSKWWFR ESVMIVVARG DRSGEDGIPM LLKIPRMFDP WGGYSIIGFG DIILPGLLVT FALSNKKETY YDKYLNLMLN PIKIVHRSAG KNTEPTTDVL EPKAMYVLKL RMVCGAMCQE QMHTDVLVKP GEEAPPIPTH KAVLAARSKV FRNMLDSDEC KTSPEESITL PDLSHDELKS LLEFLYSGNL KAPYNQYRSL YLAADKYDIS YLQDVCRNHF IASLSSRNVL DILELASIPC DTILKDAAIN HIVKHMEEVV VPMKYETFVQ RNPDLSVEIT RAYLRETKAK AKDHGAPLNG NTRPRIW // ID NC003070_50 HYPOTHETICAL; PRT; 885 AA. AC NC003070_50; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[245656...245630, 245358...245244, DE 244873...244770, 244635...244505, 244182...244035, 243746...243410, DE 243100...242962, 242660...242449, 242290...242214, 242113...241983, DE 241719...241539, 241443...241306, 241226...240965, 240892...240744, DE 240672...240352, 240242...240057]; Length: 2658. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 9 FIRST EXON; p-value: NaN. FT GENSCAN 10 47 INTERNAL EXON; p-value: NaN. FT GENSCAN 48 48 AA on splice site: a/ca -> T. FT GENSCAN 49 82 INTERNAL EXON; p-value: NaN. FT GENSCAN 83 125 INTERNAL EXON; p-value: NaN. FT GENSCAN 126 126 AA on splice site: ga/g -> E. FT GENSCAN 127 175 INTERNAL EXON; p-value: NaN. FT GENSCAN 176 287 INTERNAL EXON; p-value: NaN. FT GENSCAN 288 288 AA on splice site: g/cc -> A. FT GENSCAN 289 333 INTERNAL EXON; p-value: NaN. FT GENSCAN 334 334 AA on splice site: tg/t -> C. FT GENSCAN 335 404 INTERNAL EXON; p-value: NaN. FT GENSCAN 405 405 AA on splice site: g/ct -> A. FT GENSCAN 406 430 INTERNAL EXON; p-value: NaN. FT GENSCAN 431 473 INTERNAL EXON; p-value: NaN. FT GENSCAN 474 474 AA on splice site: ga/g -> E. FT GENSCAN 475 534 INTERNAL EXON; p-value: NaN. FT GENSCAN 535 580 INTERNAL EXON; p-value: NaN. FT GENSCAN 581 667 INTERNAL EXON; p-value: NaN. FT GENSCAN 668 668 AA on splice site: g/gt -> G. FT GENSCAN 669 717 INTERNAL EXON; p-value: NaN. FT GENSCAN 718 824 INTERNAL EXON; p-value: NaN. FT GENSCAN 825 885 LAST EXON; p-value: NaN. SQ SEQUENCE 885 AA; 102598 MW; 0E1D8AD47C835C9B CRC64; MIVGNVSFTR EFVSLRIYLI EKSGESMAKA LNDDDPVYVA VSEDVDQTSG LEQSEIDAIQ ELEQTSRNDT LLKYHDICID EGVIEQDVDM SCFSANSVGE WIVELIYQNN IKKLIMGATA DSHYSEEGRF EHAGSAYSSS SSLHSIDSAL IPYGGAGRAE RVTEPHALSS SEEQSARGIE KMYYEEQRRR LEIEELKREK EQRDKMRRVR EEALSSSSGV TKILYNEEVM RRREVEAELN RAKAEIEDMK RVQIELKEQH YADCRLLEKE RDEAIKTTEE LLRALEKADG YTYEADEFRR WLNHGGEKSP MTNLRLENRN LIPNLVLRSA IKDCFQGRKK DRKRRRKERT SMAELMAMGN DVVHVAVKSD VRESRSTLLW ALRNLGAKKV CILHVYQPKT ASPAARKLEE LEAIMYETLH DYFDFCQQEG VNEDDIYISC IEMNDVKQGI LELIHESKIK KLVMGAASDH HYSEEANLED CMGETESEAG QSKPKLYSSA SPKCSAELVS AIVAYIDTRR DRDMLEPNAS EDQSESDRND QLYRQLKQAL MEVEESKREA YEECVRRFKA ENTAVEAIRS AREYEAMYNE EAKLRKEGKE ALAKQRKMVE KTKQERDDAL IIILNGRKLY NEELRRRVEA EEMLGKEKEE HERTKKEIEE VRAIVQDGMQ LYNEQLRHRK EMEESMKRQE EELEKTKKEK EEACMISKNL MQLYEDEVRQ RKEAEELVKR RREELEKVKK EKEEACSVGQ NFMRLYEEEA RRRKGTEEEL SKVAAEKDAA SSVCSEILLL LQSYTRRHGT PSGFSDEDSV TRQPPSYFIC PISQEVMREP RVAADGFTYE AESLREWLDN GHETSPMTNL KLAHNNLVPN HALRSAIQEW LQRNS // ID NC003070_51 HYPOTHETICAL; PRT; 239 AA. AC NC003070_51; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[248329...248191, 247680...247577, DE 247510...247362, 247078...246949, 246608...246411]; Length: 720. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 46 FIRST EXON; p-value: NaN. FT GENSCAN 47 47 AA on splice site: t/ca -> S. FT GENSCAN 48 81 INTERNAL EXON; p-value: NaN. FT GENSCAN 82 130 INTERNAL EXON; p-value: NaN. FT GENSCAN 131 131 AA on splice site: ag/a -> R. FT GENSCAN 132 174 INTERNAL EXON; p-value: NaN. FT GENSCAN 175 239 LAST EXON; p-value: NaN. SQ SEQUENCE 239 AA; 27433 MW; 5A146CD4D2781C9F CRC64; MEDAIYVAVN QDVRESKKTL LWALKNLQVK KIFLLHVHLP FSLTTSSSRL EQSEIDAIQD SELNTSVNSL YKYRDICINK GKSILMQVNE KDVDTSMISG HDVGEGIVEL IYQNIITNLV MGAAADPHYS RERSFYLGNP SDSFSEFSTS AEKPISKGRR RDEEEEPESP KEHPEIMRDP HVAADGFTYE AEEFRKWLRS GGRTSPKTNK PLENHNLVPN HTLRIIIKDW LEKNPNYKR // ID NC003070_52 HYPOTHETICAL; PRT; 812 AA. AC NC003070_52; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[249248...249306, 249540...249693, DE 249774...249930, 250014...250151, 250536...250588, 250669...250863, DE 251004...251066, 251101...251184, 251260...251316, 251399...251440, DE 251729...251827, 251898...252006, 252835...252885, 252954...253457, DE 253531...253662, 253954...254495]; Length: 2439. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: NaN. FT GENSCAN 20 20 AA on splice site: ag/a -> R. FT GENSCAN 21 71 INTERNAL EXON; p-value: NaN. FT GENSCAN 72 123 INTERNAL EXON; p-value: NaN. FT GENSCAN 124 124 AA on splice site: t/ta -> L. FT GENSCAN 125 169 INTERNAL EXON; p-value: NaN. FT GENSCAN 170 170 AA on splice site: a/ct -> T. FT GENSCAN 171 187 INTERNAL EXON; p-value: NaN. FT GENSCAN 188 252 INTERNAL EXON; p-value: NaN. FT GENSCAN 253 273 INTERNAL EXON; p-value: NaN. FT GENSCAN 274 301 INTERNAL EXON; p-value: NaN. FT GENSCAN 302 320 INTERNAL EXON; p-value: NaN. FT GENSCAN 321 334 INTERNAL EXON; p-value: NaN. FT GENSCAN 335 367 INTERNAL EXON; p-value: NaN. FT GENSCAN 368 403 INTERNAL EXON; p-value: NaN. FT GENSCAN 404 404 AA on splice site: g/ag -> E. FT GENSCAN 405 420 INTERNAL EXON; p-value: NaN. FT GENSCAN 421 421 AA on splice site: t/tc -> F. FT GENSCAN 422 588 INTERNAL EXON; p-value: NaN. FT GENSCAN 589 589 AA on splice site: g/at -> D. FT GENSCAN 590 632 INTERNAL EXON; p-value: NaN. FT GENSCAN 633 633 AA on splice site: g/gt -> G. FT GENSCAN 634 812 LAST EXON; p-value: NaN. SQ SEQUENCE 812 AA; 93245 MW; E9A7BDE9F1B64538 CRC64; MNINKACDLK SISVFPPNLR RRSAEPQASQ QLRSQQSQQS FSQGPSSSQR GCGGFSQMTQ SSIDELLIND QRFSSQERDL SLKKVSSCLP PINHKREDSQ LVASRSSSGL SRRWSSASIG ESKLAQISEE LEQRFGMMET SLSRFGMMLD SIQSDIMQAN RGTKEVFLET ERIQQKLTLQ DTSLQQLRKE QADSKASLDG GVKFILEEFS KDPNQEKLQK ILQMLTTIPE QVETALQKIQ REICHTFTRE IQVLASLRTP EPRVRVPTAP QVKLHISAIN VAFVNYQAKE NLPEQRGQAA KVLTSLKMPE PRVQVPAAPQ AKENFPEQRG PVAKIQVGCW KTVKPEKSNF KKRATRKPVK SESTRTQECA IQFMQFEQCS VVIDSDEEDI DGGFSCLINE NTRESHRRKL LAFKSFVVSV FEKHVEEKLR SNRKERGWIS GIFRRKKRSP QEEEVDNENN SSEDSRLMGA KNLHLFLSEI MRKLKHAIRK EKPDKRLLGK KKSFEKSLST KDHFFLERMT SISQKRFHQE HNDPNLATSK QVQRNHERIV WLPEYSSPFS SPGRIWKQNS TVLLRSSSYD FIKSETIADD SITLKEIGTA SSSEGSSSPL LSKSNQIVDE MSKVAAYGAE NEGEAKIQPL CNQLQEKNRH KESVFKYVKA VLEAIDSNWE ELYLKTEISD QLLYPALISN IPFYPNQLCV EHELLFDCIN EVLFEFCRFP QWVSFAETRT QVLPYSVESI VPEVQEKVYC HLLPMQLRRS LEERVREDMA KHRSWLDIRC DLECIGFETS ELILNELLEQ LMLELEDNHK NG // ID NC003070_53 HYPOTHETICAL; PRT; 56 AA. AC NC003070_53; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[255317...255147]; Length: 171. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 56 SINGLE EXON; p-value: NaN. SQ SEQUENCE 56 AA; 6496 MW; 5618DDB85121094B CRC64; MVMGWNSLFL LGFVEFRRHV AVRAECDRLS DVYDTKQRVS TTTAKSRRRD DGKSTT // ID NC003070_54 HYPOTHETICAL; PRT; 258 AA. AC NC003070_54; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[256939...257572, 259029...259171]; Length: DE 777. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 211 FIRST EXON; p-value: NaN. FT GENSCAN 212 212 AA on splice site: g/cg -> A. FT GENSCAN 213 258 LAST EXON; p-value: NaN. SQ SEQUENCE 258 AA; 29907 MW; 0677D36094C0168B CRC64; MWSFLRKICF STNAIGNFFT MVEERYESLL TTVEDFQTRN PAVLLCENFL FFFLGHVSLF LSGLSHSFRF AEHPYYFSLY LIAANVVFGY FFLRAARRDK NETDDDNPYA MLPDFPRNPP PPALHHHSMS AWFLWGVRSV AVYLLKTHSG ESDLFATGAL LFFHAYCITL LKLNVSDFGV SGAFMSLASA FSETLMKSPH FKLLNSLICF YAWLWWFLSD DSEFYRRSPH QRSGLLLNLK DLRVIFCRSS LLLLLKKS // ID NC003070_55 HYPOTHETICAL; PRT; 516 AA. AC NC003070_55; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[261474...261129, 261040...260921, DE 260824...260670, 260582...260118, 260034...259690, 259614...259495]; DE Length: 1551. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 115 FIRST EXON; p-value: NaN. FT GENSCAN 116 116 AA on splice site: g/ca -> A. FT GENSCAN 117 155 INTERNAL EXON; p-value: NaN. FT GENSCAN 156 156 AA on splice site: g/ct -> A. FT GENSCAN 157 207 INTERNAL EXON; p-value: NaN. FT GENSCAN 208 362 INTERNAL EXON; p-value: NaN. FT GENSCAN 363 477 INTERNAL EXON; p-value: NaN. FT GENSCAN 478 516 LAST EXON; p-value: NaN. SQ SEQUENCE 516 AA; 58115 MW; 39F1C4A761A51E97 CRC64; MENLPNHEEN DDVGYHQSPG PIDPNDHSAS ETPVYSTMST DSFAYHRTCS ETSGGGFSDQ IDETSSFCTE ASPSDWPVLT ESNNSASSNF PTVFDLKHNQ IETDEHLAVQ EISEPAELET MKERFSKLLL GEDMSGSGKG VCTAVTISNA ITNLYATVFG QNLRLEPLEI EQKTTWKREM NCLLSVCDYI FEFIPKSQNL SNGATVEVME SRPRADIYIN LPALRKLDSM LMVVTTPEQE AIINKIDSVN IPHKSHSLCF LQEALDSFQK TEFWYAEEGS LSMKSTRSAT GSFRKVIVQR KEEKWWLPIP LVPLQGLSEK ARKQLKSKRE STNQIHKAAM AINSSILGEM DIPDSYMATL PKSGKASTGD AIYRHMTSSG RFSPEKLLDR LKIVSEHEAL QLADRVEASM YTWRRKACLN NSKSSWNMVK DLMSITERSD KNYVLAERAE SLLFCLKQRY PELSQTSLDI CKIHCNKDVG KAVLESYSRV LEGLAFNIVA WIDDVLYVDK TMRGEE // ID NC003070_56 HYPOTHETICAL; PRT; 785 AA. AC NC003070_56; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[262950...262965, 263047...263139, DE 263232...263299, 263369...263444, 263526...263572, 263645...263770, DE 263857...263934, 264073...264160, 264280...264392, 264520...264582, DE 264657...264712, 264797...264901, 265021...265077, 266534...266724, DE 267263...267443, 267993...268127, 268441...268463, 268499...268630, DE 268717...268994, 269083...269514]; Length: 2358. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 5 FIRST EXON; p-value: NaN. FT GENSCAN 6 6 AA on splice site: g/tt -> V. FT GENSCAN 7 36 INTERNAL EXON; p-value: NaN. FT GENSCAN 37 37 AA on splice site: g/gg -> G. FT GENSCAN 38 59 INTERNAL EXON; p-value: NaN. FT GENSCAN 60 84 INTERNAL EXON; p-value: NaN. FT GENSCAN 85 85 AA on splice site: g/gt -> G. FT GENSCAN 86 100 INTERNAL EXON; p-value: NaN. FT GENSCAN 101 142 INTERNAL EXON; p-value: NaN. FT GENSCAN 143 168 INTERNAL EXON; p-value: NaN. FT GENSCAN 169 197 INTERNAL EXON; p-value: NaN. FT GENSCAN 198 198 AA on splice site: a/tc -> I. FT GENSCAN 199 235 INTERNAL EXON; p-value: NaN. FT GENSCAN 236 256 INTERNAL EXON; p-value: NaN. FT GENSCAN 257 274 INTERNAL EXON; p-value: NaN. FT GENSCAN 275 275 AA on splice site: ag/g -> R. FT GENSCAN 276 309 INTERNAL EXON; p-value: NaN. FT GENSCAN 310 310 AA on splice site: ag/a -> R. FT GENSCAN 311 328 INTERNAL EXON; p-value: NaN. FT GENSCAN 329 329 AA on splice site: cg/a -> R. FT GENSCAN 330 392 INTERNAL EXON; p-value: NaN. FT GENSCAN 393 393 AA on splice site: g/tt -> V. FT GENSCAN 394 452 INTERNAL EXON; p-value: NaN. FT GENSCAN 453 453 AA on splice site: aa/a -> K. FT GENSCAN 454 497 INTERNAL EXON; p-value: NaN. FT GENSCAN 498 498 AA on splice site: aa/t -> N. FT GENSCAN 499 505 INTERNAL EXON; p-value: NaN. FT GENSCAN 506 506 AA on splice site: g/gt -> G. FT GENSCAN 507 549 INTERNAL EXON; p-value: NaN. FT GENSCAN 550 550 AA on splice site: g/gt -> G. FT GENSCAN 551 642 INTERNAL EXON; p-value: NaN. FT GENSCAN 643 785 LAST EXON; p-value: NaN. SQ SEQUENCE 785 AA; 89413 MW; C37E544353871276 CRC64; MNTESVVEFL GNVPLLQKLP SSSLKKIAQV VVPKRYGKGD YVVREDQTWD GCYFILQGEA QVSGPDEEDN RSEFLLKQYD YFGVGLSGNV HSADIVAMSQ LTCLVLPRDH CHLLETNSIW QSDTSLDKCS LVERILQLDP LELNIFRGIT LPDAPIFGKV FGGQFVGQAL AAASKTVDFL KVVHSLHSYF LLVGDIDIPI IYQVHRIRDG NNFATRRVDA VQKGNIIFIL LASFQKEQQG FEHQESTMPS VPDPDTLLSL EELRESRITD PHLPRSYRNK VATRNFVPWP IEIRFCEPSN STNQTKSPPR LNYWFRAKGR LSDDQALHRN EDLNSLDSEE DLEKMESEVE QMSRKVSEYR KSLPSNLSNS VISSRRSNFP NIDSLGFRTT STVILFLASL EEPLVSRDGA HIGGIEQESR DSLIQLKERV YENIATVPLV VERMRESKER IDKQKKNING GTRQWDRRST DRKRRIHSKI EPRGSVVSDR SDLDSNDNNN NIKKEGFRFH PTDEELVMHY LCRKCASQSI AVPIIAEIDL YKYDPWELPG LALYGEKEWY FFSPRDRKYP NGSRPNRSAG SGYWKATGAD KPIGLPKPVG IKKALVFYAG KAPKGEKTNW IMHEYRLADV DRSVRKKKNS LRLDDWVLCR IYNKKGATER RGPPPPVVYG DEIMEEKPKV TEMVMPPPPQ QTSEFAYFDT SDSVPKLHTT DSSCSEQVVS PEFTSEVQSE PKWKDWSAVS NDNNNTLDFG FNYIDATVDN AFGGGGSSNQ MFPLQDMFMY MQKPY // ID NC003070_57 HYPOTHETICAL; PRT; 62 AA. AC NC003070_57; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[270735...270622, 270090...270016]; Length: DE 189. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 38 FIRST EXON; p-value: NaN. FT GENSCAN 39 62 LAST EXON; p-value: NaN. SQ SEQUENCE 62 AA; 6882 MW; C07AA68322D33C12 CRC64; MNPQIEKVVK VTSVVATAVV SYFLLTADYG PEPNALDPIR QRILSAQDSV KEFIFPSKKS DK // ID NC003070_58 HYPOTHETICAL; PRT; 225 AA. AC NC003070_58; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[271002...271289, 271483...271872]; Length: DE 678. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 96 FIRST EXON; p-value: NaN. FT GENSCAN 97 225 LAST EXON; p-value: NaN. SQ SEQUENCE 225 AA; 24470 MW; D92202D2597A7641 CRC64; MEKSTSPATR VTEPPPAATS SLPAQAPPLP TSADQRSAEL PSPAQPAQMT APPSTVTTDP SSRGRKRALE ANLQIESSNY YKMRLLVKDL RPHVLELNKR PNLFDSVSLE MMMFAEMKLM LQLYEEMIGE SPKREKTAKS DSLSNGKATT TTTTTTSVLR SSETEKHSSD GDGDKVVGGS AFGWNFITSG GPGTEPVYSG MSKEEYRSSH PIMQVEAELE LPNTF // ID NC003070_59 HYPOTHETICAL; PRT; 896 AA. AC NC003070_59; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[278487...278316, 278252...278091, DE 277863...277717, 277606...277493, 277401...277232, 277153...277045, DE 277017...276870, 276793...276706, 276352...276247, 275864...275788, DE 273988...273863, 273788...273429, 273339...273210, 273149...273064, DE 272983...272651, 272555...272445, 272362...272111]; Length: 2691. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 57 FIRST EXON; p-value: NaN. FT GENSCAN 58 58 AA on splice site: g/ac -> D. FT GENSCAN 59 111 INTERNAL EXON; p-value: NaN. FT GENSCAN 112 112 AA on splice site: g/gt -> G. FT GENSCAN 113 160 INTERNAL EXON; p-value: NaN. FT GENSCAN 161 161 AA on splice site: g/gt -> G. FT GENSCAN 162 198 INTERNAL EXON; p-value: NaN. FT GENSCAN 199 199 AA on splice site: g/gg -> G. FT GENSCAN 200 255 INTERNAL EXON; p-value: NaN. FT GENSCAN 256 291 INTERNAL EXON; p-value: NaN. FT GENSCAN 292 292 AA on splice site: t/gt -> C. FT GENSCAN 293 340 INTERNAL EXON; p-value: NaN. FT GENSCAN 341 341 AA on splice site: aa/a -> K. FT GENSCAN 342 370 INTERNAL EXON; p-value: NaN. FT GENSCAN 371 405 INTERNAL EXON; p-value: NaN. FT GENSCAN 406 406 AA on splice site: g/tt -> V. FT GENSCAN 407 431 INTERNAL EXON; p-value: NaN. FT GENSCAN 432 473 INTERNAL EXON; p-value: NaN. FT GENSCAN 474 593 INTERNAL EXON; p-value: NaN. FT GENSCAN 594 636 INTERNAL EXON; p-value: NaN. FT GENSCAN 637 637 AA on splice site: c/gg -> R. FT GENSCAN 638 665 INTERNAL EXON; p-value: NaN. FT GENSCAN 666 776 INTERNAL EXON; p-value: NaN. FT GENSCAN 777 813 INTERNAL EXON; p-value: NaN. FT GENSCAN 814 896 LAST EXON; p-value: NaN. SQ SEQUENCE 896 AA; 99982 MW; 650915B593DEE31B CRC64; MNESTDLFVG IVAMEEDWGK TVSEKVISAY MSLPKKGKPQ GREVTVLSAF LVSSPSQDPK VIALGTGTKC VSGSLLSPRG DIVNDSHAEV VARRALIRFF YSEIQRMQLT SGGYASTSSP LYALKKIPST QVDDSLLVQA SDICSSRHSD VPEIGSNSNK GNGSQVADMV QRKPGRGETT LSVSCSDKIA RWNVLGVQGA LLYQVLQPVY ISTITVGQSL HSPDNFSLAD HLRRSLYERI LPLSDELLTS FRLNKPLFFV APVPPSEFQH SETAQATLTC GLVFNTLPHM TCGVIYLLAL RYSLCWNYSG LHEVILGTTG RKQGTSAKGA LYPSTQSSIC KQRLLELFLK ETHGHKRESS KSKKSYRELK FISINKLLEI ASFVQGSKVI IYWDDYCVPK RKKERVLRGG EIIVIDSISA LILGNRSGEI FNVVSEHGET APNVVYQGKL ENHMKIAIKR FSGTAWPDPR QFLEEARLVG QLRSKRMANL LGYCCEGGER LLVAEFMPNE TLAKHLFHCK KKMPLINCLS IFFQDFLLRF AYHFYHLPGD TEPMKWAMRL RVALYISEAL EYCSNNGHTL YHDLNAYRVL FDEECNPRLS TFGLMKNSRD GKSYSTNLAF TPPEYLRTGE IDLHPPRRIT AESVIYSFGT LLLDLLTGKH IPPSHALDLI RDRNLQTLTD SCLEGQFSDS DGTELVRLTS CCLQYEARER PNIKSLVTAL ISLQKDTEVL SHVLMGLPQS GTFASPPSPF AEACSGKDLT SMVEILEKIG YKDDEDLSFM WTEQMQEAIN SKKKGDIAFR RKDFSEAIEF YTQFLDLGMI SATVLVRRSQ SYLMSNMAKE ALDDAMKAQG ISPVWYVALY LQSAALSVLG MEKESQIALT EGSILEARKI SASTQN // ID NC003070_60 HYPOTHETICAL; PRT; 631 AA. AC NC003070_60; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[278759...278809, 278887...279088, DE 279166...279240, 279327...279425, 279536...279576, 279797...279910, DE 279974...280088, 280180...280370, 280449...280568, 280657...280737, DE 280889...281121, 281309...281348, 281438...281536, 281759...282193]; DE Length: 1896. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 17 FIRST EXON; p-value: NaN. FT GENSCAN 18 84 INTERNAL EXON; p-value: NaN. FT GENSCAN 85 85 AA on splice site: g/tt -> V. FT GENSCAN 86 109 INTERNAL EXON; p-value: NaN. FT GENSCAN 110 110 AA on splice site: a/tt -> I. FT GENSCAN 111 142 INTERNAL EXON; p-value: NaN. FT GENSCAN 143 143 AA on splice site: g/gc -> G. FT GENSCAN 144 156 INTERNAL EXON; p-value: NaN. FT GENSCAN 157 194 INTERNAL EXON; p-value: NaN. FT GENSCAN 195 232 INTERNAL EXON; p-value: NaN. FT GENSCAN 233 233 AA on splice site: g/gt -> G. FT GENSCAN 234 296 INTERNAL EXON; p-value: NaN. FT GENSCAN 297 336 INTERNAL EXON; p-value: NaN. FT GENSCAN 337 363 INTERNAL EXON; p-value: NaN. FT GENSCAN 364 440 INTERNAL EXON; p-value: NaN. FT GENSCAN 441 441 AA on splice site: ag/t -> S. FT GENSCAN 442 454 INTERNAL EXON; p-value: NaN. FT GENSCAN 455 487 INTERNAL EXON; p-value: NaN. FT GENSCAN 488 631 LAST EXON; p-value: NaN. SQ SEQUENCE 631 AA; 68888 MW; 1AD29D165ABAA41B CRC64; MANPDVKEIL CDCVINLREN PKRRRETVYV GCGAGFGGDR PLAALKLLQR VEELNYLVLE CLAERTLADR WLSMASGGLG YDPRVSEWMQ LLLPLAVERG TCIITNMGAI DPSGAQKKVL EVAGELGLTI SVAVAHEVHF ETGSGSSFGG QYCSAGGTST YLGAAPIVEC LEKYQPNVII TSRVADAALF LAPMVYELGW NWNDLELLAQ GTLAGHLLEC GCQLTGGYFM HPGDQYRDMA FPLLQDLSLP YAEIGYDGKV CVSKVEGSGG ILNTSTCAEQ LLYEIADPSA YITPDVVIDI RGVSFLPLSD CKVQCSGAKP SSNTSVPEKL LRLIPKECGW KGWGEISYGG NGSIQRAKAS EFLVRSWMEE TIPGVNHCIL SYVIGVDSLK ATSNGTESWQ SCGDIRLRMD GLFKLKEHAV QLTKEFTALY TNGPAGGGGI STGHKMEIVL EKRLVSRESV MWKTGLQHTN TSEPETSEHH SPEKMPKLPK ENPKNLTMRG YQSGFHHSPA PSGQKIPLYS VAHSRAGDKG NDINFSIIPH YSPDVERLKL IITPQWVKHV MSVLLSTSSF LELDAKPMDE NVSVEIYDVE GIHAMNVVVR NILDGGVNCS RRIDRHGKTI SDLILCQQVV L // ID NC003070_61 HYPOTHETICAL; PRT; 246 AA. AC NC003070_61; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[282919...283053, 283132...283231, DE 283314...283336, 283444...283540, 283647...283853, 283904...284082]; DE Length: 741. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 45 FIRST EXON; p-value: NaN. FT GENSCAN 46 78 INTERNAL EXON; p-value: NaN. FT GENSCAN 79 79 AA on splice site: g/ga -> G. FT GENSCAN 80 86 INTERNAL EXON; p-value: NaN. FT GENSCAN 87 118 INTERNAL EXON; p-value: NaN. FT GENSCAN 119 119 AA on splice site: g/aa -> E. FT GENSCAN 120 187 INTERNAL EXON; p-value: NaN. FT GENSCAN 188 188 AA on splice site: g/at -> D. FT GENSCAN 189 246 LAST EXON; p-value: NaN. SQ SEQUENCE 246 AA; 27749 MW; 39F19944282C18A7 CRC64; MSFTGTLDKC NVCDKTVYVV DMLSIEGMPY HKSCFRCTHC KGTLQMSNYS SMDGVLYCKT HFEQLFKESG NFSKNFQPGK TEKPELTRTP SKISSIFCGT QDKCAACEKT VYPLEKVSEF FCSDKIITRS NGNKILILVW FSILLVRFEN VMLVGPSKCG INFGLVLIGL FGNRYKWKEN ASTRHVSDSV LYCRHHFNQL FMEKGNYAHV LQAANHRRTA SGNTLPPEPT EDVAVEAKEE NGVSES // ID NC003070_62 HYPOTHETICAL; PRT; 1293 AA. AC NC003070_62; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[284781...286351, 286452...286623, DE 286717...286807, 286997...287133, 287395...287514, 287789...287854, DE 288087...288176, 288401...288505, 288593...288721, 288882...289091, DE 289255...289386, 289718...289776, 290028...290094, 290380...290495, DE 293500...293756, 294205...294386, 294511...294888]; Length: 3882. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 523 FIRST EXON; p-value: NaN. FT GENSCAN 524 524 AA on splice site: gg/a -> G. FT GENSCAN 525 581 INTERNAL EXON; p-value: NaN. FT GENSCAN 582 611 INTERNAL EXON; p-value: NaN. FT GENSCAN 612 612 AA on splice site: g/gc -> G. FT GENSCAN 613 657 INTERNAL EXON; p-value: NaN. FT GENSCAN 658 697 INTERNAL EXON; p-value: NaN. FT GENSCAN 698 719 INTERNAL EXON; p-value: NaN. FT GENSCAN 720 749 INTERNAL EXON; p-value: NaN. FT GENSCAN 750 784 INTERNAL EXON; p-value: NaN. FT GENSCAN 785 827 INTERNAL EXON; p-value: NaN. FT GENSCAN 828 897 INTERNAL EXON; p-value: NaN. FT GENSCAN 898 941 INTERNAL EXON; p-value: NaN. FT GENSCAN 942 960 INTERNAL EXON; p-value: NaN. FT GENSCAN 961 961 AA on splice site: ag/t -> S. FT GENSCAN 962 983 INTERNAL EXON; p-value: NaN. FT GENSCAN 984 1021 INTERNAL EXON; p-value: NaN. FT GENSCAN 1022 1022 AA on splice site: ag/a -> R. FT GENSCAN 1023 1107 INTERNAL EXON; p-value: NaN. FT GENSCAN 1108 1108 AA on splice site: a/tg -> M. FT GENSCAN 1109 1168 INTERNAL EXON; p-value: NaN. FT GENSCAN 1169 1293 LAST EXON; p-value: NaN. SQ SEQUENCE 1293 AA; 139722 MW; 7914AA19E59590CE CRC64; MEYASTFQRP ILFHGGDGAS YCFPNRLISP KGISITSGDS KVHSCFRLRR NVAQSGTLNL MNACFSGRFY SGHLHSTKSI LGNGHQAKRI PFGFRLRCQG HESLGNADSN DHRIGESSES SDETEATDLK DARVENDTDS LEELKELLHK AIKELEVARL NSTMFEEKAQ RISERAIALK DEAATAWLDV NKTLDVIRDT VYEEALAKEA VQTATMALSL AEARLQVIVE SLEAGAGNDI PHVSEETEET IDVNDKEEAL LAAKDDIKEC QVNLDNCESQ LSALLSKKDE LQKEVDKLNE FAETIQISSL KAEEDVTNIM KLAEQAVAFE LEATQRVNDA EIALQRAEKS LSISQTPEET QGQLSDEETS QEDAMVLSGN VEDVTHQVEK ESPKDGDLPV VQITAELVPD IVGQRNKKLT QPYESSDHEN GKPSVESSKV VEADSEKPKI NVQTKKQETQ KDLPKEGSSL NSPKASFNKS SRFFSASFFS SNPDGTATVF GSLVGSVKQQ WPKLVLGLAL LGAGLTLYSN GVGGNNQLLQ QPDVTSTSTE DVSSSTKPLI RQVQKLPKRI KKLLEMIPHQ EVNEEEASLF DFLWLLLASV IFVPLFQKIP GGSPVLGYLA AGILIGPYGL SIIRNVHGTR AIAEFGVVFL LFNIGLEVLV TAAVVGLLAH YVAGQAGPAA IVIGNGLALS STAVVLQDLA VVVLLILIPL ISPNSSKGGI GFQAIAEALG LAAVKAAVAI TAIIAGGRLL LRPIYKQIAE NRNAEIFSAN TLLVILGTSL LTARAGLSMA LGAFLAGLLL AETEFSLQVE SDIAPYRGLL LGLFFMTVGM SIDPKLLLSN FPVIVGTLGL LIVGKTMLVV IMGKLFGISI ISAIRVGLLL APGGEFAFVA FGEAVNQLSS LLFLVVGISM AITPWLAAGG QLIASRFELH DVRSLLPVES EIIAQLLSER LIPFVALDVS SDRVTIGRSL DLPVYFGDAG SKETIDMLCQ SYTNNTSITK TFRLSRRLWS LVYSWQLLFL LRVAVVTGSN KGIGFEICRQ LANNGITVVL TARDENKGLA AVQKLKTENG FSDQAISFHP LDVSNPDTIA SLAAFVKTRF GKLDILAMQA GAPTDISKIM SDTYEIVEEC VKTNYYGVKR MCEAMIPLLQ SSDSPRIVSI ASTMGKLENV SNEWAKGVLS DAENLTEEKI DEVINEYLKD YKEGALQVKG WPTVMSGYIL SKAAVIALTR VLAKRHKSFI INSVCPGFVN TEINFNTGIL SVEEGAASPV KLALVPNGDP SGLFFDRANV SNF // ID NC003070_63 HYPOTHETICAL; PRT; 148 AA. AC NC003070_63; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[295242...295668, 295718...295737]; Length: DE 447. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 142 FIRST EXON; p-value: NaN. FT GENSCAN 143 143 AA on splice site: t/at -> Y. FT GENSCAN 144 148 LAST EXON; p-value: NaN. SQ SEQUENCE 148 AA; 17094 MW; E2C75E9532D981B9 CRC64; MKPETKRSTA TTRRRTNDGG SPFNFSSLKQ AIEAIDAAKD TSINLSSLSV ALSAASREMK QRKEKSHRRR ETRKRDKKKK KKKDSNNNTK LVIGTKPETD AELRRWFSYF DKSFKLSLPR KVKLTSPRKL NDHNKNLKEA PIYTRNNA // ID NC003070_64 HYPOTHETICAL; PRT; 210 AA. AC NC003070_64; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[297723...297532, 297175...297101, DE 296862...296782, 296700...296626, 296535...296362, 296248...296213]; DE Length: 633. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 64 FIRST EXON; p-value: NaN. FT GENSCAN 65 89 INTERNAL EXON; p-value: NaN. FT GENSCAN 90 116 INTERNAL EXON; p-value: NaN. FT GENSCAN 117 141 INTERNAL EXON; p-value: NaN. FT GENSCAN 142 199 INTERNAL EXON; p-value: NaN. FT GENSCAN 200 210 LAST EXON; p-value: NaN. SQ SEQUENCE 210 AA; 23047 MW; 321EE18EAC8CF2E7 CRC64; MSTLETTRAE LGLVVVYLNK AEARDKICRA IQYGSKFLSD GQPGTAQNVD KNTSLARKVF RLFKFVNDLH ALISPVPKGT PLPLVLLGKD KERAEILGRI SLFCWMGSSV CTSLVEVGEL GRLSASIKKL EKEIGNKDKH QNEQYRAKVE KSNERSLALI KAGMDVVVAF GLLQLAPKKV TPRVTGAFGF ASSLISCYQL LPSHPKSKMV // ID NC003070_65 HYPOTHETICAL; PRT; 571 AA. AC NC003070_65; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[300432...298717]; Length: 1716. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 571 SINGLE EXON; p-value: NaN. SQ SEQUENCE 571 AA; 61636 MW; 807765A65439856A CRC64; MAELGTMGKF GAMAEEQQVN QMIMDKQSVE EWLSRVNSLI PSVLSKAKTV KKFTGRWKTI ISKIEQIPAC LSDLSSHPCF SKNKLCNEQL QSVAKTLSEV IELAEQCSTD KYEGKLRMQS DLDSLSGKLD LNLRDCGVLI KTGVLGEATL PLYISSSSET PKISSLKELL ARLQIGHLES KHNALESLLG AMQEDEKMVL MPLIGRANVA ALVQLLTATS TRIREKAVNL ISVLAESGHC DEWLISEGVL PPLVRLIESG SLETKEKAAI AIQRLSMTEE NAREIAGHGG ITPLIDLCKT GDSVSQAASA AALKNMSAVS ELRQLLAEEG IIRVSIDLLN HGILLGSREH MAECLQNLTA ASDALREAIV SEGGVPSLLA YLDGPLPQQP AVTALRNLIP SVNPEIWVAL NLLPRLRHVL KSGSLGAQQA AASAICRFAC SPETKRLVGE SGCIPEIVKL LESKSNGCRE AAAQAIAGLV AEGRIRRELK KDGKSVLTNL VMLLDSNPGN TAKKYAVAGL LGMSGSEKSK KMMVSYGAIG YLKKLSEMEV MGADKLLEKL ERGKLRSFFH R // ID NC003070_66 HYPOTHETICAL; PRT; 152 AA. AC NC003070_66; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[303650...304108]; Length: 459. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 152 SINGLE EXON; p-value: NaN. SQ SEQUENCE 152 AA; 16943 MW; 08780E7D5A298F72 CRC64; MASIEQVAEL ISSLEQATLM AQQIGTTVGQ NQLLQISSLR IAHQRLSAFL ASIPTAEAEK SFSSVEPMQL GEEEKGEAEP AEEERYSAIE KVEEKMRECF IRNKRLKRPL SPSSAVVETS ATEERSGRDY GFDSDPHATK LRALDLIYQF HA // ID NC003070_67 HYPOTHETICAL; PRT; 797 AA. AC NC003070_67; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[308603...308397, 308318...308148, DE 308070...307889, 307805...307441, 307359...306893, 306800...306585, DE 306309...306138, 305810...305785, 305702...305547, 305258...305174, DE 304875...304697, 304393...304265, 304224...304186]; Length: 2394. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 69 FIRST EXON; p-value: NaN. FT GENSCAN 70 126 INTERNAL EXON; p-value: NaN. FT GENSCAN 127 186 INTERNAL EXON; p-value: NaN. FT GENSCAN 187 187 AA on splice site: ag/a -> R. FT GENSCAN 188 308 INTERNAL EXON; p-value: NaN. FT GENSCAN 309 309 AA on splice site: g/ga -> G. FT GENSCAN 310 464 INTERNAL EXON; p-value: NaN. FT GENSCAN 465 536 INTERNAL EXON; p-value: NaN. FT GENSCAN 537 593 INTERNAL EXON; p-value: NaN. FT GENSCAN 594 594 AA on splice site: g/gt -> G. FT GENSCAN 595 602 INTERNAL EXON; p-value: NaN. FT GENSCAN 603 654 INTERNAL EXON; p-value: NaN. FT GENSCAN 655 682 INTERNAL EXON; p-value: NaN. FT GENSCAN 683 683 AA on splice site: g/ct -> A. FT GENSCAN 684 742 INTERNAL EXON; p-value: NaN. FT GENSCAN 743 785 INTERNAL EXON; p-value: NaN. FT GENSCAN 786 797 LAST EXON; p-value: NaN. SQ SEQUENCE 797 AA; 89410 MW; 484F29F675C22465 CRC64; MIVMRFRVLQ FGAYPVFVVD GTPSPLKSQA RISRFFRSSG IDTCNLPVIK DGVSVERNKL FSEWVRECVE LLELLGIPVL KANGEAEALC AQLNSQGFVD ACITPDSDAF LFGAMCVIKD IKPNSREPFE CYHMSHIESG LGLKRKHLIA ISLLVGNDYD SGGVLGIGVD KALRIVREFS EDQVLERLQD IGNGLQPAVP GGIKSGDDGE EFRSEMKKRS PHCSRCGHLG SKRTHFKSSC EHCGCDSGCI KKPLGFRCEC SFCSKDRDLR EQKKTNDWWI KVCDKIALAP EFPNRKIIEL YLSDGLMTGD GSSMSWGTPD TGMLVDLMVF KLHWDPSYVR KMLLPMLSTI YLREKARNNT GYALLCDQYE FHSIKCIKTR YGHQSFVIRW RKPKSTSGYS HSHSEPEESI VVLEEEEESV DPLDGLNEPQ VQNDNGDCFL LTDECIGLVQ SAFPDETEHF LHEKKLRESK KKNVSEEETA TPRATTMGVQ RSITDFYRSA KKAAAGQSIE TGGSSKASAE KKRQATSTSS SNLTKSREQK KFIGLIGNDE CGNNISHHQL QLSVAVMDMR RQFAVKASVG GNICRVITTE DREGATVLAI EKDPHMVDLV SERFAGSDKF KVLQEDFVKC HIRSHMLSIL ETRRLSHPDS ALAKDEAALR LVEPALRTSE YRPINILINF YSAKTCFRLF LRFLTRKHLN RIPKQVNSAF NGKRKMLRKS LQHISSSPDI EKALGVAGLP ATSYTKPPEN RTVTATRRGR YNVAVQAVDC RKTNVTNIHT VRNQGNFSVY EMIFVNC // ID NC003070_68 HYPOTHETICAL; PRT; 756 AA. AC NC003070_68; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[310386...310981, 311337...313011]; Length: DE 2271. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 198 FIRST EXON; p-value: NaN. FT GENSCAN 199 199 AA on splice site: ag/g -> R. FT GENSCAN 200 756 LAST EXON; p-value: NaN. SQ SEQUENCE 756 AA; 81001 MW; 69E13DF1687420A7 CRC64; MFFRSFIVFF FLIFFASNVS SRKQTYVIHT VTTSTKHIVT SLFNSLQTEN INDDDFSLPE IHYIYENAMS GFSATLTDDQ LDTVKNTKGF ISAYPDELLS LHTTYSHEFL GLEFGIGLWN ETSLSSDVII GLVDTGISPE HVSFRDTHMT PVPSRWRGSC DEGTNFSSSE CNKKIIGASA FYKGYESIVG KINETTDFRS TRDAQGHGTH TASTAAGDIV PKANYFGQAK GLASGMRFTS RIAAYKACWA LGCASTDVIA AIDRAILDGV DVISLSLGGS SRPFYVDPIA IAGFGAMQKN IFVSCSAGNS GPTASTVSNG APWLMTVAAS YTDRTFPAIV RIGNRKSLVG SSLYKGKSLK NLPLAFNRTA GEESGAVFCI RDSLKRELVE GKIVICLRGA SGRTAKGEEV KRSGGAAMLL VSTEAEGEEL LADPHVLPAV SLGFSDGKTL LNYLAGAANA TASVRFRGTA YGATAPMVAA FSSRGPSVAG PEIAKPDIAA PGLNILAGWS PFSSPSLLRS DPRRVQFNII SGTSMACPHI SGIAALIKSV HGDWSPAMIK SAIMTTARIT DNRNRPIGDR GAAGAESAAT AFAFGAGNVD PTRAVDPGLV YDTSTVDYLN YLCSLNYTSE RILLFSGTNY TCASNAVVLS PGDLNYPSFA VNLVNGANLK TVRYKRTVTN VGSPTCEYMV HVEEPKGVKV RVEPKVLKFQ KARERLSYTV TYDAEASRNS SSSSFGVLVW ICDKYNVRSP IAVTWE // ID NC003070_69 HYPOTHETICAL; PRT; 288 AA. AC NC003070_69; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[315831...315577, 315015...314891, DE 314802...314678, 314167...314087, 313995...313880, 313759...313595]; DE Length: 867. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 85 FIRST EXON; p-value: NaN. FT GENSCAN 86 126 INTERNAL EXON; p-value: NaN. FT GENSCAN 127 127 AA on splice site: aa/g -> K. FT GENSCAN 128 168 INTERNAL EXON; p-value: NaN. FT GENSCAN 169 169 AA on splice site: a/ag -> K. FT GENSCAN 170 195 INTERNAL EXON; p-value: NaN. FT GENSCAN 196 196 AA on splice site: g/at -> D. FT GENSCAN 197 234 INTERNAL EXON; p-value: NaN. FT GENSCAN 235 288 LAST EXON; p-value: NaN. SQ SEQUENCE 288 AA; 32215 MW; EA95680F83175CD4 CRC64; MAADLPEATV QNILDQESLK WVFVGGKGGV GKTTCSSILA ICLASVRSSV LIISTDPAHN LSDAFQQRFT KSPTLVQGFS NLFAMEVDPT VETDDMAGTD GMDGLFSDLA NAIPGIDEAM SFAEMLKLVQ TMDYATIVFD TAPTGHTLRL LQFPATLEKG LSKLMSLKKR LVQELAKFEI DTHNIIINQV LYDDEDVESK LLRARMRMQQ KYLDQFYMLY DDFNITKLPL LPEEVTGVEA LKAFSHKFLT PYHPTTSRSN VEELERKVHT LRLQLKTAEE ELERVKSG // ID NC003070_70 HYPOTHETICAL; PRT; 446 AA. AC NC003070_70; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[316204...316257, 316344...316440, DE 316539...316713, 316792...316897, 317008...317090, 317167...317193, DE 317363...317394, 317553...317642, 318258...318433, 318565...318813, DE 318953...319101, 319204...319296, 319536...319545]; Length: 1341. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 18 FIRST EXON; p-value: NaN. FT GENSCAN 19 50 INTERNAL EXON; p-value: NaN. FT GENSCAN 51 51 AA on splice site: g/aa -> E. FT GENSCAN 52 108 INTERNAL EXON; p-value: NaN. FT GENSCAN 109 109 AA on splice site: cc/g -> P. FT GENSCAN 110 144 INTERNAL EXON; p-value: NaN. FT GENSCAN 145 171 INTERNAL EXON; p-value: NaN. FT GENSCAN 172 172 AA on splice site: ag/c -> S. FT GENSCAN 173 180 INTERNAL EXON; p-value: NaN. FT GENSCAN 181 181 AA on splice site: tg/c -> C. FT GENSCAN 182 191 INTERNAL EXON; p-value: NaN. FT GENSCAN 192 192 AA on splice site: a/aa -> K. FT GENSCAN 193 221 INTERNAL EXON; p-value: NaN. FT GENSCAN 222 222 AA on splice site: g/aa -> E. FT GENSCAN 223 280 INTERNAL EXON; p-value: NaN. FT GENSCAN 281 363 INTERNAL EXON; p-value: NaN. FT GENSCAN 364 412 INTERNAL EXON; p-value: NaN. FT GENSCAN 413 413 AA on splice site: aa/g -> K. FT GENSCAN 414 443 INTERNAL EXON; p-value: NaN. FT GENSCAN 444 444 AA on splice site: ag/a -> R. FT GENSCAN 445 446 LAST EXON; p-value: NaN. SQ SEQUENCE 446 AA; 50791 MW; 3ACBB76F545AFF4B CRC64; MAISEEEAKL ERFLDWLQVN GGELRGCNIK YSDSLKGFGI FASTSTQASD EVLLVVPLDL AITPMRVLQD PLLGPECQKM FEQGQVDDRF LMILFLTLER LRINSSWKPY LDMLPTRFGN PLWFSDDDIL ELKGTNLYHA TELQKKKLLS LYHDKVEVLV TKLLILDGDS ESKVSFEHFL CQDDTGECTS TKIQAQPAPS VGSGDTIWVE GLVPGIDFCN HEVLRMMTRN ITFKIKEMLV NFVLTSVVTF NNGFIQVHYP VEAIPSIPFS DSKGQLLEAQ NAQLRCLLPK SVLNHGFFPR TTSVIRESDE KETVRSCNFS WSGKRKMPTY MNKLVFPEDF MTGLRTIAMQ EEEIYKVSAM LEEIMRQKYC FPEKLVESRQ GEQPSETEVR MAVWEACGDS GALQLLVDLL NSKMMKLEEN SGTEEQDARL LEEACVLESH EESRLF // ID NC003070_71 HYPOTHETICAL; PRT; 560 AA. AC NC003070_71; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[322809...322513, 322406...321958, DE 321865...321751, 321422...320769, 320457...320380, 320296...320207]; DE Length: 1683. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 99 FIRST EXON; p-value: NaN. FT GENSCAN 100 248 INTERNAL EXON; p-value: NaN. FT GENSCAN 249 249 AA on splice site: ag/a -> R. FT GENSCAN 250 287 INTERNAL EXON; p-value: NaN. FT GENSCAN 288 505 INTERNAL EXON; p-value: NaN. FT GENSCAN 506 531 INTERNAL EXON; p-value: NaN. FT GENSCAN 532 560 LAST EXON; p-value: NaN. SQ SEQUENCE 560 AA; 63092 MW; 9CE88350F5D0BF6B CRC64; MATTGAAATT EIGNPEYKQP RSIFDLTADF FDSCRLSNPS ETQSAFGPPK TFDPEEAEED KSSKDGVILD RWTCNTCKIE FLSLQDQRYH FKSDIHRLNI KLSVAGKAIL KEEDVDELTS ESVQDYDVSS ISGSEDEAET RPPSFHFDAQ KGIDKKKLFF RLQSGDKVSI WKCLIMDDAE SVSFENDRGV SVDCCGSLVE NEVTERLRNL IRENKDDRQM RVVLLASGGH FAGTVFNGKS VVAHKTFHRY VVRAKAGKKQ STKDASGRSI HSAGASLRRY NELALKKDIQ ELLASWKPYF DGAACVFVHA PSSSRQLLFN GGKPYFSSQN CAVRNVPFTI RRPTFKESQR IYNQLTQIAH VTEEIFVNRP EVTKANTVVQ THNEDSGKTS RKEEPDETSS SNIILEEPNR IEEDIEDGVT GTSTALHEAA KSGDCERVME FLEEGMDPCA KDERGRTPYM LANEKEVRNT FRRFMALNLE KWNWHDAKVP SPLTKEMEES QAAKQAEKDA KQKARTKELK KLRKAREKKA QAEAAQAEKE KPISKVVSFT NPNLYNLISV // ID NC003070_72 HYPOTHETICAL; PRT; 943 AA. AC NC003070_72; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[323082...323084, 323208...323282, DE 323679...323840, 324258...324416, 324636...324714, 325574...325641, DE 326106...326228, 326332...326517, 326594...326815, 326931...327084, DE 327237...327386, 327488...327576, 327663...327866, 328000...328107, DE 328188...328322, 328424...328555, 328898...329030, 329119...329211, DE 329311...329436, 329573...329698, 329781...329924, 330243...330403]; DE Length: 2832. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 1 FIRST EXON; p-value: NaN. FT GENSCAN 2 26 INTERNAL EXON; p-value: NaN. FT GENSCAN 27 80 INTERNAL EXON; p-value: NaN. FT GENSCAN 81 133 INTERNAL EXON; p-value: NaN. FT GENSCAN 134 159 INTERNAL EXON; p-value: NaN. FT GENSCAN 160 160 AA on splice site: g/ga -> G. FT GENSCAN 161 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 223 INTERNAL EXON; p-value: NaN. FT GENSCAN 224 285 INTERNAL EXON; p-value: NaN. FT GENSCAN 286 359 INTERNAL EXON; p-value: NaN. FT GENSCAN 360 410 INTERNAL EXON; p-value: NaN. FT GENSCAN 411 411 AA on splice site: g/gc -> G. FT GENSCAN 412 460 INTERNAL EXON; p-value: NaN. FT GENSCAN 461 461 AA on splice site: g/gc -> G. FT GENSCAN 462 490 INTERNAL EXON; p-value: NaN. FT GENSCAN 491 558 INTERNAL EXON; p-value: NaN. FT GENSCAN 559 594 INTERNAL EXON; p-value: NaN. FT GENSCAN 595 639 INTERNAL EXON; p-value: NaN. FT GENSCAN 640 683 INTERNAL EXON; p-value: NaN. FT GENSCAN 684 727 INTERNAL EXON; p-value: NaN. FT GENSCAN 728 728 AA on splice site: g/tg -> V. FT GENSCAN 729 758 INTERNAL EXON; p-value: NaN. FT GENSCAN 759 759 AA on splice site: g/ag -> E. FT GENSCAN 760 800 INTERNAL EXON; p-value: NaN. FT GENSCAN 801 801 AA on splice site: g/aa -> E. FT GENSCAN 802 842 INTERNAL EXON; p-value: NaN. FT GENSCAN 843 843 AA on splice site: g/ac -> D. FT GENSCAN 844 890 INTERNAL EXON; p-value: NaN. FT GENSCAN 891 891 AA on splice site: g/ag -> E. FT GENSCAN 892 943 LAST EXON; p-value: NaN. SQ SEQUENCE 943 AA; 104340 MW; EA51C7689920DA76 CRC64; MSVTLHTNLG DIKCEIFCDE VPKSAENFLA LCASGYYDGT IFHRNIKGFM IQGGDPKGTG KGGTSIWGKK FNDEIRDSLK HNARGMLSMA NSGPNTNGSQ FFITYAKQPH LNGLYTIFGK VIHGFEVLDI MEKTQTGPGD RPLAEIRLNR VTIHANPLAG FLPRRLLPVG APLLLSEPQT MELKRLKLRK NNWDTETYEF DEVLTEAASQ KRVYEVVAKP VVESVLEGYN GTVMAYGQTG TGKTFTLGRL GDEDTAARGI MVRSMEDIIG GTSLDTDSIS VSYLQLYMET IQDLLDPTND NIAIVEDPRT GDVSLPGATH VEIRNQQNFL ELLQLGETHR VAANTKLNTE SSRSHAILMV HVKRSVVENE FPVSNEMESS SHFVRPSKPL VRRSKLVLVD LAGSERVHKS GSEGHMLEEA KSINLSLSAL GKCINAIAEN SPHVPLRDSK LTRLLRDSFG GTARTSLIVT IGPSPRHRGE TTSTILFGQR AMKVENMLKI KEEFDYKSLS KKLEVQLDKV IAENERQLKA FDDDVERINR QAQNRISEVE KNFAEALEKE KLKCQMEYME SVKKLEEKLI SNQRNHENGK RNGEVNGVVT ASEFTRLKES LENEMKLRKS AEEEVSKVKS QSTLKTRSGE GEDAGITRLQ KLLEDEALQK KKLEEEVTIL RSQLVQLTFE ADQMRRCLDR GAPGNSYSGT DSLPSRHSQA RESVNGQKAP FATLCEQVGL QKILQLLESD DANIRIHAVK VVANLAAEEA NQEKIVEAGG LTSLLMLLRS YEDETVRRVA AGAIANLAMN EVSQQLIVDQ GGISLLSLTA ADAEDPQTLR MVAGAIANLC GNDKLQARLW SDGGIKALLG MVRCGHPDVL AQVARGIANF AKCESRATTQ EVNAKEMISG GALWELVRIS KECSREDIRS LAHRTLSSSP VFRSEIRRLG IQF // ID NC003070_73 HYPOTHETICAL; PRT; 1657 AA. AC NC003070_73; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[337582...336417, 336062...335916, DE 335773...335419, 335321...334899, 334763...334471, 334178...333905, DE 333628...333129, 333045...332724, 332633...332276, 332183...331534, DE 331423...331074, 331050...330915]; Length: 4974. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 388 FIRST EXON; p-value: NaN. FT GENSCAN 389 389 AA on splice site: ag/a -> R. FT GENSCAN 390 437 INTERNAL EXON; p-value: NaN. FT GENSCAN 438 438 AA on splice site: ag/g -> R. FT GENSCAN 439 556 INTERNAL EXON; p-value: NaN. FT GENSCAN 557 697 INTERNAL EXON; p-value: NaN. FT GENSCAN 698 794 INTERNAL EXON; p-value: NaN. FT GENSCAN 795 795 AA on splice site: ga/g -> E. FT GENSCAN 796 886 INTERNAL EXON; p-value: NaN. FT GENSCAN 887 1052 INTERNAL EXON; p-value: NaN. FT GENSCAN 1053 1053 AA on splice site: gc/g -> A. FT GENSCAN 1054 1160 INTERNAL EXON; p-value: NaN. FT GENSCAN 1161 1279 INTERNAL EXON; p-value: NaN. FT GENSCAN 1280 1280 AA on splice site: g/gt -> G. FT GENSCAN 1281 1496 INTERNAL EXON; p-value: NaN. FT GENSCAN 1497 1612 INTERNAL EXON; p-value: NaN. FT GENSCAN 1613 1613 AA on splice site: ag/g -> R. FT GENSCAN 1614 1657 LAST EXON; p-value: NaN. SQ SEQUENCE 1657 AA; 184659 MW; AEB5571AA04D20EF CRC64; MASTEVDSRL GRVVIPALDK VIKNASWRKH SKLAHECKSV IERLRSPENS SPVADSESGS SIPGPLHDGG AAEYSLAESE IILSPLINAS STGVLKIVDP AVDCIQKLIA HGYVRGEADP TGGPEALLLS KLIETICKCH ELDDEGLELL VLKTLLTAVT SISLRIHGDS LLQIVRTCYG IYLGSRNVVN QATAKASLVQ MSVIVFRRME ADSSTVPIQP IVVAELMEPM DKSESDPSTT QSVQGFITKI MQDIDGVFNS ANAKGTFGGH DGAFETSLPG TANPTDLLDS TDKDMLDAKY WEISMYKSAL EGRKGELADG EVEKDDDSEV QIGNKLRRDA FLVFRALCKL SMKTPPKEDP ELMRGKIVAL ELLKILLENA GAVFRTSDRV LENVAQPDFQ QKMIVLRFLD KLCVDSQILV DIFINYDCDV NSSNIFERMV NGLLKTAQGV PPGTVTTLLP PQEAAMKLEA MKCLVAVLRS MGDWVNKQLR LPDPYSAKML EIVDRNLEEG SHPVENGKGD GGHGGFERSD SQSELSSGNS DALAIEQRRA YKLELQEGIS IFNQKPKKGI EFLIKANKVG DSPEEIAAFL KDASGLNKTL IGDYLGERED LSLKVMHAYV DSFEFQGMEF DEAIRAFLRG FRLPGEAQKI DRIMEKFAER FCKCNPKDFS SADTAYVLAY SVILLNTDAH NPMVKSKMTA DGFIRNNRGI DDGKDLPEEY LRALYERISR NEIKMKDDGL GPQQKQPTNS SRLLGLDTIL NIVVPRRGDD MNMETSDDLI RHMQERFKEK ARKSESVYYA ASDVIILRFM VEVCWAPMLA AFSVPLDQSD DAVITTLCLE GFHHAIHVTS VMSLKTHRDA FVTSLAKFTS LHSPADIKQK NIEAIKAIVK LAEEEGNYLQ DAWEHILTCV SRFEHLHLLG EGAPPDATFF AFPQTESGNS PLAKPNSVPA IKERAPGKLQ YAASAMIRGS YDGSGVAGKA SNTVTSEQMN NLISNLNLLE QVGDMSRIFT RSQRLNSEAI IDFVKALCKV SMDELRSPSD PRVFSLTKIV EIAHYNMNRI RLVWSSIWHV LSDFFVTIGC SDNLSIAIFA MDSLRQLSMK FLEREELANY NFQNEFMKPF VVVMRKSGAV EIRELIIRCV SQMVLSRVDN VKSGWKSMFM IFTTAAHDAH KNIVFLSFEM VEKIIRDYFP HITETETTTF TDCVNCLVAF TNCKFEKDIS LQAIAFLQYC ARKLAEGYVG SSLRRNPPLS PQGGKIGKQD SGKFLESDEH LYSWFPLLAG LSELSFDPRA EIRKVALKVL FDTLRNHGDH FSLALWERVF ESVLFRIFDY VRQDVDPSED DSTDQRGYNG EVDQESWLYE TCSLALQLVV DLFVNFYKTV NPLLKKVLML FVSLIKRPHQ SLAGAGIAAL VRLMRDVGHQ FSNEQWLEVV SCIKEAADAT SPDFSYVTSE DLMEDVSNED ETNDNSNDAL RRRNRQLHAV VTDAKSKASI QIFVIQAVTD IYDMYRMSLT ANHMLMLFDA MHGIGSNAHK INADLLLRSK LQELGSSLES QEAPLLRLEN ESFQTCMTFL DNLISDQPVG YNEAEIESHL ISLCREVLEF YINISCSKEQ SSRKKERANS AGSASGGCNS DIGKHGRVTV QEELAGVVSI DSDVDKL // ID NC003070_74 HYPOTHETICAL; PRT; 381 AA. AC NC003070_74; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[337942...337985, 338666...338987, DE 339126...339905]; Length: 1146. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 14 FIRST EXON; p-value: NaN. FT GENSCAN 15 15 AA on splice site: cg/g -> R. FT GENSCAN 16 122 INTERNAL EXON; p-value: NaN. FT GENSCAN 123 381 LAST EXON; p-value: NaN. SQ SEQUENCE 381 AA; 42608 MW; 873F0906EC3A50BB CRC64; MTNSVSVDRP GGLPRLCSCK CNASLAIGEV VEKEDAEQSR SFNWADVGLN LTEEQDEAIT RIPIKMSKRC QALMRQIICF SPEKGSFCDL LGAWLRRMNP IRADWLSILK ELKNLDSPFY IKVAEFSLLQ DSFEANARDY TKIIHYYGKL NQVEDAERTL LSMKNRGFLI DQVTLTAMVQ LYSKAGCHKL AEETFNEIKL LGEPLDYRSY GSMIMAYIRA GVPEKGESLL REMDSQEICA GREVYKALLR DYSMGGDAEG AKRVFDAVQI AGITPDVKLC GLLINAYSVS GQSQNARLAF ENMRKAGIKA TDKCVALVLA AYEKEEKLNE ALGFLVELEK DSIMLGKEAS AVLAQWFKKL GVVEEVELLL REFSSSQSQP L // ID NC003070_75 HYPOTHETICAL; PRT; 546 AA. AC NC003070_75; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[342397...342273, 341889...340374]; Length: DE 1641. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 41 FIRST EXON; p-value: NaN. FT GENSCAN 42 42 AA on splice site: tg/c -> C. FT GENSCAN 43 546 LAST EXON; p-value: NaN. SQ SEQUENCE 546 AA; 60762 MW; 18A3FCE4F71CD777 CRC64; MGLAQVQALA SRYATTIPYL ISEEYFIILV NGLSLLTKLS HCNQTGAPPE KLCDVVLPQS SASFTPTLRA YIRNARFNTS TSPKPLLVIA ARSECHVQAT VLCTKSLNFQ LKTRSGGHDY DGVSYISNRP FFVLDMSYLR NITVDMSDDG GSAWVGAGAT LGEVYYNIWQ SSKTHGTHGF PAGVCPTVGA GGHISGGGYG NMIRKYGLSV DYVTDAKIVD VNGRILDRKS MGEDLFWAIG GGGGASFGVI LSFKIKLVPV PPRVTVFRVE KTLVENALDM VHKWQFVAPK TSPDLFMRLM LQPVTRNTTQ TVRASVVALF LGKQSDLMSL LTKEFPELGL KPENCTEMTW IQSVMWWANN DNATVIKPEI LLDRNPDSAS FLKRKSDYVE KEISKDGLDF LCKKLMEAGK LGLVFNPYGG KMSEVATTAT PFPHRKRLFK VQHSMNWKDP GTDVESSFME KTRSFYSYMA PFVTKNPRHT YLNYRDLDIG INSHGPNSYR EAEVYGRKYF GENFDRLVKV KTAVDPENFF RDEQSIPTLP TKPSSS // ID NC003070_76 HYPOTHETICAL; PRT; 245 AA. AC NC003070_76; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[344199...343462]; Length: 738. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 245 SINGLE EXON; p-value: NaN. SQ SEQUENCE 245 AA; 27915 MW; 9075D4C53D0FF1A6 CRC64; MGSLVKAYYC NQMHHNSIIP HFHHYRPSSL QNPISLFAIT SPSSSEPPSP PVKHPLPQFG GGGHFIIPTI AVAASAWFFF RLHQYPPIIT SPVDLHLDLE EEGAIKELPL ESKPGYVKAL HFYKIKPGTV LKLLDVFDSD SYDSLKARIR LSAEWLETAR RELEEVVERD PGRVMEYSQV VDELMEILRD MEVYIDKCQK DNVKGYLRSC NRLLARVRRM EAQILNVLKE FHDDHDQGGG GGDTY // ID NC003070_77 HYPOTHETICAL; PRT; 883 AA. AC NC003070_77; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[346097...347304, 348735...348785, DE 348880...349099, 349238...349366, 349550...349720, 350226...350296, DE 350474...350570, 350658...350783, 350871...350966, 351044...351160, DE 351252...351329, 351412...351699]; Length: 2652. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 402 FIRST EXON; p-value: NaN. FT GENSCAN 403 403 AA on splice site: tg/c -> C. FT GENSCAN 404 419 INTERNAL EXON; p-value: NaN. FT GENSCAN 420 420 AA on splice site: aa/g -> K. FT GENSCAN 421 493 INTERNAL EXON; p-value: NaN. FT GENSCAN 494 536 INTERNAL EXON; p-value: NaN. FT GENSCAN 537 593 INTERNAL EXON; p-value: NaN. FT GENSCAN 594 616 INTERNAL EXON; p-value: NaN. FT GENSCAN 617 617 AA on splice site: ag/g -> R. FT GENSCAN 618 649 INTERNAL EXON; p-value: NaN. FT GENSCAN 650 691 INTERNAL EXON; p-value: NaN. FT GENSCAN 692 723 INTERNAL EXON; p-value: NaN. FT GENSCAN 724 762 INTERNAL EXON; p-value: NaN. FT GENSCAN 763 788 INTERNAL EXON; p-value: NaN. FT GENSCAN 789 883 LAST EXON; p-value: NaN. SQ SEQUENCE 883 AA; 99480 MW; EF18209EAA0E106B CRC64; MMDKSPFFLH RTRWQSSVAK LAFWSLVFFG LLFIFFYRSP ISNPDSSRRS LRTYSWGGPA WEKRVRSSAR VRTRNGVSVL VTGAAGFVGT HVSAALKRRG DGVLGLDNFN DYYDTSLKRS RQALLERSGV FIVEGDINDL SLLKKLFEVV PFTHVMHLAA QAGVRYAMEN PGSYVHSNIA GFVNLLEVCK SANPQPAIVW ASSSSVYGLN TKVPFSEKDR TDQPASLYAA TKKAGEEIAH TYNHIYGLSL TGLRFFTVYG PWGRPDMAYF FFTRDILKGK AISIFEGANH GTVARDFTYI DDIVKGCLGA LDTAEKSTGS GGKKRGAAQL RVFNLGNTSP VPVTDLVSIL ERLLKVKAKR NMMKLPRNGD VPFTHANISS AQREFGYKPS TDLQTGLKKF VRCIVMFLSD MSGREPLYRK AFIFFSSTIP KELVNHIKSD SSVLPRIGAL REVFLFYAII PSLFTYSFIA FWFLNFLLLS IQMNMEYFPI DNQGFLTDHE QALETLYAED AENSRHFHIC LNIMATRIAT VFASLKELPF VRYRAAKSTA SRDLVPSKLA AAIWDCISKY KAIPNFPQTE TCELLIVDRS VDQASERLHE KMTNFASKNK AAQMRSRDGS ELSTRDLQKI VQALPQYGEQ VDKLSTHVEL AGKINRIIRD TGLRDLGQLE QDLVFGDAGA KDVINFLRTN QDTNPENKLR LLMIYATVYP EKFEGDKGVK LMQLARLSPV DMKVISNMQL IAGSPENKAK SGSFSLKFDA GKTKQANRKD RSGEEETWQL FRFYPMIEEL LEKLVKGDLS KSDYLCMNQS SHKEESEART GSVRKSSAPT AVPERKATPH SMRSRRTATW ARPHSSDDGY SRHSILHNDF LFFGPFERSL DFF // ID NC003070_78 HYPOTHETICAL; PRT; 953 AA. AC NC003070_78; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[356188...355398, 354987...353238, DE 353162...352842]; Length: 2862. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 263 FIRST EXON; p-value: NaN. FT GENSCAN 264 264 AA on splice site: tc/g -> S. FT GENSCAN 265 847 INTERNAL EXON; p-value: NaN. FT GENSCAN 848 953 LAST EXON; p-value: NaN. SQ SEQUENCE 953 AA; 106363 MW; 40628C5EE93DBAD2 CRC64; MEERHKCKLC WKSFANGRAL GGHMRSHMLI HPLPSQPESY SSSMADPGFV LQDRESETES SKKPSRKRSR LNRRSISSLR HQQSNEEGKS ETARAADIKI GVQELSESCT EQEPMSSVSD AATTEEDVAL SLMLLSRDKW EKEEEESDEE RWKKKRNKWF ECETCEKVFK SYQALGGHRA SHKKKIAETD QLGSDELKKK KKKSTSSHHE CPICAKVFTS GQALGGHKRS HASANNEFTR RSGIIISLID LNLPAPSEEE EMASKENKGV IAVELFPKSQ REITGNFGYP LTGPSISEPS RYKDPLVSGF EMSNPRQRKG EAMMNPTISW RFASSTSLLS IPRTPKSAFI FAMTFSSSSS SSSSSSSVEN PNKDDSSSSL ELVLKYHNQT KHSLNGYARG PRGLDWANQP NPFRRYLSAP LLPLQHPNHD IDDDSDSPLY STLFDSLPPP KPISLPTISH LFYHSLALSA WKTTGSSTWP LRVNPSSGNL HPTEAYLIAP PIPSLSQSAF VSHYAPKEHS LEVRAHIPSS FFPNFFPENS FLIGISSIFW REAWKYGERA FRYCNHDVGH AIAALSIAAA DLGWDLKLLD AFGADDLKRL MGLPEFQLPE GKGKAELPEI EFEHPDCLLL VFPNGTSREH LNLDYLAISS ALRDFPSLEW TGNPNTLSKE HLCWDIIYRT AKAVEKPPLI YSTSSSSIDV ASFTSSRALF SHSSYNKLTV RQVVRTRRSA VDMDAVTCID MSSFYQMLMH CLPSTGESQK EQLALPFRAL PWDTAEVHLA LFVHRVSGLP KGLYLLVRNE DHLSDLKTAT RPEFEWTKPD GCPDNLPLYK LAEGDCQRLA KGLSCHQDIA GDGCFSLGMI ARFEPALREK GSWMYPRLFW ETGVVGQVLY LEAHAMGISA TGIGCYFDDP VHEVLGINDS SFQSLYHFTV GGPVVDKRIM TLPAYPGPTT TVA // ID NC003070_79 HYPOTHETICAL; PRT; 263 AA. AC NC003070_79; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[358895...358104]; Length: 792. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 263 SINGLE EXON; p-value: NaN. SQ SEQUENCE 263 AA; 29646 MW; EE102F508C643016 CRC64; MNSPCIDFVM FSRGQHDEDN MSRRPPWKRE RSMSTQHHHL NLSPNEDEEL ANCLVLLSNS GDAHGGDQHK QHGHGKGKTV KKQKTAQVFQ CKACKKVFTS HQALGGHRAS HKKVKGCFAS QDKEEEEEEE YKEDDDDNDE DEDEEEDEED KSTAHIARKR SNAHECTICH RVFSSGQALG GHKRCHWLTP SNYLRMTSLH DHHHSVGRPQ PLDQPSLDLN LACQEYSVDP TAMSVGMIER DGGGNNHNAT SSSWLKLASG DWS // ID NC003070_80 HYPOTHETICAL; PRT; 390 AA. AC NC003070_80; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[360426...360240, 360149...359164]; Length: DE 1173. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 62 FIRST EXON; p-value: NaN. FT GENSCAN 63 63 AA on splice site: t/gc -> C. FT GENSCAN 64 390 LAST EXON; p-value: NaN. SQ SEQUENCE 390 AA; 43043 MW; CF78FCBA72429D15 CRC64; MNGVEKLSSK STRRVANAGK ATLLALGKAF PSQVVPQENL VEGFLRDTKC DDAFIKEKLE HLCKTTTVKT RYTVLTREIL AKYPELTTEG SPTIKQRLEI ANEAVVEMAL EASLGCIKEW GRPVEDITHI VYVSSSEIRL PGGDLYLSAK LGLRNDVNRV MLYFLGCYGG VTGLRVAKDI AENNPGSRVL LTTSETTILG FRPPNKARPY DLVGAALFGD GAAAVIIGAD PRECEAPFME LHYAVQQFLP GTQNVIEGRL TEEGINFKLG RDLPQKIEEN IEEFCKKLMG KAGDESMEFN DMFWAVHPGG PAILNRLETK LKLEKEKLES SRRALVDYGN VSSNTILYVM EYMRDELKKK GDAAQEWGLG LAFGPGITFE GLLIRSLTSS // ID NC003070_81 HYPOTHETICAL; PRT; 710 AA. AC NC003070_81; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[363050...360918]; Length: 2133. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 710 SINGLE EXON; p-value: NaN. SQ SEQUENCE 710 AA; 80833 MW; 6B545411C7EA88FB CRC64; MVSSVPKLHA LFVSKSQPVL RAAKVTNEER STKSKLARSL ARAVNSNPWS DELESSLSSL HPSQTISRTT VLQTLRLIKV PADGLRFFDW VSNKGFSHKE QSFFLMLEFL GRARNLNVAR NFLFSIERRS NGCVKLQDRY FNSLIRSYGN AGLFQESVKL FQTMKQMGIS PSVLTFNSLL SILLKRGRTG MAHDLFDEMR RTYGVTPDSY TFNTLINGFC KNSMVDEAFR IFKDMELYHC NPDVVTYNTI IDGLCRAGKV KIAHNVLSGM LKKATDVHPN VVSYTTLVRG YCMKQEIDEA VLVFHDMLSR GLKPNAVTYN TLIKGLSEAH RYDEIKDILI GGNDAFTTFA PDACTFNILI KAHCDAGHLD AAMKVFQEML NMKLHPDSAS YSVLIRTLCM RNEFDRAETL FNELFEKEVL LGKDECKPLA AAYNPMFEYL CANGKTKQAE KVFRQLMKRG VQDPPSYKTL ITGHCREGKF KPAYELLVLM LRREFVPDLE TYELLIDGLL KIGEALLAHD TLQRMLRSSY LPVATTFHSV LAELAKRKFA NESFCLVTLM LEKRIRQNID LSTQVVRLLF SSAQKEKAFL IVRLLYDNGY LVKMEELLGY LCENRKLLDA HTLVLFCLEK SQMVDIDTCN TVIEGLCKHK RHSEAFSLYN ELVELGNHQQ LSCHVVLRNA LEAAGKWEEL QFVSKRMATL RESDDCSVLE // ID NC003070_82 HYPOTHETICAL; PRT; 331 AA. AC NC003070_82; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[370947...370859, 370751...370476, DE 367179...366997, 366212...366089, 366074...365822, 364014...363944]; DE Length: 996. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 29 FIRST EXON; p-value: NaN. FT GENSCAN 30 30 AA on splice site: tc/t -> S. FT GENSCAN 31 121 INTERNAL EXON; p-value: NaN. FT GENSCAN 122 122 AA on splice site: ag/c -> S. FT GENSCAN 123 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 183 AA on splice site: tc/a -> S. FT GENSCAN 184 224 INTERNAL EXON; p-value: NaN. FT GENSCAN 225 308 INTERNAL EXON; p-value: NaN. FT GENSCAN 309 309 AA on splice site: g/tg -> V. FT GENSCAN 310 331 LAST EXON; p-value: NaN. SQ SEQUENCE 331 AA; 35972 MW; 18D5C0426153FE92 CRC64; MERVCCMCGD VGFSDKLFSC GHCRCRFQHS YCSNYYGQFA EPTEICDWCR SDDRKLSNVA RHGGSSSKKP SSSVKYENDF SNRSEYSPGH RIKHNNNRHD QVAKGVAGDG GGVTSPKTAT RSIIYHMSKA YNYPLEKNIE FEDEDADVDE DAVVWNCLCR KHSSDIVGGA LDDGDDDALT PESSALHPSA WHRGVLSEFA IPDSPGRDLL YSLLTKSSSA AEKYVQPDPV GREIHFCLGD EEVVGRSGSR ARSCLVVGEQ GGWRSGRVGM VDGIGGAVRS GGGGGPVQGV VVMVMMKKLV VAVGLTVEVY NIENRNAAPT TTYSDVHIVR C // ID NC003070_83 HYPOTHETICAL; PRT; 2110 AA. AC NC003070_83; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[373335...373405, 373606...373786, DE 373985...374189, 374272...374500, 374583...374922, 375125...375238, DE 375325...375453, 375808...376002, 376069...376251, 376352...376474, DE 376562...376815, 376925...377028, 377154...377305, 377391...377615, DE 377881...378036, 378504...378649, 378722...378854, 378943...379089, DE 379183...379230, 379305...379433, 379528...379770, 379866...380003, DE 380086...380174, 380270...380375, 380846...380998, 381150...381332, DE 381405...381634, 381730...381832, 381912...382040, 382373...382567, DE 382803...382929, 383030...383130, 383395...383529, 383608...383688, DE 384084...384230, 384318...384449, 384694...384840, 384908...384969, DE 385542...385623, 385713...385817, 385909...386043, 386133...386333, DE 387049...387093]; Length: 6333. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 23 FIRST EXON; p-value: NaN. FT GENSCAN 24 24 AA on splice site: ag/c -> S. FT GENSCAN 25 84 INTERNAL EXON; p-value: NaN. FT GENSCAN 85 152 INTERNAL EXON; p-value: NaN. FT GENSCAN 153 153 AA on splice site: g/ga -> G. FT GENSCAN 154 228 INTERNAL EXON; p-value: NaN. FT GENSCAN 229 229 AA on splice site: ag/g -> R. FT GENSCAN 230 342 INTERNAL EXON; p-value: NaN. FT GENSCAN 343 380 INTERNAL EXON; p-value: NaN. FT GENSCAN 381 423 INTERNAL EXON; p-value: NaN. FT GENSCAN 424 488 INTERNAL EXON; p-value: NaN. FT GENSCAN 489 549 INTERNAL EXON; p-value: NaN. FT GENSCAN 550 590 INTERNAL EXON; p-value: NaN. FT GENSCAN 591 674 INTERNAL EXON; p-value: NaN. FT GENSCAN 675 675 AA on splice site: ag/g -> R. FT GENSCAN 676 709 INTERNAL EXON; p-value: NaN. FT GENSCAN 710 710 AA on splice site: g/at -> D. FT GENSCAN 711 760 INTERNAL EXON; p-value: NaN. FT GENSCAN 761 835 INTERNAL EXON; p-value: NaN. FT GENSCAN 836 887 INTERNAL EXON; p-value: NaN. FT GENSCAN 888 935 INTERNAL EXON; p-value: NaN. FT GENSCAN 936 936 AA on splice site: ag/a -> R. FT GENSCAN 937 980 INTERNAL EXON; p-value: NaN. FT GENSCAN 981 1029 INTERNAL EXON; p-value: NaN. FT GENSCAN 1030 1045 INTERNAL EXON; p-value: NaN. FT GENSCAN 1046 1088 INTERNAL EXON; p-value: NaN. FT GENSCAN 1089 1169 INTERNAL EXON; p-value: NaN. FT GENSCAN 1170 1215 INTERNAL EXON; p-value: NaN. FT GENSCAN 1216 1244 INTERNAL EXON; p-value: NaN. FT GENSCAN 1245 1245 AA on splice site: ag/a -> R. FT GENSCAN 1246 1280 INTERNAL EXON; p-value: NaN. FT GENSCAN 1281 1331 INTERNAL EXON; p-value: NaN. FT GENSCAN 1332 1392 INTERNAL EXON; p-value: NaN. FT GENSCAN 1393 1468 INTERNAL EXON; p-value: NaN. FT GENSCAN 1469 1469 AA on splice site: ag/c -> S. FT GENSCAN 1470 1503 INTERNAL EXON; p-value: NaN. FT GENSCAN 1504 1546 INTERNAL EXON; p-value: NaN. FT GENSCAN 1547 1611 INTERNAL EXON; p-value: NaN. FT GENSCAN 1612 1653 INTERNAL EXON; p-value: NaN. FT GENSCAN 1654 1654 AA on splice site: a/ag -> K. FT GENSCAN 1655 1687 INTERNAL EXON; p-value: NaN. FT GENSCAN 1688 1732 INTERNAL EXON; p-value: NaN. FT GENSCAN 1733 1759 INTERNAL EXON; p-value: NaN. FT GENSCAN 1760 1808 INTERNAL EXON; p-value: NaN. FT GENSCAN 1809 1852 INTERNAL EXON; p-value: NaN. FT GENSCAN 1853 1901 INTERNAL EXON; p-value: NaN. FT GENSCAN 1902 1921 INTERNAL EXON; p-value: NaN. FT GENSCAN 1922 1922 AA on splice site: ag/t -> S. FT GENSCAN 1923 1949 INTERNAL EXON; p-value: NaN. FT GENSCAN 1950 1984 INTERNAL EXON; p-value: NaN. FT GENSCAN 1985 2029 INTERNAL EXON; p-value: NaN. FT GENSCAN 2030 2096 INTERNAL EXON; p-value: NaN. FT GENSCAN 2097 2110 LAST EXON; p-value: NaN. SQ SEQUENCE 2110 AA; 233913 MW; 319763767B65E391 CRC64; MYSKTIENPD ETSPRIMWAL KRVSYSRGST ENPRRKSPSP VVPGISDLGF PYLMTPSKVA GHTRFLLHSF HDSDVDSIAL QLSQLEKVVS LLFKHVLKLS NLATLLPHAL NDFELTQESV DDLTTTLNFS ISENIGFALA LTDFERLDAK TTGRNLLLAQ IEQLCANTGQ ILSSELIHSV LSFLRKSEDL SMHLDSFLQF LSSAQPRDDF SFALTPMLAQ QVHEAPVFRS MDFHTDSADN DLDAILAEID KEVSVGDLMG ELGCGFTADA QQCKEILSSF APLGEATISR IVGNVSRTCA DLEDNQTTFS TFTVALGSCI PTELPTPRSW NVDILVDTIK QLAPGISWRK VIENLDHDGF DIPNMESFSF FMRIYKAACK EPFPLDAVCG SVWKNMDGQL SFLKHAISAP PEVFTFMHSP RKLTAYNLIQ REVVSAILPV IITSPQDSGF IHNLWHQNAE LVLWGIIDAQ HLKADSMLRI IEICHELKQP LCVYVLIMAY LFQILSVVLE SVPVSSSIRL AVLASLRGLL DIENWLPNCL YMYKDLFAEE CLKFVKNVHF SESDDFRAKI FHPSDPLSDL HLEATTSLLK VLKAHDNAIT SSQLVEEIEK VNAAILDCNP KLQNGEAKDS SAPNAYGDDV EAEANAYFHQ MFSSHLSVDA MVQMLSRYKE SLVPREKLIF ECMIANLFEE YRFFPKYPER QLKIASILFD LYCCRDILLN LLRLIQISGS VIKHQLISSL TLGMALRLVL DSLRKPADSK MFLFGSKALE QFVNRLVELP QYCNHILQIS HLRSTHPELV TVIEQALSRI SSGNLESDAS VSHPGPSQSF PGNGELSGSG IGQPALQLSS PLQLQQKNEV PSVPSNEAKP LLPSLSTTSV DVSVNPKAPP SDVQDKVSFI INNISTTNIE SKGKEFAEIL PQQYYPWFAQ YMVMKRAMCL IPDRASIEPN FHDLYLKFLD KVDSKLLFKE ILQNTYENCK VLLGSELIKS SSEERSLLKN LGSWLGRLTI GRNYVLRARE IDPKSLIVEA YEKGLMIAVI PFTSKVLEPC QNSIAYQPPN PWTMAILGLL AEIYSMPNLK MNLKFDIEVL FKNLGVEMKE VVPTSLLKDR KREIDGNPDF SNKDPGVTQI SQPQMIPEPK TISPLKQIDL PLDVANSPNT DVPSKLLSQY VAPQRVYTNT LMDEEKVATL GLPEQLPSPQ GLFQSTPSPL FSISQQLSAA LPNIGNHVVI NQKLSAFGMH FPFQRVVPLA MDRAIKEIVS GIVQRSVCIA CQTTKELVLK EPLRTSISGH LRNSLQGLNI SNDALEQIVQ LVTNDNLDLG CAAIEQAATE KAIQTIDADI AQQLLLRRKH RDGAGSSFFD PNILSQNSVS FIPESLRPKP GHLSLSQQRV YEDFVQHPWQ KQSTQTSHGL SAASSSSGDV ALGSGYGPVS GKVASEFLSN AGNARMDMVS RPSDISVDGF ESSPVSLLSS QVDPAGDSSS LQFTKSLPTS ELNLAESSDA ATKETGTSLQ TLTSAATMER LGASNITQPS LSTRDALDKC QIVTQKAVIS EVPEIILRCI SRDEAAFAVA QKAFKALYEN ASSNLHVSAN LAILVAIRDV CKRVVKELTS WVIYSEEDRK LNKDITIGLI QRELLSLAEY NVHMAKHLDG GRNKTATDFA ISLLQSLVTE ESSVISELHS LVDALAKLAS KSGSSESLQQ LIDIIRNPVT NTAGLSDSST GNDNNDRQKD EKVACNTTNT EESTSLDYVE SDPAGFQNRV STLFKNWYQI CELPGANETA CSQYVLHLHQ TGLLKGDDTT ESFFRILLEL SVAHCISSED INSGAVQSPQ QPQSPSFLII DMYAKLVFSI LKIMADTVRF IQKDAEDKKT SLNSKPYFRL FINWLLDLCS LDPGTDGANF QVLTAFANAF HALQPLKIPA FSCIQMRNII LSSFPRNMRL PDPSTPNLKI DLLPEIVEAP CILSEVDAAL KAKQMKNDVD EYLTSRQQNS TFLSELKTKL LLSSSEASSA GTRYSVPLIN SLVLYTGMQA IQQLQAGETQ AQNVVALQMF KYLSMELDTE GRYLFLNAIA NQLRYPNNHT HYFSFIMLYL FFESDQEHVS VRIPMVGSSR // ID NC003070_84 HYPOTHETICAL; PRT; 173 AA. AC NC003070_84; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[389568...389410, 388940...388851, DE 388517...388477, 388406...388268, 387672...387580]; Length: 522. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 53 FIRST EXON; p-value: NaN. FT GENSCAN 54 83 INTERNAL EXON; p-value: NaN. FT GENSCAN 84 96 INTERNAL EXON; p-value: NaN. FT GENSCAN 97 97 AA on splice site: tt/g -> L. FT GENSCAN 98 143 INTERNAL EXON; p-value: NaN. FT GENSCAN 144 173 LAST EXON; p-value: NaN. SQ SEQUENCE 173 AA; 19642 MW; 11366FE7C7164152 CRC64; MDIEQKQAEI IDQLVKRAST CKSEALGPLI IEATSHPSLF AFSEILALPN VAQVLPYDTL MVELDVSNVR ELEDFLINEC MYAIQDFLPS ISNYKNLLNT SENLLISIQD KIKWADNMSE MDKKHRKEAE EGVEEVKKSL SMKGDVDIRG NKEMFGEPSG VMDYEEDGIR PKR // ID NC003070_85 HYPOTHETICAL; PRT; 315 AA. AC NC003070_85; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[389876...389943, 390037...390250, DE 390352...390399, 390483...390564, 390762...390864, 391003...391123, DE 391216...391263, 391458...391541, 391652...391831]; Length: 948. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 22 FIRST EXON; p-value: NaN. FT GENSCAN 23 23 AA on splice site: tt/g -> L. FT GENSCAN 24 94 INTERNAL EXON; p-value: NaN. FT GENSCAN 95 110 INTERNAL EXON; p-value: NaN. FT GENSCAN 111 137 INTERNAL EXON; p-value: NaN. FT GENSCAN 138 138 AA on splice site: g/ac -> D. FT GENSCAN 139 171 INTERNAL EXON; p-value: NaN. FT GENSCAN 172 172 AA on splice site: ag/t -> S. FT GENSCAN 173 212 INTERNAL EXON; p-value: NaN. FT GENSCAN 213 228 INTERNAL EXON; p-value: NaN. FT GENSCAN 229 256 INTERNAL EXON; p-value: NaN. FT GENSCAN 257 315 LAST EXON; p-value: NaN. SQ SEQUENCE 315 AA; 35623 MW; 1E8DEC86FFC9D3C3 CRC64; MAESRSNRAA VQATNDDASA SKLSCVKKGY MKDDYVHLFV KRPVRRSPII NRGYFSRWAA FRKLMSQFLL SGTSSKKQIL SLGAGFDTTY FQLLDEGNGP NLYVELDFKE VTSKKAAVIQ NSSQLRDKLG ANASISIDEG QVLSDHYKLL PVDLRDIPKL RDVISFADMD LSLPTFIIAE CVLIYLDPDS SRAIVNWSSK TFSTAVFFLY EQIHPDDAFG HQMIRNLESR GCALLSIDAS PTLLAKERLF LDNGWQRAVA WDMLKVYGSF VDTQEKRRSV LYVLLKLCKG VMLFIPSIWL NSVKKYLKNA VARDF // ID NC003070_86 HYPOTHETICAL; PRT; 703 AA. AC NC003070_86; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[392939...393875, 393966...394153, DE 394228...394451, 394672...395434]; Length: 2112. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 312 FIRST EXON; p-value: NaN. FT GENSCAN 313 313 AA on splice site: a/ag -> K. FT GENSCAN 314 375 INTERNAL EXON; p-value: NaN. FT GENSCAN 376 449 INTERNAL EXON; p-value: NaN. FT GENSCAN 450 450 AA on splice site: gg/a -> G. FT GENSCAN 451 703 LAST EXON; p-value: NaN. SQ SEQUENCE 703 AA; 79181 MW; 90747643158072AE CRC64; MGCTASKLDS EDAVRRCKER RRLMKDAVYA RHHLAAAHSD YCRSLRLTGS ALSSFAAGEP LSVSENTPAV FLRPSSSQDA PRVPSSHSPE PPPPPIRSKP KPTRPRRLPH ILSDSSPSSS PATSFYPTAH QNSTYSRSPS QASSVWNWEN FYPPSPPDSE YFERKARQNH KHRPPSDYDA ETERSDHDYC HSRRDAAEEV HCSEWGDDHD RFTATSSSDG DGEVETHVSR SGIEEEPVKQ PHQDPNGKEH SDHVTTSSDC YKTKLVVRHK NLKEILDAVQ DYFDKAASAG DQVSAMLEIG RAELDRSFSK LRKTVYHSSS VFSNLSASWT SKPPLAVKYK LDASTLNDEQ GGLKSLCSTL DRLLAWEKKL YEDVKAREGV KIEHEKKLSA LQSQEYKGGD ESKLDKTKTS ITRLQSLIIV SSEAVLTTSN AILRLRDTDL VPQLVELCHG LMYMWKSMHE YHEIQNNIVQ QVRGLINQTE RGESTSEVHR QVTRDLESAV SLWHSSFCRI IKFQREFICS LHAWFKLSLV PLSNGDPKKQ RPDSFALCEE WKQSLERVPD TVASEAIKSF VNVVHVISIK QAEEVKMKKR TESAGKELEK KASSLRSIER KYYQAYSTVG IGPGPEVLDS RDPLSEKKCE LAACQRQVED EVMRHVKAVE VTRAMTLNNL QTGLPNVFQA LTSFSSLFTE SLQTVCSRSY SIN // ID NC003070_87 HYPOTHETICAL; PRT; 556 AA. AC NC003070_87; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[395623...395946, 396029...396088, DE 396291...396356, 396439...396534, 396609...396725, 396793...396863, DE 396941...397098, 397703...397722, 397877...397964, 398292...398346, DE 398460...398526, 398614...398713, 399182...399393, 399484...399720]; DE Length: 1671. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 108 FIRST EXON; p-value: NaN. FT GENSCAN 109 128 INTERNAL EXON; p-value: NaN. FT GENSCAN 129 150 INTERNAL EXON; p-value: NaN. FT GENSCAN 151 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 221 INTERNAL EXON; p-value: NaN. FT GENSCAN 222 244 INTERNAL EXON; p-value: NaN. FT GENSCAN 245 245 AA on splice site: ag/g -> R. FT GENSCAN 246 297 INTERNAL EXON; p-value: NaN. FT GENSCAN 298 298 AA on splice site: t/tg -> L. FT GENSCAN 299 304 INTERNAL EXON; p-value: NaN. FT GENSCAN 305 333 INTERNAL EXON; p-value: NaN. FT GENSCAN 334 334 AA on splice site: g/gt -> G. FT GENSCAN 335 351 INTERNAL EXON; p-value: NaN. FT GENSCAN 352 352 AA on splice site: ag/c -> S. FT GENSCAN 353 374 INTERNAL EXON; p-value: NaN. FT GENSCAN 375 407 INTERNAL EXON; p-value: NaN. FT GENSCAN 408 408 AA on splice site: g/gg -> G. FT GENSCAN 409 478 INTERNAL EXON; p-value: NaN. FT GENSCAN 479 556 LAST EXON; p-value: NaN. SQ SEQUENCE 556 AA; 62443 MW; E6C2ECADA944FD66 CRC64; MGQESESPKE ADRSFLTQWP LTLTSYAAPN DRQTANNNTF PFLSLNMAML STASVSGSVD LPRGTMKVDS SASPEVVSDL PPSSPKGSPD RHDPSTSSPS PSRGGDNQSE VISKSEEYRQ LFRLPADEGH MYLFIHYICF YSNIFGYETK KIIPFAEISC VKRAKTAGIF PNAIEILAGG KKYFFASFLS RDEAFKLIHD GWLEYGSAVK SEGEILVTEP QVSDGVVKRA RSSMDLANEL DIPVRDETLH LSSSSSLPVI SQNGVPPSSV QRHAEPDVDV VAANTFNWKP EDTDAPKLSS DFTKIPVEEF FRLFFSDGAV SFVESFHKNC GDKGAKFGGC QESQKFRMYR NSHLVIETSQ EISDVPYADY FTVEGVWDLK RDCRDSVEGC ILDVYVNVAF SKRTVWKGNK LIEDGEPLAA REERVSECDE EGKVEMVGEG VVKKSLKEAW VNLTSFVKRQ SGTRQVIVLA FAVILLMQVT IVVLLKKGGG GQVEYHERYD EYSVNGETLG WLEKRMHFLR EEMMMVEDRL QRMRQDHAAL KAQFHHLERL LRRNKQ // ID NC003070_88 HYPOTHETICAL; PRT; 305 AA. AC NC003070_88; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[404401...404186, 403947...403771, DE 401508...401461, 401370...401323, 401235...401164, 401086...400931, DE 400828...400725, 400446...400350]; Length: 918. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 72 FIRST EXON; p-value: NaN. FT GENSCAN 73 131 INTERNAL EXON; p-value: NaN. FT GENSCAN 132 147 INTERNAL EXON; p-value: NaN. FT GENSCAN 148 163 INTERNAL EXON; p-value: NaN. FT GENSCAN 164 187 INTERNAL EXON; p-value: NaN. FT GENSCAN 188 239 INTERNAL EXON; p-value: NaN. FT GENSCAN 240 273 INTERNAL EXON; p-value: NaN. FT GENSCAN 274 274 AA on splice site: ag/a -> R. FT GENSCAN 275 305 LAST EXON; p-value: NaN. SQ SEQUENCE 305 AA; 34617 MW; B9DE11CABB4F7BE3 CRC64; MAAEEATEFY LRYYVGHKGK FGHEFLEFEF REDGKLRYAN NSNYKNDTII RKEVFLTPAV LKECKRIVSE SEILKEDDNN WPEPDRVGKQ ELEIVLGNEH ISFATSKIGS LVDCQSSNDP EGLRIFYYLV QDDSYVESYI STIGVDFKIR TVEQDGKTIK LQIWDTAGQE RFRTITSSYY RGAHGIIIVY DVTDEESFNN VKQWLSEIDR YASDNVNKLL VGNKSDLTEN RAIPYETAKA FADEIGIPFM ETSAKDATNV EQAFMAMSAS IKERMASQPA GNNARPPTVQ IRGQPVAQKN GCCST // ID NC003070_89 HYPOTHETICAL; PRT; 606 AA. AC NC003070_89; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[407493...407550, 407722...407821, DE 408073...408138, 408236...408337, 408859...409138, 409219...410433]; DE Length: 1821. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: NaN. FT GENSCAN 20 20 AA on splice site: g/gc -> G. FT GENSCAN 21 52 INTERNAL EXON; p-value: NaN. FT GENSCAN 53 53 AA on splice site: ag/a -> R. FT GENSCAN 54 74 INTERNAL EXON; p-value: NaN. FT GENSCAN 75 75 AA on splice site: aa/c -> N. FT GENSCAN 76 108 INTERNAL EXON; p-value: NaN. FT GENSCAN 109 109 AA on splice site: cg/a -> R. FT GENSCAN 110 202 INTERNAL EXON; p-value: NaN. FT GENSCAN 203 606 LAST EXON; p-value: NaN. SQ SEQUENCE 606 AA; 69751 MW; E3CE1584BE47DC95 CRC64; MASYYNYPSG YALKRLHQIG HPANVAGEEW VHIDTFGAMN GISRFCEDDF PWRYSKEEEI VVEELRNRNF TYLVNEHSSV DGYKCLFYEE GFERLELRRG FPPIVLVNRS PVVSVAALSK KTAAIVCSIS QVYGYGTVDY ERRPIVQWNA IYKKISLMEK PELGAASVLN QWEKAGRKLT KWELCRVVKE LRKYKRANQA LEVYDWMNNR GERFRLSASD AAIQLDLIGK VRGIPDAEEF FLQLPENFKD RRVYGSLLNA YVRAKSREKA EALLNTMRDK GYALHPLPFN VMMTLYMNLR EYDKVDAMVF EMKQKDIRLD IYSYNIWLSS CGSLGSVEKM ELVYQQMKSD VSIYPNWTTF STMATMYIKM GETEKAEDAL RKVEARITGR NRIPYHYLLS LYGSLGNKKE LYRVWHVYKS VVPSIPNLGY HALVSSLVRM GDIEGAEKVY EEWLPVKSSY DPRIPNLLMN AYVKNDQLET AEGLFDHMVE MGGKPSSSTW EILAVGHTRK RCISEALTCL RNAFSAEGSS NWRPKVLMLS GFFKLCEEES DVTSKEAVLE LLRQSGDLED KSYLALIDVD ENRTVNNSEI DAHETDALLT QLQDDL // ID NC003070_90 HYPOTHETICAL; PRT; 545 AA. AC NC003070_90; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[410803...410893, 411177...411253, DE 411906...412093, 412119...412258, 412460...412790, 412883...413003, DE 413044...413295, 413646...413835, 413855...413991, 414541...414651]; DE Length: 1638. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 30 FIRST EXON; p-value: NaN. FT GENSCAN 31 31 AA on splice site: t/gt -> C. FT GENSCAN 32 56 INTERNAL EXON; p-value: NaN. FT GENSCAN 57 118 INTERNAL EXON; p-value: NaN. FT GENSCAN 119 119 AA on splice site: cg/g -> R. FT GENSCAN 120 165 INTERNAL EXON; p-value: NaN. FT GENSCAN 166 166 AA on splice site: g/ag -> E. FT GENSCAN 167 275 INTERNAL EXON; p-value: NaN. FT GENSCAN 276 276 AA on splice site: ag/a -> R. FT GENSCAN 277 316 INTERNAL EXON; p-value: NaN. FT GENSCAN 317 400 INTERNAL EXON; p-value: NaN. FT GENSCAN 401 463 INTERNAL EXON; p-value: NaN. FT GENSCAN 464 464 AA on splice site: g/cc -> A. FT GENSCAN 465 509 INTERNAL EXON; p-value: NaN. FT GENSCAN 510 545 LAST EXON; p-value: NaN. SQ SEQUENCE 545 AA; 60120 MW; 754EC4CA5A923966 CRC64; MSTKGAAAAY PSAARISDSP CYLQYSASLK CLEEFGSDKS KCQDHFDVYK ECKKKEHLRS SDAGELLRLP DASPAPIRRP IYSLRSLPGC YSYRRPSHRP SSATFLRPFS ASPNPRASRK RAVICGISYR FSRHELKGCI NDAKCMRHLL INKFKFSPDS ILMLTEEETD PYRIPTKQNM RMALYWLVQG CTAGDSLVFH YSGHGSRQRN YNGDEVDGYD ETLCPLDFET QGMIVDDEIN ATIVRPLPHG VKLHSIIDAC HSGTVLDLPF LCRMNRAGQY VWEDHRPRSG LWKGTAGGEA ISISGCDDDQ TSADTSLVPP LMIKTHTQAL SKITSTGAMT FCFIQAIERS AQGTTYGSLL NSMRTTIRNT GNDGGGSGGV VTTVLSMLLT GGSAIGGLRQ TEGREEKSQK AVQKAHTNHV CVAAARKSTS AVAACSLFDT PAVIFQYKGG VFHTIAKEKT PKEAIPTPTS VCDLFLSTDN ALLSLKRMSE KATSGAAYLE LQPRIWPVML ELADFRKIEK NKKQKRRKFL KGDELDGDVD WSLNK // ID NC003070_91 HYPOTHETICAL; PRT; 932 AA. AC NC003070_91; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[415247...415670, 415742...415961, DE 416045...416435, 416743...416838, 416962...417162, 417248...417520, DE 419097...419247, 419875...419894, 420539...420658, 420984...421139, DE 421191...421391, 421526...421633, 421816...421908, 421981...422097, DE 424894...425028, 426368...426460]; Length: 2799. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 141 FIRST EXON; p-value: NaN. FT GENSCAN 142 142 AA on splice site: t/ct -> S. FT GENSCAN 143 214 INTERNAL EXON; p-value: NaN. FT GENSCAN 215 215 AA on splice site: tc/g -> S. FT GENSCAN 216 345 INTERNAL EXON; p-value: NaN. FT GENSCAN 346 377 INTERNAL EXON; p-value: NaN. FT GENSCAN 378 444 INTERNAL EXON; p-value: NaN. FT GENSCAN 445 535 INTERNAL EXON; p-value: NaN. FT GENSCAN 536 585 INTERNAL EXON; p-value: NaN. FT GENSCAN 586 586 AA on splice site: c/tt -> L. FT GENSCAN 587 592 INTERNAL EXON; p-value: NaN. FT GENSCAN 593 632 INTERNAL EXON; p-value: NaN. FT GENSCAN 633 684 INTERNAL EXON; p-value: NaN. FT GENSCAN 685 751 INTERNAL EXON; p-value: NaN. FT GENSCAN 752 787 INTERNAL EXON; p-value: NaN. FT GENSCAN 788 818 INTERNAL EXON; p-value: NaN. FT GENSCAN 819 857 INTERNAL EXON; p-value: NaN. FT GENSCAN 858 902 INTERNAL EXON; p-value: NaN. FT GENSCAN 903 932 LAST EXON; p-value: NaN. SQ SEQUENCE 932 AA; 106120 MW; E6DF4DF68C0180CD CRC64; MHSYVTAVDE EKDLSRLMIV VLMLWRIVHS QIWISVSRQR TAKGTNKIVD KPIEFEQVDR ERTWDDQVIF NTLLMYLANI KLPGASHLPP WRLDGAILMA LLHAGPVEFL YYWFHRALHH HFLYSRYHSH HHSSIVTEPI TSVVHPFAEH IAYTLLFAIP MVTASLCGIL SIVSIMGYIT YIDFMNNMGH CNFELFPKRL FHLFPPLKFL CYTPSFHSLH HTQFRTNYSL FMPIYDFIYG TTDNLTDSLY ERSLEIEEES PDVIHLTHLT THNSIYQMRL GFPSLSSCPL WSRPPWYLTC FMWPFTLLCS FALTSAIPLR TFVFERNRLR DLTVHSHLLP KFSFHRHHES INTIIEEAIL EADEKGVKVM SLGLMNNREE LNGSGEMYVQ KYPKLKIRLV DGSSMAATVV INNIPKEATE IVFRGNLTKV ASAVVFALCQ KGVKVVVLRE EEHSKLIKSG VDKNLVLSTS NSYYSPKVWL VGDGIENEEQ MKAKEGTLFV PFSHFPPNKL RKDCFYQSTP AMRVPKSAQN IDSCENSAQP GLDLSVPLLY VLGKETHRRQ GNRLQSGRQG DQLVRFSKFL TTCVPLPLLH PLFRTNYSLF MPLYDYIYGT MDESTDTLYE KTLERGDDIV DVYLLKWRKE AINNMIEKAI LEADKKGVKV LSLGLMNQVK KLSLTVLVLY WVDAGEELNR NGEVYIHNHP DMKVRLVDGS RLAAAVVINS VPKATTSVVM TGNLTKVAYT IASALCQRGV QVSTLRLDEY EKIRSCVPQE CRDHLVYLTS EALSSNKFPL KQLRRDCIYH TTPALIVPKS LVNVHSCENW LPRKAMSATR VAGILHALEG WEMHECGTSL LLSDLDQLMT SMKLKLSHSS SLPSPLNMIC SIAKLFFAAT ELVVVFTNLM RTSLEQGESS GLLLGENGHA DGRYTTEKSN VS // ID NC003070_92 HYPOTHETICAL; PRT; 3020 AA. AC NC003070_92; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[451977...450664, 450438...450151, DE 450046...449939, 449838...449714, 449489...449225, 449123...449055, DE 448973...448812, 445942...445844, 445752...445648, 445559...445454, DE 445066...444846, 444759...444683, 444597...444489, 444383...444258, DE 444089...443979, 443736...443629, 443525...443337, 442999...442906, DE 442620...442168, 442014...441398, 441325...441136, 441013...440761, DE 439592...439548, 439536...439406, 439314...439147, 438898...438783, DE 438679...438422, 438334...438240, 436788...436622, 436539...436277, DE 436128...435997, 435916...435650, 435563...435450, 433795...433068, DE 430571...430414, 430340...430069, 429997...429851, 429776...429462, DE 429370...429248, 429099...429018, 427840...427548]; Length: 9063. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 438 FIRST EXON; p-value: 0.000. FT GENSCAN 439 534 INTERNAL EXON; p-value: 0.000. FT GENSCAN 535 570 INTERNAL EXON; p-value: 0.000. FT GENSCAN 571 611 INTERNAL EXON; p-value: 0.000. FT GENSCAN 612 612 AA on splice site: gg/g -> G. FT GENSCAN 613 700 INTERNAL EXON; p-value: 0.000. FT GENSCAN 701 723 INTERNAL EXON; p-value: 0.000. FT GENSCAN 724 777 INTERNAL EXON; p-value: 0.000. FT GENSCAN 778 810 INTERNAL EXON; p-value: 0.000. FT GENSCAN 811 845 INTERNAL EXON; p-value: 0.000. FT GENSCAN 846 880 INTERNAL EXON; p-value: 0.000. FT GENSCAN 881 881 AA on splice site: g/gt -> G. FT GENSCAN 882 954 INTERNAL EXON; p-value: 0.000. FT GENSCAN 955 979 INTERNAL EXON; p-value: 0.000. FT GENSCAN 980 980 AA on splice site: gg/t -> G. FT GENSCAN 981 1016 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1017 1058 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1059 1095 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1096 1131 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1132 1194 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1195 1225 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1226 1226 AA on splice site: g/ga -> G. FT GENSCAN 1227 1376 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1377 1377 AA on splice site: g/cc -> A. FT GENSCAN 1378 1582 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1583 1645 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1646 1646 AA on splice site: g/tg -> V. FT GENSCAN 1647 1729 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1730 1730 AA on splice site: aa/t -> N. FT GENSCAN 1731 1744 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1745 1745 AA on splice site: cg/a -> R. FT GENSCAN 1746 1788 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1789 1789 AA on splice site: t/cc -> S. FT GENSCAN 1790 1844 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1845 1845 AA on splice site: g/tt -> V. FT GENSCAN 1846 1883 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1884 1969 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1970 2000 INTERNAL EXON; p-value: 0.000. FT GENSCAN 2001 2001 AA on splice site: aa/t -> N. FT GENSCAN 2002 2056 INTERNAL EXON; p-value: NaN. FT GENSCAN 2057 2057 AA on splice site: t/cc -> S. FT GENSCAN 2058 2144 INTERNAL EXON; p-value: NaN. FT GENSCAN 2145 2188 INTERNAL EXON; p-value: NaN. FT GENSCAN 2189 2277 INTERNAL EXON; p-value: NaN. FT GENSCAN 2278 2315 INTERNAL EXON; p-value: NaN. FT GENSCAN 2316 2557 INTERNAL EXON; p-value: NaN. FT GENSCAN 2558 2558 AA on splice site: ac/a -> T. FT GENSCAN 2559 2610 INTERNAL EXON; p-value: NaN. FT GENSCAN 2611 2611 AA on splice site: t/cg -> S. FT GENSCAN 2612 2701 INTERNAL EXON; p-value: NaN. FT GENSCAN 2702 2750 INTERNAL EXON; p-value: NaN. FT GENSCAN 2751 2855 INTERNAL EXON; p-value: NaN. FT GENSCAN 2856 2896 INTERNAL EXON; p-value: NaN. FT GENSCAN 2897 2923 INTERNAL EXON; p-value: NaN. FT GENSCAN 2924 2924 AA on splice site: g/cc -> A. FT GENSCAN 2925 3020 LAST EXON; p-value: NaN. SQ SEQUENCE 3020 AA; 341720 MW; 7A71CC825D70BABB CRC64; MNKNQTVGLD SEDDDSDDYD IAQVNCELAL VEGQLCNIPY ELYDLPDLTG ILSVETWNSL LTEEERFFLS CFLPDMDPQT FSLTMQELLD GANLYFGNPE DKFYKNLLGG LFTPKVACFK EGVMFVKRRK YYYSLKFYHE KLIRTFTEMQ RVWVQYGNKL GNYSRLLIWS GRTQTGNLKL LDLNRVPSKE MDSATCRFKT PNVVKPVERN RSKSLTFPRS GSSKNSLKIK ITKEGVFRYQ GSSLVSAGHH HQTLPKGVLK LVPKSSSAIL RKPYVAPGNN LLQIHETGSK STRFAASPYL GTTFEKPPYG TLGCSIPDPF LTYLETTQRS IQGTEPVFHD PTHTVPSALR VSNYSITEQN IPLKQEEYVR YHLKSPGFKP RTVDRGSETT ESKRISSSNN FQREAKALRK PLGVLSEDNH AREANSDHLF SLTYKRRKVT FFSFSLRNTS RELLPKTSNA PHTTVSSSKS PKTLSLSKLI IIIVITQRLF TKRIFLLKGL GVMGSLVREW VGFQQFPAAT QEKLIEFFGK LKQKDMNSMT VLVLGKGGVG KSSTVNSLIG EQVVRVSPFQ AEGLRPVMVS RTMGGFTINI IDTPGLVEAG YVNHQALELI KGFLVNRTID VLLYVDRLDV YRVDELDKQV VIAITQTFGK EIWCKTLLVL THAQFSPPDE LSYETFSSKR SDSLLKTIRA GSKMRKQEFE DSAIAVVYAE NSGRCSKNDK DEKALPNGEA WIPNLVKAIT DVATNQRKAI HVDKKMVDGS YSDDKGKKLI PLIIGAQRLN SNLASSMVES NISCTTFNIL APIYKRVDQK NHSTRESDFR TLWLARNQRI LDLLLHQRSS VICLQEVWVG NEELVNMYHH QLSSSGYTIY QLARTNSRGD GLLTAIHKDH FKVVNYRELL FNDFGDRVAQ LLHVKTVIPF PLNGKQDVQQ EVIIVNTHLL FPHDSSLSIV RLHQVYKILE YLEAYQKENK LNHMPIILCG DWNGSKRGHV YKFLRSQGFI SSYDDAHQYT DSDAHRWVSH RNHRGNICGV DFIWLCNPSD SRKPLRTSWV EAVFSIIKYQ LHKASIAEDD AFTFLGAKNH SDSLTYSDFC LALQKVNLTG IPHGLSFEET KELWVRADLD GNGVFDYEEL KKIWNMTMVN QPGNCKESVM ESKKEEGEDE AIGLKVNKAI LFPQEAEKGL WPENYNISDH ACLTDVAGQI VNFMKIEKGE EERSETANTT ERLLLGRFFY WESVNVLKFI RDLRVSNQIF DSSKSPTRVY KVHAFRSLIR ELSRNICKKM AMAPVIKLVL GSVAFAIFWI LAVFPSVPFL PIGRTAGSLF GAMLMVIFQV ITPEQAYAAI DLPILGLLFG TMVVSIYLER ADMFKYLGTL LSWKSRATSA NIGSSATPIG NPQNLVIAVQ SKIPFWEFLL GVFPAMIVGI TVNAMLLLGM YWRLLSDHKE DEEEVQNADS EVVAQEDVQS HRFSPATFSP VSSEDSNLRM DAAETLRNRA GSAGESELIS CNSNASREQH NDAESQGESN NTNNMFQTKR WRRVLWKSSV YFITLGMLIS LLMGLNMSWT AITAALALVV LDFKDARPSL EKVSYSLLIF FCGMFITVDG FNKTGIPTAL WDLMEPYAKI DQAKGIAVLA VVILVLSNVA SNVPTVLLLG ARVAASAMGR EEEKKAWLLL AWVSTVAGNL TLLGSAANLI VCEQARRAVS HGYTLTFTKH FKFGLPSTLI VTAIGLFLIN FDLNLFVAVE KWRIRFRPTD GEIVDIYLRP KNLESNTSHV DEVISTVDIC SFDPWDLPSH SRMKTRDQVW YFFGRKENKY GKGDRQIRKT KSGFWKKTGV TMDIMRKTGD REKIVKFKGE RREFSVATGS GIKHTHSLIP PTNNSGVLSV ETEGSLFHSQ ESQNPSQFSG FLDVDALDRD FCNILSDDFK GFFNDDDEQS KIVSMQDDRN NHTPQKPLTG VFSDHSTDGS DSDPISATTI SIQTLSTCPS FGSSNPLYQI TDLQESPNSI NRSRTMMNPV GFRFRPNDEE IVDHYLRPKN LDSDTSHVDE VISTVDICSF EPWDLPSKSM IKSRDGVWYF FSVKEMKYNR GDQQRRRTNS GFWKKTGKTM TVMRKRGNRE KIGEKRVLVF KNRDGSKTDW VMHEYHATSL FPNQMMTYTV CKVEFKGEET EISSSSTGSE IEQIHSLIPL VNSSGGSEGS SFHSQELQNS SQSGVFANVQ GESQIDDATT PIEEEWKTWL NNDGDEQRNI MFMQDHRSDY TPLKSLTGVF SDDSSDDNDS DLISPKTNSI GTSSTCASFA SSNHQIDQTQ HSPDSTVQLV SLTQEIWALS QIPTAPRIFP SNSIFTNLDH LFWRIPSGVD SASYPWIIWY IWKARNEKVF ENVDKDPMEI LLLSVKEAQS WQEAQVELHS EKHGSLSIDS RIRVRDVSQD TTFSGFRCFI DGSWKASDQF SGTGWFCLSS LGESPTMGAA NVRRSLSPLH TEMEALLWAM KCMIGADNQN VAFFTDCSDL VKMVSSPTEW PAFSVYLEEL QSDREEFTNF SLSLISRSAN VKADKLARKI RTVPHHVTEM ETPVGLRFCP TDEEIVVDYL WPKNSDRDTS HVDRFINTVP VCRLDPWELP SFSRFGMVGQ SRIKLKDVAW CFFRPKENKY GRGDQQMRKT KSGFWKSTGR PKPIMRNRQQ IGEKKILMFY TSKESKSDWV IHEYHGFSHN QMMMTYTLCK VMFNGGMREK SSSSPSSSGV SGIEQSRRDS LIPQLVNNSE GSSLHREDPS QFGDVLQEAP IEDAKLTEEL VKWLMNDEDD AQIEDAIPIE EWETWLNDID DAKEKSIMFM HDNRSDYRPP NSLTGVFSDD VSSDDNDSDL LTPKTNSIQT SSTCDSFGSS NHRIDQIKDL QESPTSTINL VSLTQEVSQA LITSIDTAEK KKNPYDDAQG TEIAKSKSEW ADTMVWFFFS CDEYNEEILK TNSGYWKETV SNTPIIGKWI TSNGVKIGEK QVLVFQSYEN INGSKSDWVM HVYQPTFLPP NQVIFRIYDV // ID NC003070_93 HYPOTHETICAL; PRT; 535 AA. AC NC003070_93; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[453018...453085, 453604...453647, DE 453698...453810, 453907...453975, 454415...454540, 454622...454840, DE 454927...455040, 455112...455147, 455382...455425, 455871...455902, DE 456282...456394, 456490...456558, 457010...457135, 457211...457429, DE 457514...457627, 457716...457817]; Length: 1608. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 22 FIRST EXON; p-value: 0.000. FT GENSCAN 23 23 AA on splice site: tg/c -> C. FT GENSCAN 24 37 INTERNAL EXON; p-value: 0.000. FT GENSCAN 38 38 AA on splice site: g/at -> D. FT GENSCAN 39 75 INTERNAL EXON; p-value: 0.000. FT GENSCAN 76 98 INTERNAL EXON; p-value: 0.000. FT GENSCAN 99 140 INTERNAL EXON; p-value: 0.000. FT GENSCAN 141 213 INTERNAL EXON; p-value: 0.000. FT GENSCAN 214 251 INTERNAL EXON; p-value: 0.000. FT GENSCAN 252 263 INTERNAL EXON; p-value: 0.000. FT GENSCAN 264 277 INTERNAL EXON; p-value: 0.000. FT GENSCAN 278 278 AA on splice site: tt/g -> L. FT GENSCAN 279 288 INTERNAL EXON; p-value: 0.000. FT GENSCAN 289 289 AA on splice site: g/at -> D. FT GENSCAN 290 326 INTERNAL EXON; p-value: 0.000. FT GENSCAN 327 349 INTERNAL EXON; p-value: 0.000. FT GENSCAN 350 391 INTERNAL EXON; p-value: 0.000. FT GENSCAN 392 464 INTERNAL EXON; p-value: 0.000. FT GENSCAN 465 502 INTERNAL EXON; p-value: 0.000. FT GENSCAN 503 535 LAST EXON; p-value: 0.000. SQ SEQUENCE 535 AA; 59648 MW; C507B21066FF1736 CRC64; MISSKYQEMM INDQVRHHYL IFCGKSFQAE TDLTDSSDCL NVDDQLLQNE IVKEVNENPN AGWKAAFNDR FANATVAEFK RLLGVIQTPK TAYLGVPINV SLSANDVIAC CGLLCGFGCN GGFPMGAWLY FKYHGVVTQE CDPYFDNTGC SHPGCEPTYP TPKCERKCVS RNQLWGESKH YGVGAYRINP DPQDIMAEVY KNGPVEVAFT VYEDFAHYKS GVYKYITGTK IGGHAVKLIG WGTSDDGEDY WLLANQWNRS WGDTMCPYFC NISFATVLAS NFILQLVADC LSVDDQLLQN EIVKEVNENP NAGWKASFND RFANATVAEF KRLLGVKPTP KTEFLGVPIN VSLSVNDLLA CCGFLCGQGC NGGYPIAAWR YFKHHGVVTE ECDPYFDNTG CSHPGCEPAY PTPKCARKCV SGNQLWRESK HYGVSAYKVR SHPDDIMAEV YKNGPVEVAF TVYEDFAHYK SGVYKHITGT NIGGHAVKLI GWGTSDDGED YWLLANQWNR SWGDVSSSSL QSNFYFRPSK ETTEP // ID NC003070_94 HYPOTHETICAL; PRT; 661 AA. AC NC003070_94; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[463521...463421, 463323...463123, DE 462999...462876, 462800...462701, 462431...462225, 460669...460323, DE 459682...459503, 459143...459021, 458940...458732, 458636...458243]; DE Length: 1986. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 33 FIRST EXON; p-value: 0.000. FT GENSCAN 34 34 AA on splice site: ag/a -> R. FT GENSCAN 35 100 INTERNAL EXON; p-value: 0.000. FT GENSCAN 101 101 AA on splice site: at/g -> M. FT GENSCAN 102 142 INTERNAL EXON; p-value: 0.000. FT GENSCAN 143 175 INTERNAL EXON; p-value: 0.000. FT GENSCAN 176 176 AA on splice site: t/tc -> F. FT GENSCAN 177 244 INTERNAL EXON; p-value: 0.000. FT GENSCAN 245 245 AA on splice site: c/tc -> L. FT GENSCAN 246 360 INTERNAL EXON; p-value: 0.000. FT GENSCAN 361 420 INTERNAL EXON; p-value: 0.000. FT GENSCAN 421 461 INTERNAL EXON; p-value: 0.000. FT GENSCAN 462 530 INTERNAL EXON; p-value: 0.000. FT GENSCAN 531 531 AA on splice site: tg/g -> W. FT GENSCAN 532 661 LAST EXON; p-value: 0.000. SQ SEQUENCE 661 AA; 74937 MW; 1A0DCEDCC5869519 CRC64; MPPKRNFRKR SFEEEEEDND VNKAAISEEE EKRRLALEEV KFLQKLRERK LGIPALSSTA QSSIGKVKPV EKTETEGEKE ELVLQDTFAQ ETAVLIEDPN MVKYIEQELA KKRGRNIDDA EEVENELKRV EDELYKIPDH LKVKKRSSEE SSTQWTTGIA EVQLPIEYEY ILNHRFHVRV GSCIDINFEN AEHPELYKDR GGPQADGEAA KPSTSSSTNN NADSGKSRQA ATDQIMLERF RKRELGKSIT MLNILPFFLF FLPFLIGNNR ICVAVKTGFV GRNGTQFVLN GEQVYLNGFN AYWMMTTAAD TASKGRATVT TALRQASAVG MNVARIWGFN EGDYIPLQIS PGSYSEDVFK GLDFVVYEAG RFNIKLIISL VNNFEDYGGR KKYVEWAGLD EPDEFYTNSA VKQFYKNHVK TVLTRKNTIT GRMYKDDPTI FSWELINEPR CNDSTASNIL QDWVKEMASY VKSIDSNHLL EIGLEGFYGE SIPERTVYNP GGRVLTGTDF ITNNQIPDID FATIHIYPDS WLPLQSSRTG EQDTFVDRWI GAHIEDCDNI IKKPLLITEF GKSSKYPGFS LEKRNKFFQR VYDVIYDSAR AGGSCTGGVF WQLTTNRTGL LGDGYEVFMQ AGPNTTAQLI ADQSSKLKNL KYPPLVTHSA E // ID NC003070_95 HYPOTHETICAL; PRT; 219 AA. AC NC003070_95; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[464876...464762, 464523...463979]; Length: DE 660. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 38 FIRST EXON; p-value: 0.000. FT GENSCAN 39 39 AA on splice site: g/ga -> G. FT GENSCAN 40 219 LAST EXON; p-value: 0.000. SQ SEQUENCE 219 AA; 23442 MW; 4D6D80460BDFDFE8 CRC64; MMNSRISIII ALSCIMITSI RAYDPDALQD LCVADKSHGT KLNGFPCKET LNITESDFFF AGISKPAVIN STMGSAVTGA NVEKIPGLNT LSVSLARIDY APGGLNPPHT HPRATEVVYV LEGELEVGFI TTANKLFTKT IKIGEVFVFP RGLVHFQKNN GKSPASVLSA FNSQLPGTAS VAATLFAAEP ALPEDVLTKT FQVGSKMVDK IKERLATKK // ID NC003070_96 HYPOTHETICAL; PRT; 246 AA. AC NC003070_96; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[467685...467684, 467552...467186, DE 467086...466954, 466797...466732, 466247...466075]; Length: 741. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 0 FIRST EXON; p-value: 0.000. FT GENSCAN 1 1 AA on splice site: at/g -> M. FT GENSCAN 2 123 INTERNAL EXON; p-value: 0.000. FT GENSCAN 124 167 INTERNAL EXON; p-value: 0.000. FT GENSCAN 168 168 AA on splice site: g/ac -> D. FT GENSCAN 169 189 INTERNAL EXON; p-value: 0.000. FT GENSCAN 190 190 AA on splice site: g/at -> D. FT GENSCAN 191 246 LAST EXON; p-value: 0.000. SQ SEQUENCE 246 AA; 28128 MW; FDA16F175E709C2C CRC64; MSNNQAFMEL GWRNDVGSLA VKDQGMMSER ARSDEDRLIN GLKWGYGYFD HDQTDNYLQI VPEIHKEVEN AKEDLLVVVP DEHSETDDHH HIKDFSERSD HRFYLRNKHE NPKKRRIQVL SSDDESEEFT REVPSVTRKG SKRRRRDEKM SNKMRKLQQL VPNCHKVDGQ GFGSRQDHRV YEKPSTSTSD DVNSGGESLF SSGDIRIWNA QPHADGNGFG SRPKSGESHD AIAANSGVKL AITTVY // ID NC003070_97 HYPOTHETICAL; PRT; 73 AA. AC NC003070_97; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[469818...469971, 471350...471417]; Length: DE 222. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 51 FIRST EXON; p-value: 0.000. FT GENSCAN 52 52 AA on splice site: g/at -> D. FT GENSCAN 53 73 LAST EXON; p-value: 0.000. SQ SEQUENCE 73 AA; 7855 MW; 2C498EBEC0226C28 CRC64; MEISNVSSPS GGVAAISLII LLRPRLLAYD STNLCCFRLI NRRSSKLEGG GDGEEEDGVR IMEEEEGNSE IVG // ID NC003070_98 HYPOTHETICAL; PRT; 249 AA. AC NC003070_98; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[473087...472797, 472596...472138]; Length: DE 750. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 97 FIRST EXON; p-value: 0.000. FT GENSCAN 98 249 LAST EXON; p-value: 0.000. SQ SEQUENCE 249 AA; 28549 MW; 6A7E0E3A45233E7F CRC64; MLLPLNLLFT IFRPNRSHLN REISAKRPLQ QNFHPQGQYR LSRKWFLHLR IICPSDEAVS EIRERRKPRD TTTRSGCIFG SDFTRDDRRM GNGTRRTFSI CIKLYHARFH IVIILFSNCH FTFWDIRNYN YGPAGRALGF DGLRNPETVS NNSVIAFQTA LWFWMTPQSP KPSCHDVMIG KYRPTAADLA ANRTGGFGLT TNIINGGLEC GIPGDGRVND RIGFFQRYTG LFKVATGPNL DCENQRPYA // ID NC003070_99 HYPOTHETICAL; PRT; 537 AA. AC NC003070_99; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[474516...474887, 475142...476383]; Length: DE 1614. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 124 FIRST EXON; p-value: 0.000. FT GENSCAN 125 537 LAST EXON; p-value: 0.000. SQ SEQUENCE 537 AA; 60354 MW; DE27DB5C3566D789 CRC64; MNFRNLIASG SRLGKRFCAT VFAPASATGI VEASVSSPAA ANVVEASVSS PAAENGVRTS VAAPTVASRQ RELYKKLSML SVTGGTVAET LNQFIMEGIT VRKDDLFRCA KTLRKFRRPQ HAFEIFDWME KRKMTFSVSD HAICLDLIGK TKGLEAAENY FNNLDPSAKN HQSTYGALMN CYCVELEEEK AKAHFEIMDE LNFVNNSLPF NNMMSMYMRL SQPEKVPVLV DAMKQRGISP CGVTYSIWMQ SCGSLNDLDG LEKIIDEMGK DSEAKTTWNT FSNLAAIYTK AGLYEKADSA LKSMEEKMNP NNRDSHHFLM SLYAGISKGP EVYRVWESLK KARPEVNNLS YLVMLQAMSK LGDLDGIKKI FTEWESKCWA YDMRLANIAI NTYLKGNMYE EAEKILDGAM KKSKGPFSKA RQLLMIHLLE NDKADLAMKH LEAAVSDSAE NKDEWGWSSE LVSLFFLHFE KAKDVDGAED FCKILSNWKP LDSETMTFLI KTYAAAEKTS PDMRERLSQQ QIEVSEEIQD LLKTVCP // ID NC003070_100 HYPOTHETICAL; PRT; 231 AA. AC NC003070_100; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[477814...478303, 478525...478669, DE 478790...478850]; Length: 696. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 163 FIRST EXON; p-value: 0.000. FT GENSCAN 164 164 AA on splice site: g/ga -> G. FT GENSCAN 165 211 INTERNAL EXON; p-value: 0.000. FT GENSCAN 212 212 AA on splice site: ag/a -> R. FT GENSCAN 213 231 LAST EXON; p-value: 0.000. SQ SEQUENCE 231 AA; 25748 MW; 0FD0653D739E63EA CRC64; MDQMLSGEQD LEVDIEAGRS DVTQESTSDT VSGNGVWSER ANFGVSEKIA DDLSYPLIRD ENRVETSSQS LDLSEKKCGN GKFKKSRKAS KPPRPPKGPS LSENDRKIMR DIQELAMRKR ARIERMKKSL KRLKAAKTSP SSPCITIFSM IITAIFFAFL VFQGFSTGSS SMNSDKSPAP TVSPNNQMIS VQFYNDFAPV EQTDPSPTTS LRYTRKRISG AEEEDSRDVT R // ID NC003070_101 HYPOTHETICAL; PRT; 530 AA. AC NC003070_101; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[480921...481619, 482270...483163]; Length: DE 1593. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 233 FIRST EXON; p-value: 0.000. FT GENSCAN 234 530 LAST EXON; p-value: 0.000. SQ SEQUENCE 530 AA; 59969 MW; 600A3D3243F11071 CRC64; MSGNKISTLQ ALVFFLYRFF ILRRWCHRSP KQKYQKCPSH GLHQYQDLSN HTLIFNVEGA LLKSNSLFPY FMVVAFEAGG VIRSLFLLVL YPFISLMSYE MGLKTMVMLS FFGVKKESFR VGKSVLPKYF LEDVGLEMFQ VLKRGGKRVA VSDLPQVMID VFLRDYLEIE VVVGRDMKMV GGYYLGIVED KKNLEIAFDK VVQEERLGSG RRLIGITSFN SPSHRSLFSQ FCQEIYFVRN SDKKSWQTLP QDQYPKPLIF HDGRLAVKPT PLNTLVLFMW APFAAVLAAA RLVFGLNLPY SLANPFLAFS GIHLTLTVNN HNDLISADRK RGCLFVCNHR TLLDPLYISY ALRKKNMKAV TYSLSRLSEL LAPIKTVRLT RDRVKDGQAM EKLLSQGDLV VCPEGTTCRE PYLLRFSPLF SEVCDVIVPV AIDSHVTFFY GTTASGLKAF DPIFFLLNPF PSYTVKLLDP VSGSSSSTCR GVPDNGKVNF EVANHVQHEI GNALGFECTN LTRRDKYLIL AGNNGVVKKK // ID NC003070_102 HYPOTHETICAL; PRT; 329 AA. AC NC003070_102; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[486964...487364, 488720...489062, DE 489146...489391]; Length: 990. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 133 FIRST EXON; p-value: 0.000. FT GENSCAN 134 134 AA on splice site: ag/c -> S. FT GENSCAN 135 248 INTERNAL EXON; p-value: 0.000. FT GENSCAN 249 329 LAST EXON; p-value: 0.000. SQ SEQUENCE 329 AA; 36839 MW; 7D106DF0449C5A13 CRC64; MVLPSSTPLQ TTGKKTISSP EYNFPVIDFS LNDRSKLSEK IVKACEVNGF FKVINHGVKP EIIKRFEHEG EEFFNKPESD KLRAGPASPF GYGCKNIGFN GDLGELEYLL LHANPTAVAD KSETISHDDP FKFSSATNDY IRTVRDLACE IIDLTIENLW GQKSSEVSEL IRDVRSDSIL RLNHYPPAPY ALSGVGQIGF GEHSDPQILT VLRSNDVDGL EICSRDGLWI PIPSDPTCFF VLVGDCLQAL TNGRFTSVRH RVLANTAKKP RMSAMYFAAP PLEAKISPLP KMVSPENPRR YNSFTWGDYK KATYSLRLDV PRLEFFKTL // ID NC003070_103 HYPOTHETICAL; PRT; 134 AA. AC NC003070_103; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[490497...490093]; Length: 405. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 134 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 134 AA; 14196 MW; 641AECAF7935E84E CRC64; MSPKTRRSTD LAATILMAIV ILASPIIINA EDSSAEVDVN CIPCLQNQPP PPPSPPPPSC TPSPPPPSPP PPKKSSCPPS PLPPPPPPPP PNYVFTYPPG DLYPIENYYG AAVAVESFSV MKLFVFGVMV FLIL // ID NC003070_104 HYPOTHETICAL; PRT; 828 AA. AC NC003070_104; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[491300...491379, 491488...491803, DE 491907...491959, 492086...492158, 492248...492352, 492443...492523, DE 492607...492720, 493095...493272, 493672...495158]; Length: 2487. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 26 FIRST EXON; p-value: 0.000. FT GENSCAN 27 27 AA on splice site: ag/g -> R. FT GENSCAN 28 132 INTERNAL EXON; p-value: 0.000. FT GENSCAN 133 149 INTERNAL EXON; p-value: 0.000. FT GENSCAN 150 150 AA on splice site: ag/g -> R. FT GENSCAN 151 174 INTERNAL EXON; p-value: 0.000. FT GENSCAN 175 209 INTERNAL EXON; p-value: 0.000. FT GENSCAN 210 236 INTERNAL EXON; p-value: 0.000. FT GENSCAN 237 274 INTERNAL EXON; p-value: 0.000. FT GENSCAN 275 333 INTERNAL EXON; p-value: 0.000. FT GENSCAN 334 334 AA on splice site: c/ac -> H. FT GENSCAN 335 828 LAST EXON; p-value: 0.000. SQ SEQUENCE 828 AA; 94871 MW; 1F22E231018FCE79 CRC64; MSWSKACRGT RISSYLENLH RTSQYPRTIL CSRYYTHGAC KSNEHYLRSK RVFWGSSSSW SLNSHSATAK SMLDSAHRQY STHSPSETKS QKMLYYLTAV VFGMVGLTYA AVPLYRTFCQ ATGYGGTVQR KETVEEKIAR HSESGTVTER EIVVQFNADV ADGMQWKFTP TQREVRVKPG ESALAFYTAE NKSSAPITGV STYNVTPMKA GVYFNKIQCF CFEEQRLLPG EQIDMPVFFY IDPEFETDPR MDGINNLILS YTFFKVSEEN TTETSRETTS SSLVLSPNRF GPNSLQAKER YENESSYGEQ VFPIEEEKNL EDEDNTSAPN SFAHDFQMMM ILKPLSSHHV SNFRLSVSFL HSVALSDAKV PVEEEGDDAE TVFRMINGSN LQVELKESLS SSGIHLSKDL IDRVLKRVRF SHGNPIQTLE FYRYASAIRG FYHSSFSLDT MLYILGRNRK FDQIWELLIE TKRKDRSLIS PRTMQVVLGR VAKLCSVRQT VESFWKFKRL VPDFFDTACF NALLRTLCQE KSMTDARNVY HSLKHQFQPD LQTFNILLSG WKSSEEAEAF FEEMKGKGLK PDVVTYNSLI DVYCKDREIE KAYKLIDKMR EEEETPDVIT YTTVIGGLGL IGQPDKAREV LKEMKEYGCY PDVAAYNAAI RNFCIARRLG DADKLVDEMV KKGLSPNATT YNLFFRVLSL ANDLGRSWEL YVRMLGNECL PNTQSCMFLI KMFKRHEKVD MAMRLWEDMV VKGFGSYSLV SDVLLDLLCD LAKVEEAEKC LLEMVEKGHR PSNVSFKRIK LLMELANKHD EVNNLIQKMA IFSTEIPR // ID NC003070_105 HYPOTHETICAL; PRT; 490 AA. AC NC003070_105; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[498435...498053, 497517...497329, DE 497241...497165, 497060...496972, 496899...496760, 496670...496590, DE 496115...495912, 495833...495757, 495651...495563, 495489...495346]; DE Length: 1473. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 127 FIRST EXON; p-value: 0.000. FT GENSCAN 128 128 AA on splice site: ta/c -> Y. FT GENSCAN 129 190 INTERNAL EXON; p-value: 0.000. FT GENSCAN 191 191 AA on splice site: tg/g -> W. FT GENSCAN 192 216 INTERNAL EXON; p-value: 0.000. FT GENSCAN 217 217 AA on splice site: g/aa -> E. FT GENSCAN 218 246 INTERNAL EXON; p-value: 0.000. FT GENSCAN 247 292 INTERNAL EXON; p-value: 0.000. FT GENSCAN 293 293 AA on splice site: tg/g -> W. FT GENSCAN 294 319 INTERNAL EXON; p-value: 0.000. FT GENSCAN 320 320 AA on splice site: at/a -> I. FT GENSCAN 321 387 INTERNAL EXON; p-value: 0.000. FT GENSCAN 388 388 AA on splice site: tg/g -> W. FT GENSCAN 389 413 INTERNAL EXON; p-value: 0.000. FT GENSCAN 414 414 AA on splice site: g/aa -> E. FT GENSCAN 415 443 INTERNAL EXON; p-value: 0.000. FT GENSCAN 444 490 LAST EXON; p-value: 0.000. SQ SEQUENCE 490 AA; 56245 MW; 7075D74AA6D80490 CRC64; MSKDENVESK ETIRVDKRVR EDEEEEEEKK IDTFFKLIKH YQEARKRRRE ELAENSGVVR RKSNGGERSG IVVPAFQPED FSQCRTGLKP PLMFVSDHKE ENTKVEQEED QTEERNEDKA LDLNLALYQK KSTSLQEPEK NGDDSGKAIC RFFSPRRSED RVVWSRRYRK IINNAQIQNR RNPYHNHAYC WIECGKCKIQ GFKLMLLGNG RPTMLHEIAG LVLVVDSTGR DQIEETKDFL NVVIDEIQGS VPDNAPVLVY GNKHEVPGAM SASEISNKLD LTSLRKKNWQ RNWHVQSSCA FSGDGLHEGL DWLLKNAERI FSKAKQTKST SLQEPEKNGD SSGKAICRFF PPRRIEDSVV RSRRCRKIIN NAQTQNRRNP YHNHAYYWNG CGKCKIQGFK LTLLGNGRPT MLQEIAGLVL VVDSTDRDRI EDAKDFLNAV IDEIQGSVPD NVAVLVFGNK HEVPGAMSAS EISNKLDLTS LRQKNWQRNW // ID NC003070_106 HYPOTHETICAL; PRT; 730 AA. AC NC003070_106; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[514984...514892, 514800...514663, DE 514605...514553, 513583...513332, 513273...513147, 512620...512525, DE 512430...512378, 512262...512214, 512121...512074, 506928...506644, DE 506534...506367, 506184...506137, 506052...505845, 505583...505499, DE 505407...505298, 505276...504897]; Length: 2193. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 31 FIRST EXON; p-value: 0.000. FT GENSCAN 32 77 INTERNAL EXON; p-value: 0.000. FT GENSCAN 78 94 INTERNAL EXON; p-value: 0.000. FT GENSCAN 95 95 AA on splice site: aa/a -> K. FT GENSCAN 96 178 INTERNAL EXON; p-value: 0.000. FT GENSCAN 179 179 AA on splice site: ac/a -> T. FT GENSCAN 180 221 INTERNAL EXON; p-value: 0.000. FT GENSCAN 222 253 INTERNAL EXON; p-value: 0.000. FT GENSCAN 254 270 INTERNAL EXON; p-value: 0.000. FT GENSCAN 271 271 AA on splice site: ag/a -> R. FT GENSCAN 272 287 INTERNAL EXON; p-value: 0.000. FT GENSCAN 288 303 INTERNAL EXON; p-value: 0.000. FT GENSCAN 304 398 INTERNAL EXON; p-value: 0.000. FT GENSCAN 399 454 INTERNAL EXON; p-value: 0.000. FT GENSCAN 455 470 INTERNAL EXON; p-value: 0.000. FT GENSCAN 471 539 INTERNAL EXON; p-value: 0.000. FT GENSCAN 540 540 AA on splice site: g/ga -> G. FT GENSCAN 541 567 INTERNAL EXON; p-value: 0.000. FT GENSCAN 568 568 AA on splice site: ag/t -> S. FT GENSCAN 569 604 INTERNAL EXON; p-value: 0.000. FT GENSCAN 605 605 AA on splice site: g/gc -> G. FT GENSCAN 606 730 LAST EXON; p-value: 0.000. SQ SEQUENCE 730 AA; 80776 MW; D2295E2B8C05DD11 CRC64; MEVDVPVSVA YNFYLDRESF PKWMPFISSV QVLKDKPDLS RWSLKYNAFG QDIKYSWLAR NLQARIHHSV LDQCSSVPTP NQKIHWRSLE GLPNKSKEMS SIASGTVSTT RSTLCFRKIP KSATVLSMLH RSSSSSPRML LLLSSSSANS AKLLNSNNGL LISSSPKPFR PVMQWQDVTE ILWFGLSRVK MVVDAPASVA YKLYADREMF PKWMPFLSSV EAMEGSPDLS RYLVKLESFG QNIEYHFLAK NLQPIPDRKI HWRSIEGFEN RGSVRFFPRG PSSCLVEISF SYEVPNAFAP VAFPPTPPPG PPDSPAPSLP PSPSDDPADD NNGIYNVRKY GAVGDGETDD TEAFKTAWDS SCNNENNTDS VLLVPYGYTF MIQSTIFTGP CRSYQFFQVD GTIVTPDGPE SWPSNISKRQ WLVFYRVNGM ALKGEGVIDG RGQKWWDLPC KPHRSVNKSA IVTGPCDSPI ALRFFMSSNL RVEGLQIKNS PQFHFRFDGC QGVHVESLHI TAPPLSPNTD GIHIENSNSV TIYNSIISNG DDCVSIGSGS YDVDIRNLTC GPGGHGISIG SLGNHNSRAC VSNITVRDSV IKYSDNGVRI KTWQGVTFNN IHVDSVRNPI IIDQYYCMTK DCANKTSAVF VSDIAYQGIK GTYDIRSPPM HFGCSDAVPC TNLTLSDIEL LPAKGEIVLD PFCWNAYGIA EELSIPPVWC LMSDPPKGLQ GSLVDKCGSS // ID NC003070_107 HYPOTHETICAL; PRT; 393 AA. AC NC003070_107; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[519037...520218]; Length: 1182. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 393 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 393 AA; 43159 MW; 27B0AF8AF55D2FF3 CRC64; METFLFTSES VNEGHPDKLC DQISDAVLDA CLEQDPDSKV ACETCTKTNM VMVFGEITTK ATVDYEKIVR DTCRAIGFVS DDVGLDADKC KVLVNIEQQS PDIAQGVHGH FTKCPEEIGA GDQGHMFGYA TDETPELMPL SHVLATKLGA RLTEVRKNGT CAWLRPDGKT QVTVEYYNDK GAMVPIRVHT VLISTQHDET VTNDEIARDL KEHVIKPVIP EKYLDEKTIF HLNPSGRFVI GGPHGDAGLT GRKIIIDTYG GWGAHGGGAF SGKDPTKVDR SGAYIVRQAA KSVVANGMAR RALVQVSYAI GVPEPLSVFV DTYETGLIPD KEILKIVKES FDFRPGMMTI NLDLKRGGNG RFLKTAAYGH FGRDDPDFTW EVVKPLKWDK PQA // ID NC003070_108 HYPOTHETICAL; PRT; 246 AA. AC NC003070_108; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[521626...520886]; Length: 741. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 246 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 246 AA; 27749 MW; FBBBB4AE27877E49 CRC64; MEEENLLNEN LLHPNESSPE ETQVTTVSKS KWTILVLAMI LLLVYLTFGV CTYSFFRDQF SGTETNLFVD AFYFSIVTFS TVGYGDIVPS TSTTKILTIV LVSTGVVFLD YLLNRVVSHV LSLQENAILD RINKTRNRAI RDHIAEDGKI RLKWKLCLAF CAVGLCVGSG ALFLHVFERL DWLDSVYLSV ISVTTVGYGD KTFKTVEGRG FAVFWLLLST IAMATLFLYL AEMRIDRTTV MKLPPR // ID NC003070_109 HYPOTHETICAL; PRT; 2534 AA. AC NC003070_109; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[524134...524436, 524521...524575, DE 524662...524837, 524917...525138, 525224...525462, 525546...527577, DE 527658...527921, 528010...528309, 528392...528695, 529948...530099, DE 530182...530236, 530403...530578, 530681...530902, 530980...531218, DE 531325...533005, 533079...533345, 533453...533716, 533808...534107, DE 534189...534542]; Length: 7605. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 101 FIRST EXON; p-value: 0.000. FT GENSCAN 102 119 INTERNAL EXON; p-value: 0.000. FT GENSCAN 120 120 AA on splice site: c/ag -> Q. FT GENSCAN 121 178 INTERNAL EXON; p-value: 0.000. FT GENSCAN 179 252 INTERNAL EXON; p-value: 0.000. FT GENSCAN 253 331 INTERNAL EXON; p-value: 0.000. FT GENSCAN 332 332 AA on splice site: at/g -> M. FT GENSCAN 333 1009 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1010 1097 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1098 1197 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1198 1298 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1299 1299 AA on splice site: g/at -> D. FT GENSCAN 1300 1349 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1350 1367 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1368 1368 AA on splice site: c/ag -> Q. FT GENSCAN 1369 1426 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1427 1500 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1501 1579 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1580 1580 AA on splice site: at/g -> M. FT GENSCAN 1581 2140 INTERNAL EXON; p-value: 0.000. FT GENSCAN 2141 2229 INTERNAL EXON; p-value: 0.000. FT GENSCAN 2230 2317 INTERNAL EXON; p-value: 0.000. FT GENSCAN 2318 2417 INTERNAL EXON; p-value: 0.000. FT GENSCAN 2418 2534 LAST EXON; p-value: 0.000. SQ SEQUENCE 2534 AA; 272971 MW; 48E8F56961F7BC40 CRC64; MNGDGAREGD SVSHEPSTSK SPKEGEETKK EEKSEEKANT VPFYKLFAFA DSSDVLLMIC GSIGAIGNGM SLPFMTLLFG DLIDSFGKNQ NNKDIVDVVS KVCLKFVYLG LGTLGAAFLQ VACWMITGER QAARIRSTYL KTILRQDIGF FDVETNTGEV VGRMSGDTVL IQDAMGEKVG KFIQLVSTFV GGFVLAFIKG WLLTLVMLTS IPLLAMAGAA MALIVTRASS RGQAAYAKAA TVVEQTIGSI RTVASFTGEK QAINSYKKFI TSAYKSSIQQ GFSTGLGLGV MFFVFFSSYA LAIWFGGKMI LEKGYTGGAV INVIIIVVAG SMSLGQTSPC VTAFAAGQAA AYKMFETIKR KPLIDAYDVN GKVLEDIRGD IELKDVHFSY PARPDEEIFD GFSLFIPSGA TAALVGESGS GKSTVISLIE RFYDPKSGAV LIDGVNLKEF QLKWIRSKIG LVSQEPVLFS SSIMENIAYG KENATVEEIK AATELANAAK FIDKLPQGLD TMVGEHGTQL SGGQKQRIAI ARAILKDPRI LLLDEATSAL DAESERVVQE ALDRVMVNRT TVIVAHRLST VRNADMIAVI HRGKMVEKGS HSELLKDSEG AYSQLIRLQE INKDVKTSEL SSGSSFRNSN LKKSMEGTSS VGNSSRHHSL NVLGLTTGLD LGSHSQRAGQ DETGTASQEP LPKVSLTRIA ALNKPEIPVL LLGTVAAAIN GAIFPLFGIL ISRVIEAFFK PAHELKRDSR FWAIIFVALG VTSLIVSPTQ MYLFAVAGGK LIRRIRSMCF EKAVHMEVAW FDEPQNSSGT MGARLSADAT LIRALVGDAL SLAVQNVASA ASGLIIAFTA SWELALIILV MLPLIGINGF VQVKFMKGFS ADAKVRTSTG LVPFFLGSAK ILCFFSCESR NLLWFWITMQ SKYEEASQVA NDAVGSIRTV ASFCAEEKVM QMYKKQCEGP IKDGIKQGFI SGLGFGFSFF ILFCVYATSF YAGARLVEDG KTTFNNVFQV FFALTMAAIG ISQSSTFAPD SSKAKVAAAS IFAIIDRKSK IDSSDETGTV LENVKGDIEL RHLSFTYPAR PDIQIFRDLC LTIRAGKTVA LVGESGSGKS TVISLLQRFY DPDSGHITLD GVELKKLQLK WLRQQMGLVG QEPVLFNDTI RANIAYGKGS EEAATESEII AAAELANAHK FISSIQQGYD TVVGERGIQL SGGQKQRVAI ARAIVKEPKI LLLDEATSAL DAESERVVQD ALDRVMVNRT TIVVAHRLST IKNADVIAVV KNGVIAEKGT HETLIKIEDS FDVFLMICGS LGAIGNGVCL PLMTLLFGDL IDSFGKNQNN KDIVDVVSKV CLKFVYLGLG RLGAAFLQVA CWMITGERQA AKIRSNYLKT ILRQDIGFFD VETNTGEVVG RMSGDTVHIQ DAMGEKVGKF IQLVSTFVGG FALAFAKGWL LTLVMLTSIP FLAMAGAAMA LLVTRASSRG QAAYAKAATV VEQTIGSIRT VASFTGEKQA INSYKKYITS AYKSSIQQGF STGLGLGVMI YVFFSSYALA IWFGGKMILE KGYTGGSVIN VIIIVVAGSM SLGQTSPCVT AFAAGQAAAY KMFETIKRKP LIDAYDVNGK VLGDIRGDIE LKDVHFSYPA RPDEEIFDGF SLFIPSGATA ALVGESGSGK STVINLIERF YDPKAGEVLI DGINLKEFQL KWIRSKIGLV CQEPVLFSSS IMENIAYGKE NATLQEIKVA TELANAAKFI NNLPQGLDTK VGEHGTQLSG GQKQRIAIAR AILKDPRVLL LDEATSALDT ESERVVQEAL DRVMVNRTTV VVAHRLSTVR NADMIAVIHS GKMVEKGSHS ELLKDSVGAY SQLIRCQEIN KGHDAKPSDM ASGSSFRNSN LNISREGSVI SGGTSSFGNS SRHHSLNVLG LFAGLDLGSG SQRVGQEETG TTSQEPLRKV SLTRIAALNK PEIPVLLLGT VVAAINGAIF PLFGILISRV IEAFFKPADQ LKKDSRFWAI IFVALGVTSL IVSPSQMYLF AVAGGKLIRR IQSMCFEKAV HMEVSWFDEP ENSSGTMGAR LSTDAALIRA LVGDALSLAV QNAASAASGL IIAFTASWEL ALIILVMLPL IGINGFLQVK FMKGFSADAK SKYEEASQVA NDAVGSIRTV ASFCAEEKVM QMYNKQCEGP IKDGVKQGFI SGLGFGFSFF ILFCVYATSF YAAARLVEDG KTTFIDVFQV FFALTMAAIG ISQSSTFAPD SSKAKVAAAS IFAIIDRKSK IDSSDETGTV LENVKGDIEL RHLSFTYPAR PGIQIFRDLC LTIRAGKTVA LVGESGSGKS TVISLLQRFY DPDSGQITLD GVELKKLQLK WLRQQMGLVG QEPVLFNDTI RANIAYGKGS EEAATESEII AAAELANAHK FISSIQQGYD TVVGEKGIQL SGGQKQRVAI ARAIVKEPKI LLLDEATSAL DAESERLVQD ALDRVIVNRT TVVVAHRLST IKNADVIAIV KNGVIAENGT HETLIKIDGG VYASLVQLHM TASN // ID NC003070_110 HYPOTHETICAL; PRT; 290 AA. AC NC003070_110; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[535699...534827]; Length: 873. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 290 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 290 AA; 33804 MW; 28B4419A5866A740 CRC64; MTEYWLASEN DAEFQEVALC NIRMKKNVLD QFKFPTENNN NLVEQPPLHN NMVSGSTMDQ NMEEYADELE NMLDEEQEED DDRAIQQQHP EFPLQSHDSR STLDKHMEEY ADDLEKMLDE EEEGDDDSAI QQQHLEIHLQ LQDHGSRSTT DQNMEEYANE LENILDEEEE EDDDDDRGIQ QQDSQIPLPV QSQDSGNPLV VIMEDERVDQ DMIFDLVKQE EEERKLKTFT EICFAGLKTQ RNLHFQDSYF AGGQEMSPWD NMVTNNNPFG LVFNTHGHEM QEPPVINGVN // ID NC003070_111 HYPOTHETICAL; PRT; 1545 AA. AC NC003070_111; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[536483...537207, 538041...538130, DE 538419...538533, 538627...538719, 538766...538853, 538936...539008, DE 539255...539360, 539611...539755, 541525...541792, 541856...542016, DE 543130...543979, 544762...544804, 544973...545062, 545707...545967, DE 546102...546235, 546313...546400, 546512...546637, 546727...546774, DE 547024...547172, 547248...547374, 547463...547671, 547748...547879, DE 547970...548051, 548134...548181, 548311...548439, 548525...548602, DE 548815...548994]; Length: 4638. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 241 FIRST EXON; p-value: 0.000. FT GENSCAN 242 242 AA on splice site: at/a -> I. FT GENSCAN 243 271 INTERNAL EXON; p-value: 0.000. FT GENSCAN 272 272 AA on splice site: ag/a -> R. FT GENSCAN 273 310 INTERNAL EXON; p-value: 0.000. FT GENSCAN 311 341 INTERNAL EXON; p-value: 0.000. FT GENSCAN 342 370 INTERNAL EXON; p-value: 0.000. FT GENSCAN 371 371 AA on splice site: g/gc -> G. FT GENSCAN 372 394 INTERNAL EXON; p-value: 0.000. FT GENSCAN 395 395 AA on splice site: ag/t -> S. FT GENSCAN 396 430 INTERNAL EXON; p-value: 0.000. FT GENSCAN 431 478 INTERNAL EXON; p-value: 0.000. FT GENSCAN 479 479 AA on splice site: g/ct -> A. FT GENSCAN 480 567 INTERNAL EXON; p-value: 0.000. FT GENSCAN 568 568 AA on splice site: ag/a -> R. FT GENSCAN 569 621 INTERNAL EXON; p-value: 0.000. FT GENSCAN 622 622 AA on splice site: c/tg -> L. FT GENSCAN 623 904 INTERNAL EXON; p-value: 0.000. FT GENSCAN 905 905 AA on splice site: gt/a -> V. FT GENSCAN 906 919 INTERNAL EXON; p-value: 0.000. FT GENSCAN 920 949 INTERNAL EXON; p-value: 0.000. FT GENSCAN 950 1036 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1037 1080 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1081 1081 AA on splice site: tg/g -> W. FT GENSCAN 1082 1110 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1111 1152 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1153 1168 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1169 1217 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1218 1218 AA on splice site: ag/t -> S. FT GENSCAN 1219 1260 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1261 1329 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1330 1330 AA on splice site: gg/g -> G. FT GENSCAN 1331 1373 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1374 1374 AA on splice site: ag/c -> S. FT GENSCAN 1375 1401 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1402 1417 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1418 1460 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1461 1486 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1487 1545 LAST EXON; p-value: 0.000. SQ SEQUENCE 1545 AA; 174834 MW; ACD15702B62BCE2C CRC64; MKKSSPLLCF SLALFSLLSS PSSSTRIISS IVPSAAPSPA VAPTTDGDVD ENDFSAFTQW NILNLTDLKS TFKNLPDFSK LNISSLHVSP AVGSVCSNTD YAAECIVSIL PLLRDFRKFE PKPIDVLRME MSALYEKANA TLDLAKRLIV DKSTPRDVAD VLDLCVDNYE SLLDDLKDAS VAVDDGDFER LESVVSAAIA DVVTCSDAFA ESSELESPMA NVDDFLKKLC SNVLAISQMI HIFTAGFVSA SPNGSSFDSP KLSLPFEPLR SRDDLQVPSS PYFPAYAQGQ GPPPMVQERF QSIISQLFQY RIIRCGGAVD DDMANIIVAQ LLYLDAVDPT KMFCFRILIF AYVEQDIVMY VNSPGGSVTA GMAIFDTMRH IRPDVSTVCV GLAASMGAFL LSAGTKGSLL ILSIISKNLL VKSYEFHFSD ANEMLHHKAN LNGYLAYHTG QSLEKINQDT DRDFFMSAKE AKEYGLIDAP VTVSGVFSLL HQADVGILYT ILSLIIVSTL IHILSGKPEC SVLHSHLYIC WIVLFIVQAC VAFGIEGTMS TTISIDTDKS FSLAAQERVV VKPVIDDTVF GVYVEEERWS ERAVVAVTFG LMWWWRLRDE VESLVVVAEV KLFQSLRIKM EHEETQKNTR NSWSLIRPFQ MISISFLSLL LPLSFLFLSR LSLYTSSTPV TVSGVSSVIH QADVGVLYTI LFLIIVFTLI HSLSGKPECS VLHSHLYICW IVLFIAQACA FGIKRTMSTT MSINPDKNLF LATHERWMLV RVLFFLGLHE VMLMWFRVVV KPVVDNTIYG VYVEERWSER AVVAVTFGIM WWWRLRDEVE SLVVVVTADR LNLPIRLEGL NFVNWCMYYI CVGIGLMKIF KGFLDFVNTL TLSIKRSRKG CESCVFDDMC NDDHVEDEKK RRGEWLMEKE NHEDDGEGLP PELNQIKEQI EKERFLHIKP AAEDDNGGDN KSLLSRMQNP LRHFSASSDY NSYEDQGYVL DEDQDYALEE DVPLFLDEDV PLLPSVKLPI VEKLPRSITW VFTKRHRQIY YLNGEALELS SEEDEEDEEE DEEEIKKEKC EFSEDVDRFI WTVGQDYGLD DLVVRRALAK YLEVDVSDIL ERYNELKLKN DGTAGEASDL TSKTITTAFQ DFADRRHCRR CMIFDCHMHE KYEPESRSVR SVTEADHVMD NDNSISNKIV VSDPNNTMWT PVEKDLYLKG IEIFGRNSCD VALNILRGLK TCLEIYNYMR EQDQCTMSLD LNKTTQRHNQ VTKKVSRKSS RSVRKKSRLR KYARYPPALK KTTSGEAKFY KHYTPCTCKS KCGQQCPCLT HENCCEKYCG CSKDCNNRFG GCNCAIGQCT NRQCPCFAAN RECDPDLCRS CPLSCGDGTL GETPVQIQCK NMQFLLQTNK KILIGKSDVH GWGAFTWDSL KKNEYLGEYT GELITHDEAN ERGRIEDRIG SSYLFTLNDQ LEIDARRKGN EFKFLNHSAR PNCYAKLMIV RGDQRIGLFA ERAIEEGEEL FFDYCYGPEH ADWSRGREPR KTGASKRSKE ARPAR // ID NC003070_112 HYPOTHETICAL; PRT; 639 AA. AC NC003070_112; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[562382...561902, 561549...561453, DE 558763...558708, 558505...558335, 558312...558209, 557991...557853, DE 557543...557492, 555792...555726, 554863...554796, 554613...554472, DE 553677...553600, 553490...553395, 552373...552219, 551608...551395]; DE Length: 1920. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 160 FIRST EXON; p-value: 0.000. FT GENSCAN 161 161 AA on splice site: g/ct -> A. FT GENSCAN 162 192 INTERNAL EXON; p-value: 0.000. FT GENSCAN 193 193 AA on splice site: gg/g -> G. FT GENSCAN 194 211 INTERNAL EXON; p-value: 0.000. FT GENSCAN 212 212 AA on splice site: g/at -> D. FT GENSCAN 213 268 INTERNAL EXON; p-value: 0.000. FT GENSCAN 269 269 AA on splice site: g/ag -> E. FT GENSCAN 270 303 INTERNAL EXON; p-value: 0.000. FT GENSCAN 304 349 INTERNAL EXON; p-value: 0.000. FT GENSCAN 350 350 AA on splice site: g/tt -> V. FT GENSCAN 351 366 INTERNAL EXON; p-value: 0.000. FT GENSCAN 367 367 AA on splice site: gc/a -> A. FT GENSCAN 368 389 INTERNAL EXON; p-value: 0.000. FT GENSCAN 390 411 INTERNAL EXON; p-value: 0.000. FT GENSCAN 412 412 AA on splice site: ag/a -> R. FT GENSCAN 413 459 INTERNAL EXON; p-value: 0.000. FT GENSCAN 460 485 INTERNAL EXON; p-value: 0.000. FT GENSCAN 486 517 INTERNAL EXON; p-value: 0.000. FT GENSCAN 518 568 INTERNAL EXON; p-value: 0.000. FT GENSCAN 569 569 AA on splice site: ag/t -> S. FT GENSCAN 570 639 LAST EXON; p-value: 0.000. SQ SEQUENCE 639 AA; 72441 MW; 49902935ABB9F6BE CRC64; MVDEKVIVDE VETRDAYRVA YVIHFLLGAG SLIPWNALIT AVDYFGYLYP DKHVEKTFTV AYMSCSVLVL VLMMTWNTRM SYRVRMNLGF SMFIIAMMIS PLIDWVWKGE KGENVSYMLM VGSVVLCGLA DGVVGGSLIG SAGKLPRQYM QAIFAGTASS ATSNATTPQV SSTFAFDSDN MDGWEENQVA SFGTLETLGV LVDCEIQPYL SDGCDPRVVR PSIKRLLENE GYCGPLTVTA VGKLANVPTD TLRALYSSGI HLIISPFGED SALWHCSVCC LDPPAHGFEN FITHLSTGDY KVMLHVTAYL MESDMMSRDQ PWTDERPPLP TDLSTRLAKL KDLEAVKYEV VKLERPRWNR SSSSDAAICH EEEAESYFEA PCSCSGTIKE YKPGYTTTSK PSRFIETAVT IRDNLHIMRR ENGRRRRNRR LVNREESDFQ ECNSGVDRGA SCCRYLALIF SVILLIKHAF DAVYGTEEYP YTIFTVLTLK AIGILLPMLV IIRTITAIQR SLRYQILVEV DLVTGKTEFI RSDIIYDCGK SLNPAIDLGQ VISLVLVRIL DVHLMFYHSL CFFLIKFYSS LTKRTHRSRE RQKLDDHPRQ CHRIRLITRL KISPPFLFSV SVFLRFTIFT KSWTDSRRW // ID NC003070_113 HYPOTHETICAL; PRT; 736 AA. AC NC003070_113; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[564308...564697, 565433...565630, DE 565784...565953, 566043...566140, 566226...567580]; Length: 2211. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 130 FIRST EXON; p-value: 0.000. FT GENSCAN 131 196 INTERNAL EXON; p-value: 0.000. FT GENSCAN 197 252 INTERNAL EXON; p-value: 0.000. FT GENSCAN 253 253 AA on splice site: gg/g -> G. FT GENSCAN 254 285 INTERNAL EXON; p-value: 0.000. FT GENSCAN 286 286 AA on splice site: g/gc -> G. FT GENSCAN 287 736 LAST EXON; p-value: 0.000. SQ SEQUENCE 736 AA; 79238 MW; D5AF4CCFAFF640DC CRC64; MAFLAVILFF LISSSSVCVH SRETFACDTK DAATATLRFC QLSVPIPERV RDLIGRLTLA EKVSLLGNTA AAIPRLGIKG YEWWSEALHG VSNVGPGTKF GGVYPAATSF PQVITTVASF NASLWESIGR VVSNEARAMY NGGVGGLTYW SPNVNILRDP RWGRGQETPG EDPVVAGKYA ASYVRGLQGN DRSRLKVSKQ DIEDTFDVPF RMCVKEGNVA SIMCSYNQVN GVPTCADPNL LKKTIRNQWG LNGYIVSDCD SVGVLYDTQH YTGTPEEAAA DSIKAGLDLD CGPFLGAHTI DAVKKNLLRE SDVDNALINT LTVQMRLGMF DGDIAAQPYG HLGPAHVCTP VHKGLALEAA QQGIVLLKNH GSSLPLSSQR HRTVAVIGPN SDATVTMIGN YAGVACGYTS PVQGITGYAR TIHQKGCVDV HCMDDRLFDA AVEAARGADA TVLVMGLDQS IEAEFKDRNS LLLPGKQQEL VSRVAKAAKG PVILVLMSGG PIDISFAEKD RKIPAIVWAG YPGQEGGTAI ADILFGSANP GGKLPMTWYP QDYLTNLPMT EMSMRPVHSK RIPGRTYRFY DGPVVYPFGH GLSYTRFTHN IADAPKVIPI AVRGRNGTVS GKSIRVTHAR CDRLSLGVHV EVTNVGSRDG THTMLVFSAP PGGEWAPKKQ LVAFERVHVA VGEKKRVQVN IHVCKYLSVV DRAGNRRIPI GDHGIHIGDE SHTVSLQAST LGVIKS // ID NC003070_114 HYPOTHETICAL; PRT; 513 AA. AC NC003070_114; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[568704...568739, 568832...569223, DE 569305...570418]; Length: 1542. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 12 FIRST EXON; p-value: 0.000. FT GENSCAN 13 142 INTERNAL EXON; p-value: 0.000. FT GENSCAN 143 143 AA on splice site: ag/a -> R. FT GENSCAN 144 513 LAST EXON; p-value: 0.000. SQ SEQUENCE 513 AA; 58247 MW; 85C47048E3002F18 CRC64; MAIHPWWKRN RKKVDKYMKN AKDLITSQDP NDIVSALSLL NSTLSISPHH ELALELKARS LLYLRRFKDV AVLLHNYIPS LRIDNEDVSS VFAASSELSS LMLLLPSGSP SHDSSFKCFS YSYLKKKVMA GLSNNSQVQG QWRYLVLGQA CYHLGLMDDA IILLQTGKRL ATAELRRESI CWSEDSFNLS TSESQPQPIT ESEIVSQMLS QTKLFLRRRT AALAALDAGL YSESIRHFSK IIDSRRGAPQ SFLVYCLIRR AFAYKSAGRI ADSIADCNLI LALEPSCIEA LETRAELFRS IRCFPDSLHD LEHLKLLFNS ILRDRSLTGP VWKRHNVRYR EIPGKLCVLT TNIKQMKEKI TNRENGNEDY YSLMGIERGC SRSELNRAYL LLNLRYKSER SMTSIDRFDI IDEQELVSVK NRARMSTLLL YRLIQKGYYA VLSDIETVEA DKAVAIDNRR IETPMDGNKA VAMTVVRKSN DKLDVVVKGV FCRDMAAVGS LISRAGLRQP ITV // ID NC003070_115 HYPOTHETICAL; PRT; 693 AA. AC NC003070_115; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[574746...574120, 573953...573489, DE 573252...572836, 572759...572187]; Length: 2082. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 209 FIRST EXON; p-value: 0.000. FT GENSCAN 210 364 INTERNAL EXON; p-value: 0.000. FT GENSCAN 365 503 INTERNAL EXON; p-value: 0.000. FT GENSCAN 504 693 LAST EXON; p-value: 0.000. SQ SEQUENCE 693 AA; 76098 MW; B5C213483C3F5425 CRC64; MDSLCLNSGL HGVIPAITAV GNGGCGGVVE VRATASAPSQ KRGPFGFSFK YPLTPFWSRG GGGGIASRRR SGLCLDDAVL VDSGDSRKPI AEETAVEMDT ERRNGSWVLK ILDVQSTWKH EEEEDDDEVE DEDGDEDEEV ELDDAVVSED DGGCDVCSVL EDDGNEANKF QLDRESFSKL LRRVTLPESK LYAQLSYLGN LAYSISKIKP ANLSKYYGLR FVTSSAEKTE SALKAENGEV SGETKPIVEA EEEVEEEEKN KSRKISASAA YEIVASAASY LHSRTNNILP FNSSSKAENS DKHDVNLTNA ESSSDVAYSV TSVVAAEEDV KQAVADDLKS TISSPCDWFI CDDDQSHTRF VVIQGLGAIV HRGIYEAAKG MYEQMLPEVK AHIKTHGTSA KFRFTGHSLG GSLSLLLNLM LLVRGEVPAS SLLPVITYGA PFVLCGGDRL LKKLGLPKSH VQAIVMHRDI VPRAFSCNYP YHVAELLKAV NGNFRSHPCL NKQSMLYSPM GELLILQPDE TFSPGHELLP SGNGLYLLTS DFESPDIEDS DEERLRAAQT VFLNTPHPLD ILSDRSAYGS SGTIQRDHDM NSYLKAVRSV IRKEVNQIRR AKREHRRSLW WPILVARESG SSGIAVSNGQ INGQDFSGMM QTGRKSLQRF SRLVASQHMP LIVVMLFPVK LLFLGAFNVF SFR // ID NC003070_116 HYPOTHETICAL; PRT; 712 AA. AC NC003070_116; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[575533...575586, 576774...576912, DE 576994...577592, 577784...577906, 577979...578052, 578722...578791, DE 578895...579017, 579196...579312, 579421...579525, 579647...579799, DE 579879...579962, 580646...580745, 581070...581186, 581278...581315, DE 581497...581559, 581644...581718, 581815...581919]; Length: 2139. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 18 FIRST EXON; p-value: 0.000. FT GENSCAN 19 64 INTERNAL EXON; p-value: 0.000. FT GENSCAN 65 65 AA on splice site: g/gt -> G. FT GENSCAN 66 264 INTERNAL EXON; p-value: 0.000. FT GENSCAN 265 305 INTERNAL EXON; p-value: 0.000. FT GENSCAN 306 329 INTERNAL EXON; p-value: 0.000. FT GENSCAN 330 330 AA on splice site: ag/g -> R. FT GENSCAN 331 353 INTERNAL EXON; p-value: 0.000. FT GENSCAN 354 394 INTERNAL EXON; p-value: 0.000. FT GENSCAN 395 433 INTERNAL EXON; p-value: 0.000. FT GENSCAN 434 468 INTERNAL EXON; p-value: 0.000. FT GENSCAN 469 519 INTERNAL EXON; p-value: 0.000. FT GENSCAN 520 547 INTERNAL EXON; p-value: 0.000. FT GENSCAN 548 580 INTERNAL EXON; p-value: 0.000. FT GENSCAN 581 581 AA on splice site: g/aa -> E. FT GENSCAN 582 619 INTERNAL EXON; p-value: 0.000. FT GENSCAN 620 620 AA on splice site: t/tg -> L. FT GENSCAN 621 632 INTERNAL EXON; p-value: 0.000. FT GENSCAN 633 653 INTERNAL EXON; p-value: 0.000. FT GENSCAN 654 678 INTERNAL EXON; p-value: 0.000. FT GENSCAN 679 712 LAST EXON; p-value: 0.000. SQ SEQUENCE 712 AA; 80693 MW; 03E63B6CC0972837 CRC64; MSTWGRQGRG LPRQSRSKNA ILVGNCNLSH VSLPKLFVGL FEIDENLKEE EVPDDDDSVG GEVQGEVNAN DYIPNPAAPA NTKRKWQIMK EKVQMTEDDD FDEQNAVIAE AAEQPLDLII PLLKYQKEFL AWATIQELSA VRGGILADEM GMGKTIQAIS LVLARREVDR AKSREAVGHT LVLVPPVALS QWLDEISRLT SPGSTRVLQY HGPKRDKNVQ KLMNYDFVLT TSPIVENEYR KDEGVDETMS PLHSIKWNRI IVDEAHDIKN RSSRTAKAVF ALEATYRWAL SGTPLQNDVD ELYSLIRFLR VSPYSYYFCK KCDCEVLDRR YIQAGTLMNN YAHIFGLLIR LRQAVDHPYL VSYSSPSGAN ANLLDANKNE KECGFGHDPS KDYFVTSSEH QASKTKLKGF RASSILNRIN LDDFKTSTKI EALREEIRFM VERDWSAKAI VFSQFTSFLD LISYALGKSG VSCVQLVGSM SKAAKDAALK NFKEEPDCRV LLMSLQAGGV ALNLTAASHV FMMDPWWNPA VERQAQDRIH RIGQCKPRER SVNKSPFLRS HHLHRRFCND HLCYSRRSIY EEEEEMSNTP AAAASSSSKS KAAGTSQPQE KRKTLFQKEL QHMMYGFGDE QNPLPESVAL VEDIVVEYVT DLVTHKAQEI GSKRGRLLVD DFLYLIRKDL PKLNRCRELL AMQEELKQAR KAFDVDEKEL VD // ID NC003070_117 HYPOTHETICAL; PRT; 129 AA. AC NC003070_117; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[582422...582536, 582632...582906]; Length: DE 390. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 38 FIRST EXON; p-value: 0.000. FT GENSCAN 39 39 AA on splice site: a/cg -> T. FT GENSCAN 40 129 LAST EXON; p-value: 0.000. SQ SEQUENCE 129 AA; 13976 MW; 2C44690D360A51BB CRC64; MKKKKSSRGM HMFSVVMFML RRRRGENPST PGSGGESLTT EMGGDDGDND NEDDDDSEGR LSETMEVFTA ASSSSSSGIS GYGSAMSLYD DDIDEEEDEC YSDVEGGDDM IDEKAEVLCN NEDAKSGLH // ID NC003070_118 HYPOTHETICAL; PRT; 768 AA. AC NC003070_118; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[583364...584070, 584414...584618, DE 584860...584956, 585053...585144, 585237...585368, 585500...585685, DE 585779...585942, 586023...586101, 586192...586320, 586439...586566, DE 586649...587036]; Length: 2307. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 235 FIRST EXON; p-value: 0.000. FT GENSCAN 236 236 AA on splice site: ag/c -> S. FT GENSCAN 237 304 INTERNAL EXON; p-value: 0.000. FT GENSCAN 305 336 INTERNAL EXON; p-value: 0.000. FT GENSCAN 337 337 AA on splice site: g/ag -> E. FT GENSCAN 338 367 INTERNAL EXON; p-value: 0.000. FT GENSCAN 368 411 INTERNAL EXON; p-value: 0.000. FT GENSCAN 412 473 INTERNAL EXON; p-value: 0.000. FT GENSCAN 474 527 INTERNAL EXON; p-value: 0.000. FT GENSCAN 528 528 AA on splice site: gc/t -> A. FT GENSCAN 529 554 INTERNAL EXON; p-value: 0.000. FT GENSCAN 555 597 INTERNAL EXON; p-value: 0.000. FT GENSCAN 598 639 INTERNAL EXON; p-value: 0.000. FT GENSCAN 640 640 AA on splice site: aa/g -> K. FT GENSCAN 641 768 LAST EXON; p-value: 0.000. SQ SEQUENCE 768 AA; 85718 MW; CC4D416E4D646219 CRC64; MVTITQLIPF REKLIETTVT NEVSIAKNWI LAIRLTYRDE PTVIISLNSK TNPQDDAKIS TLQLCIKTKC LILQLLHMKQ NTNLGECLSD LFRDERFVFV GIGIAKTVAK LGGLVRVVVK KVDVRDLVKV NFPFSYGERS RVSLKGMACE LLGFGSWKPK REICPRDLAN EVLDVEVVKF LSVDAYVCHE IAFKMLKYQA QFIVIRFIKH YFTEQTLGRY WANLYNKASL LKSKDSAKTE VRRNRYKVSV DADEGRRRRE DNMVEIRKNK REENLQKKRR EGFNPSMASQ PGQDFSSSLP TETRLENIQQ MIAGVMSEDR DLQLEATASF RRLLSIERNP PINEVVQSGV VPHIVQFLSR DDFTQLQFEA AWALTNIASG TSENTRVIID SGAVPLFVKL LSSASEEVRE QAVWALGNVA GDSPKCRDHV LSCEAMMSLL AQFHEHSKLS MLRNATWTLS NFCRGKPQPA FEQTKAALPA LERLLHSTDE EVLTDASWAL SYLSDGTNEK IQTVIDAGVI PRLVQLLAHP SPSVLIPALR TIGNIVTGDD IQTQAVISSQ ALPGLLNLLK NTYKKSIKKE ACWTISNITA GNTSQIQEVF QAGIIRPLIN LLEIGEFEIK KEAVWAISNA TSGGNHDQIK FLVSQGCIRP LCDLLPCPDP RVVTVTLEGL ENILKVGEAE KNLGNTGNDN LYAQMIEDAD GLDKIENLQS HDNNEIYEKA VKILESYWAA DDEEEDIGGV DAPENVQSSG FQFGNQSGNA PTGGFNFG // ID NC003070_119 HYPOTHETICAL; PRT; 247 AA. AC NC003070_119; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[588367...588540, 588663...589232]; Length: DE 744. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 58 FIRST EXON; p-value: 0.000. FT GENSCAN 59 247 LAST EXON; p-value: 0.000. SQ SEQUENCE 247 AA; 27456 MW; 776DEBBD5AAE378D CRC64; MMMQSRLLAF ASAARSRVRP IAQRRLAFGS STSGRTADPE IHAGNDGADP AIYPRDPEGM DDVANPKTAA EEIVDDTPRP SLEEQPLVPP KSPRATAHKL ESTPVGHPSE PHFQQKRKNS TASPPSLDSV SCAGLDGSPW PRDEGEVEEQ RRREDETESD QEFYKHHKAS PLSEIEFADT RKPITQATDG TAYPAGKDVI GWLPEQLDTA EESLMKATMI FKRNAERGDP ETFPHSRILR EMRGEWF // ID NC003070_120 HYPOTHETICAL; PRT; 96 AA. AC NC003070_120; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[589996...589706]; Length: 291. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 96 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 96 AA; 8127 MW; 2248DEE26D243DF7 CRC64; MEAEIVGEAS AAVIMIGGLG LFGTHSLQAG GNGGGSGKGQ WLHGGGGEGG GGEGGGGEGG GGQKISKGGG GGGSGGGQRS SSGGGGGGGE GDGGGG // ID NC003070_121 HYPOTHETICAL; PRT; 361 AA. AC NC003070_121; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[592115...593200]; Length: 1086. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 361 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 361 AA; 41157 MW; 499397B91DE277A1 CRC64; MHWITRFSAF FSAALAMILL SPSLQSFSPA AAIRSSHPYA DEFKPQQNSD YSSFRESPMF RNAEQCRSSG EDSGVCNPNL VHVAITLDID YLRGSIAAVN SILQHSMCPQ SVFFHFLVSS ESQNLESLIR STFPKLTNLK IYYFAPETVQ SLISSSVRQA LEQPLNYARN YLADLLEPCV KRVIYLDSDL VVVDDIVKLW KTGLGQRTIG APEYCHANFT KYFTGGFWSD KRFNGTFKGR NPCYFNTGVM VIDLKKWRQF RFTKRIEKWM EIQKIERIYE LGSLPPFLLV FAGHVAPISH RWNQHGLGGD NVRGSCRDLH SGPVSLLHWS GSGKPWLRLD SKLPCPLDTL WAPYDLYKHS H // ID NC003070_122 HYPOTHETICAL; PRT; 1106 AA. AC NC003070_122; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[598473...597521, 597382...596572, DE 596478...595019, 594059...593963]; Length: 3321. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 317 FIRST EXON; p-value: 0.000. FT GENSCAN 318 318 AA on splice site: ag/a -> R. FT GENSCAN 319 588 INTERNAL EXON; p-value: 0.000. FT GENSCAN 589 1074 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1075 1075 AA on splice site: ct/a -> L. FT GENSCAN 1076 1106 LAST EXON; p-value: 0.000. SQ SEQUENCE 1106 AA; 123802 MW; 13808CBC19FF4595 CRC64; MVKSAASQSP SPVTITVTPC KGSGDRSLGL TSPIPRASVI TNQNSPLSSR ATRRTSISSG NRRSNGDEGR YCSMSVEDLT AETTNSECVL SYTVHIPPTP DHQTVFASQE SEEDEMLKGN SNQKSFLSGT IFTGGFKSVT RGHVIDCSMD RADPEKKSGQ ICWLKGCDEK VVHGRCECGF RICRDCYFDC ITSGGGNCPG CKEPYRDIND DPETEEEDEE DEAKPLPQMG ESKLDKRLSV VKSFKAQNQA GDFDHTRWLF ETKGTYGYGN AVWPKDGYGI GSGGGGNGYE TPPEFGERSK RPLTRKVSVS AAIISPYRLL IALRLVALGL FLTWRVRHPN REAMWLWGMS TTCELWFALS WLLDQLPKLC PVNRLTDLGV LKERFESPNL RNPKGRSDLP GIDVFVSTAD PEKEPPLVTA NTILSILAVD YPVEKLACYL SDDGGALLTF EALAQTASFA STWVPFCRKH NIEPRNPEAY FGQKRNFLKN KVRLDFVRER RRVKREYDEF KVRINSLPEA IRRRSDAYNV HEELRAKKKQ MEMMMGNNPQ ETVIVPKATW MSDGSHWPGT WSSGETDNSR GDHAGIIQAM LAPPNAEPVY GAEADAENLI DTTDVDIRLP MLVYVSREKR PGYDHNKKAG AMNALVRTSA IMSNGPFILN LDCDHYIYNS MALREGMCFM LDRGGDRICY VQFPQRFEGI DPNDRYANHN TVFFDVSMRA LDGLQGPMYV GTGCIFRRTA LYGFSPPRAT EHHGWLGRRK VKISLRRPKA MMKKDDEVSL PINGEYNEEE NDDGDIESLL LPKRFGNSNS FVASIPVAEY QGRLIQDLQG KGKNSRPAGS LAVPREPLDA ATVAEAISVI SCFYEDKTEW GKRVGWIYGS VTEDVVTGYR MHNRGWRSIY CVTKRDAFRG TAPINLTDRL HQVLRWATGS VEIFFSRNNA IFATRRMKFL QRVAYFNVGM YPFTSLFLIV YCILPAISLF SGQFIVQSLD ITFLIYLLSI TLTLCMLSLL EIKWSGITLH EWWRNEQFWV IGGTSAHPAA VLQGLLKVIA GVDISFTLTS KSSAPEDGDD EFADLTYLTA GTTLKTKFTL LNCVYGLVEL SEATLL // ID NC003070_123 HYPOTHETICAL; PRT; 1304 AA. AC NC003070_123; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[612225...612019, 611920...611618, DE 611529...611026, 610935...610716, 609432...609280, 608938...608816, DE 608711...608398, 608309...608242, 606640...606529, 606322...606176, DE 606102...605919, 605805...605705, 605604...605515, 605403...605178, DE 604126...603945, 603727...603569, 603114...602932, 602021...601812, DE 601706...601653, 600825...600721, 600513...600434, 600234...600065, DE 599675...599656]; Length: 3915. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 69 FIRST EXON; p-value: 0.000. FT GENSCAN 70 170 INTERNAL EXON; p-value: 0.000. FT GENSCAN 171 338 INTERNAL EXON; p-value: 0.000. FT GENSCAN 339 411 INTERNAL EXON; p-value: 0.000. FT GENSCAN 412 412 AA on splice site: g/tt -> V. FT GENSCAN 413 462 INTERNAL EXON; p-value: 0.000. FT GENSCAN 463 463 AA on splice site: c/gc -> R. FT GENSCAN 464 503 INTERNAL EXON; p-value: 0.000. FT GENSCAN 504 504 AA on splice site: g/gt -> G. FT GENSCAN 505 608 INTERNAL EXON; p-value: 0.000. FT GENSCAN 609 630 INTERNAL EXON; p-value: 0.000. FT GENSCAN 631 631 AA on splice site: gc/c -> A. FT GENSCAN 632 668 INTERNAL EXON; p-value: 0.000. FT GENSCAN 669 717 INTERNAL EXON; p-value: 0.000. FT GENSCAN 718 778 INTERNAL EXON; p-value: 0.000. FT GENSCAN 779 779 AA on splice site: g/ct -> A. FT GENSCAN 780 812 INTERNAL EXON; p-value: 0.000. FT GENSCAN 813 842 INTERNAL EXON; p-value: 0.000. FT GENSCAN 843 917 INTERNAL EXON; p-value: 0.000. FT GENSCAN 918 918 AA on splice site: g/aa -> E. FT GENSCAN 919 978 INTERNAL EXON; p-value: 0.000. FT GENSCAN 979 1031 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1032 1092 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1093 1162 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1163 1180 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1181 1215 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1216 1241 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1242 1242 AA on splice site: ag/g -> R. FT GENSCAN 1243 1298 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1299 1299 AA on splice site: g/ta -> V. FT GENSCAN 1300 1304 LAST EXON; p-value: 0.000. SQ SEQUENCE 1304 AA; 145556 MW; 4D452E4F535FFCC9 CRC64; MANARSLVAK ANNINVGSLI LMALVFGSCV ANGEYLGGRR GLAANSGNPT VYDITKFGAV GDGSTNTFKA FLNTWIQVCD SPVPATLLVP KGTFLAGPVI FAGPCKSKVT VNVIGTIIAT TSGYATPEWF LFERVDNLVL TGTGTFHGKG EAVWKADGCG KKVQCNLPPT SLKFRNMKNV EINGISSVNA KAFHMFLVKT ENVNIQNIKL TAPAESPNTD GIHLSNADNV SILDSTIATG DDCVSVGRGS NNVTVERVIC GPGHGLSVGS LGKYKNEEDV SGIHVNNCTM IETDNGLRIK TWGGSDPSKA VDIKFENIIM QSVKNPIIID QNYGSRGGDS QVAISDILFK NIRGTTITKD VVQIMCSKSV PCQGVNVVDV NLDYVGKTGG EKKSSSGGLV GALCDNANVI FVLRHTNLQK RGEAKMVSLK LQKRLAASVM KCGKGKVWLD PNESSDISMA NSRQNIRKLV KDGFIIRKPT KIHSRSRARK MKIAKMKGRH SGYGKRKGTR EARLPTKVLW MRRMRVLRRL LKKYRETKKI DKHMYHDMYM RVKGNVFKNK RVLMESIHKS KAEKAREKTL SDQFEAKRAK NKASRERKHA RREERLAKGP GGDVAPVAAP APAATPAPTA AEEEENMDMS VIMRYGDDKA EELCLEVEDY WARVDESDGF DVEGIQAPPG GTPLIHYDCH LPNSRHPDPV LVKLYASAGL HRYNMLEGTN FKLVDVMKFN KLMMHLSPFY ITLLAQDPVS RSQQTFQVQV DEHCLSTMDL TVLIARPKAV STNESVLAPQ SFSVEESPEW PSDFNDGKRF YRVKESELRN NDWISLYLQL VLVSHDRMRI SDSDLSKLKI VEAVIETKDD MLPPNERLLN AKTAIVYITF KGFTKCRIGD EHTERKTIVR RIFDEDTGHL SIKGELIEES HSSSNMADDN SPCTNSHFFA FSNLSLSDKG HDDVPSFAAD ESLIGFLSST LESGSTSLRR KFDLKGFEKK QKRISSRHLY DVMLYDPPYA SSPFCYIQDI STKTTKQPKR ILHDDVLRRY GDTIQLNHNH LRKTGMDTPL LYRRQLLQTL PNLDKALKYS PLINSRAKEK KKMGSPNAAA ETDLTTDDFI GDTRRDSGSD TETNTDCDGE DLPLLLLPAP PGHFEEGERV LAKHSDCFYE AKVLKVEFKD NEWKYFVHYI EKNVLPSDNL LSFNIPPALR KQLLDDFEFV TQMQKLVQLP RSPNVDGILK KYIDSQMKKH GRVTDSLEEI LKGLRCYFDK ALPVMLLYNN ERKQYEESVS GGVSPSTVYG AEHLLRLFVF CNET // ID NC003070_124 HYPOTHETICAL; PRT; 155 AA. AC NC003070_124; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[614140...613994, 613706...613386]; Length: DE 468. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 49 FIRST EXON; p-value: 0.000. FT GENSCAN 50 155 LAST EXON; p-value: 0.000. SQ SEQUENCE 155 AA; 16894 MW; ACB272132DC88EF3 CRC64; MGESNMQYVT STSFLLLTYA KYLTSARTVA YCGGSVVTPA RLRSIAKKQV DYLLGGNPLK MSYMVGYGLK YPRRIHHRGS SLPSVAVHPT RIQCHDGFSL FTSQSPNPND LVGAVVGGPD QNDQFPDERS DYGRSEPATY INAPLVGALA YLARS // ID NC003070_125 HYPOTHETICAL; PRT; 621 AA. AC NC003070_125; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[615999...616068, 616657...616764, DE 618336...619328, 619639...620333]; Length: 1866. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 23 FIRST EXON; p-value: 0.000. FT GENSCAN 24 24 AA on splice site: g/ac -> D. FT GENSCAN 25 59 INTERNAL EXON; p-value: 0.000. FT GENSCAN 60 60 AA on splice site: a/tt -> I. FT GENSCAN 61 390 INTERNAL EXON; p-value: 0.000. FT GENSCAN 391 391 AA on splice site: g/ct -> A. FT GENSCAN 392 621 LAST EXON; p-value: 0.000. SQ SEQUENCE 621 AA; 68959 MW; B36BE21349090994 CRC64; MVDARARGRG REAIGEKQDE REKDQKRRRK RWSLESERTN VRVVGIVSKL ITNQGENRKI LASSQTLSNS STICKTTPDP KYCKSVFPHS QGNVQQYGCF SIRKSLSQSR KFIRTVDRYI KRNAHLSQPA VIRALQDCRF LAGLTMDYLL TSFETVNDTS AKTSFKPLSF PKADDIQTLL SAALTNEQTC LEGLTTAASY SATWTVRTGV ALPLVNDTKL LGVSLALFTK GWVPKKKKRA GFAWAQPRSG SSTHTKPFRL FRNGALPLKM TEKTKAVYES LSRRKLADGD SNGDGDDGSM VLISDIVTVS QDGTGNFTNI TAAVAAAPNN TDGSAGFFLI YVTAGIYEEY ISIAKNKRYM MMIGDGINQT VVTGNRSVVD GWTTFNSATF AVTAPNFVAV NITFRNTAGP EKHQAVALRS GADFSIFYSC SFEAYQDTLY THSLRQFYRE CDVYGTVDFI FGNAAVVFQN CNLYPRKPMP NQFNAITAQG RSDPNQNTGT SIQNCTIKPA DDLVSSNYTV KTYLGRPWKE YSRTVYMQSY IDGFVEPVGW REWNGDFALS TLYYAEYNNT GPGSNTTNRV TWPGYHVINS TDAANFTVTG LFIEADWIWK TGVPYTSGLI S // ID NC003070_126 HYPOTHETICAL; PRT; 190 AA. AC NC003070_126; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[622696...622638, 622126...621933, DE 621233...620914]; Length: 573. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: 0.000. FT GENSCAN 20 20 AA on splice site: at/a -> I. FT GENSCAN 21 84 INTERNAL EXON; p-value: 0.000. FT GENSCAN 85 85 AA on splice site: g/gc -> G. FT GENSCAN 86 190 LAST EXON; p-value: 0.000. SQ SEQUENCE 190 AA; 21722 MW; 41026A78F544BDF7 CRC64; MEDIAYFSFG KVFDEEDNTI NKRVSSQSLR LRFAIKSATT LWRLVKLIDL EVRRCDPNGE LEFVSVPNNL NDVEPQEENF HLHAGPWALR GRDRPITIKP TTTLWAVHEI LARELLRSST NEELDVVAVT RHLGDVDPKE KNFDAYAYQP YDPSSCYDAG DYGFVFDLVR VYGELTRGVE TDFENASSSV // ID NC003070_127 HYPOTHETICAL; PRT; 91 AA. AC NC003070_127; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[624304...624231, 624134...623933]; Length: DE 276. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 24 FIRST EXON; p-value: 0.000. FT GENSCAN 25 25 AA on splice site: ag/a -> R. FT GENSCAN 26 91 LAST EXON; p-value: 0.000. SQ SEQUENCE 91 AA; 9828 MW; 1DDC7418CB67983F CRC64; MARSLANAKI QSVFGSEKLS NAVFRRGFAA AAKTALDGSV STAEMKKRAG EASSEKAPWV PDPKTGYYRP ETVSEEIDPA ELRAILLNNK Q // ID NC003070_128 HYPOTHETICAL; PRT; 127 AA. AC NC003070_128; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[625608...625373, 625292...625145]; Length: DE 384. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 78 FIRST EXON; p-value: 0.000. FT GENSCAN 79 79 AA on splice site: cg/g -> R. FT GENSCAN 80 127 LAST EXON; p-value: 0.000. SQ SEQUENCE 127 AA; 14451 MW; 29915840C8AB2C4F CRC64; MARVGAKSSG AGAKKKGVSF VIDCSKPVDD TILEIATLEK FLQERIKVRG KAGALGNSVS ITRYNGKINV NANSNFSKRY LKYLTKKYLK KYNLRDWLRV IASNKDKNVY EVRYFRIDDE VASYEED // ID NC003070_129 HYPOTHETICAL; PRT; 675 AA. AC NC003070_129; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[626918...627004, 627100...627165, DE 627308...627382, 627482...627614, 627701...627747, 627835...627892, DE 627970...628019, 628112...628168, 628246...628335, 629288...629321, DE 629402...629494, 629580...629599, 630832...630952, 631104...631264, DE 631475...632043, 632128...632348, 632421...632449, 632647...632693, DE 632716...632785]; Length: 2028. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 29 FIRST EXON; p-value: 0.000. FT GENSCAN 30 51 INTERNAL EXON; p-value: 0.000. FT GENSCAN 52 76 INTERNAL EXON; p-value: 0.000. FT GENSCAN 77 120 INTERNAL EXON; p-value: 0.000. FT GENSCAN 121 121 AA on splice site: g/tt -> V. FT GENSCAN 122 136 INTERNAL EXON; p-value: 0.000. FT GENSCAN 137 155 INTERNAL EXON; p-value: 0.000. FT GENSCAN 156 156 AA on splice site: g/gg -> G. FT GENSCAN 157 172 INTERNAL EXON; p-value: 0.000. FT GENSCAN 173 191 INTERNAL EXON; p-value: 0.000. FT GENSCAN 192 221 INTERNAL EXON; p-value: 0.000. FT GENSCAN 222 232 INTERNAL EXON; p-value: 0.000. FT GENSCAN 233 233 AA on splice site: t/ct -> S. FT GENSCAN 234 263 INTERNAL EXON; p-value: 0.000. FT GENSCAN 264 264 AA on splice site: g/gt -> G. FT GENSCAN 265 270 INTERNAL EXON; p-value: 0.000. FT GENSCAN 271 310 INTERNAL EXON; p-value: 0.000. FT GENSCAN 311 311 AA on splice site: g/ga -> G. FT GENSCAN 312 364 INTERNAL EXON; p-value: 0.000. FT GENSCAN 365 553 INTERNAL EXON; p-value: 0.000. FT GENSCAN 554 554 AA on splice site: tg/g -> W. FT GENSCAN 555 627 INTERNAL EXON; p-value: 0.000. FT GENSCAN 628 628 AA on splice site: t/tg -> L. FT GENSCAN 629 637 INTERNAL EXON; p-value: 0.000. FT GENSCAN 638 652 INTERNAL EXON; p-value: 0.000. FT GENSCAN 653 653 AA on splice site: gg/g -> G. FT GENSCAN 654 675 LAST EXON; p-value: 0.000. SQ SEQUENCE 675 AA; 75333 MW; DB7CE273B62E8B98 CRC64; MSSRSSRTVY VGNLPGDIRE REVEDLFSKY GPVVQIDLKV PPRPPGYAFV EFDDARDAED AIHGRDGYDF DGHRLRVELA HGGRRSSDDT RGSFNGGGRG GGRGRGDGGS RGPSRRSEFR VLVTGLPSSA SWQDLKDHMR KGGDVCFSQV YRDARGTTGV VDYTCYEDMK YALKKLDDTE FRNAFSNGYV RVREYDSRKD SRSPSRGRSY SKSRSRSRGR SIAFKIKISS TFSFLYHRFR RKEARALASQ VQPRVLSTLG VHRGKMGDIQ VEGAADEDGR TPSIWDVFAH AGLVSSSLDV SLNSSRILAR GRCEAHGRHG IGGLQILYIL VKAFTKYDFS TTRLINQSKR NIINGLMKNM ILFQALEDEY GGWLSQEIVY DSSPFTFFLI GDIKLIHQSF CNDYCYRRDF TAYADTCFKE FGDRVSHWTT INEVNVFALG GYDQGITPPA RCSPPFGLNC TKGNSSIEPY IAVHNMLLAH ASATILYKQQ YKVLLSASLP SSICIAFCYV LFITQYKQHG SVGISVYTYG AVPLTNSVKD KQATARVNDF YIGWILHPLV FGDYPETMKT NVGSRLPAFT EEESEQVKGA FDFVGVINYM ALYVKDNSSS LKPNLQDFNT DIAVEMTLVG NTSIENETLV CFFMCLKLSL SVGTNDSSQL ITCGHNKGEV SELVH // ID NC003070_130 HYPOTHETICAL; PRT; 377 AA. AC NC003070_130; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[635474...635651, 635739...635986, DE 636081...636359, 636466...636729, 636828...636886, 636978...637083]; DE Length: 1134. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 59 FIRST EXON; p-value: 0.000. FT GENSCAN 60 60 AA on splice site: g/tt -> V. FT GENSCAN 61 142 INTERNAL EXON; p-value: 0.000. FT GENSCAN 143 235 INTERNAL EXON; p-value: 0.000. FT GENSCAN 236 323 INTERNAL EXON; p-value: 0.000. FT GENSCAN 324 342 INTERNAL EXON; p-value: 0.000. FT GENSCAN 343 343 AA on splice site: ag/c -> S. FT GENSCAN 344 377 LAST EXON; p-value: 0.000. SQ SEQUENCE 377 AA; 42946 MW; 0CECB0571A6DF136 CRC64; MKFCKKYEEY MQGQKEKKNL PGVGFKKLKK ILKRCRRNHV PSRISFTDAI NHNCSRECPV CDGTFFPELL KEMEDVVGWF NEHAQKLLEL HLASGFTKCL TWLRGNSRKK DHHGLIQEGK DLVNYALINA VAIRKILKKY DKIHESRQGQ AFKTQVQKMR IEILQSPWLC ELMAFHINLK ESKKESGATI TSPPPPVHAL FDGCALTFDD GKPLLSCELS DSVKVDIDLT CSICLVLFHS PLAGSTSKTK SLAMNIFKCN AINKSLTYGL QLIGEKQDTV FDPISLTCGH IYCYMCACSA ASVNVVDGLK TAEATEKCPL CREDGVYKGA VHLDELNILL KRSCRDYWEE RRKTERAERL QQAKEYWDYQ CRSFTGI // ID NC003070_131 HYPOTHETICAL; PRT; 176 AA. AC NC003070_131; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[638603...638258, 637514...637330]; Length: DE 531. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 115 FIRST EXON; p-value: 0.000. FT GENSCAN 116 116 AA on splice site: g/gg -> G. FT GENSCAN 117 176 LAST EXON; p-value: 0.000. SQ SEQUENCE 176 AA; 20085 MW; A72E28F8DC0F7E9F CRC64; MARSRRKYRN SRAKVRVALP KKNPNIFKPA FNFPPKLRAL MGDDVPEWDD QASVIQNYKS FGVISNPNLL GIRARTDHMI QDDSLNVPPP VEPPTDDPIA KEFEPIDSGS ELEEDGHVPR QEAKLDAAFG CNTSKAMYEI SDLQRQEPDF SFMMMMMISV SFVQFCYKLA TTLQWL // ID NC003070_132 HYPOTHETICAL; PRT; 106 AA. AC NC003070_132; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[641161...641176, 641556...641692, DE 641762...641929]; Length: 321. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 5 FIRST EXON; p-value: 0.000. FT GENSCAN 6 6 AA on splice site: g/tc -> V. FT GENSCAN 7 51 INTERNAL EXON; p-value: 0.000. FT GENSCAN 52 106 LAST EXON; p-value: 0.000. SQ SEQUENCE 106 AA; 12163 MW; 74447BBFA72C1E51 CRC64; MTPLRVHSKP LDETLRIFSD LPLVDPLTRA DLTAKAKKQT SVTYGSIALR LILKSNPVQL KSISIGEGYS INDDELELTA YFNVVRRREI DENDDRIWLS FLEEEK // ID NC003070_133 HYPOTHETICAL; PRT; 1521 AA. AC NC003070_133; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[654338...654062, 651885...651731, DE 651539...651320, 651219...651128, 650904...650791, 650709...650528, DE 650445...649762, 649675...649551, 649543...649368, 649139...648911, DE 648656...648465, 648432...648362, 648275...648102, 647995...647924, DE 647620...647531, 647321...647184, 646956...646777, 646689...646567, DE 646310...646126, 646037...645899, 645814...645755, 645679...645575, DE 645485...645376, 644261...644160, 643949...643877, 643745...643644, DE 643554...643305, 643208...643063]; Length: 4566. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 92 FIRST EXON; p-value: NaN. FT GENSCAN 93 93 AA on splice site: g/at -> D. FT GENSCAN 94 144 INTERNAL EXON; p-value: NaN. FT GENSCAN 145 217 INTERNAL EXON; p-value: NaN. FT GENSCAN 218 218 AA on splice site: g/gt -> G. FT GENSCAN 219 248 INTERNAL EXON; p-value: NaN. FT GENSCAN 249 286 INTERNAL EXON; p-value: NaN. FT GENSCAN 287 346 INTERNAL EXON; p-value: NaN. FT GENSCAN 347 347 AA on splice site: ac/t -> T. FT GENSCAN 348 574 INTERNAL EXON; p-value: NaN. FT GENSCAN 575 575 AA on splice site: ag/t -> S. FT GENSCAN 576 616 INTERNAL EXON; p-value: NaN. FT GENSCAN 617 617 AA on splice site: g/gg -> G. FT GENSCAN 618 675 INTERNAL EXON; p-value: NaN. FT GENSCAN 676 751 INTERNAL EXON; p-value: NaN. FT GENSCAN 752 752 AA on splice site: g/gg -> G. FT GENSCAN 753 815 INTERNAL EXON; p-value: NaN. FT GENSCAN 816 816 AA on splice site: g/cg -> A. FT GENSCAN 817 839 INTERNAL EXON; p-value: NaN. FT GENSCAN 840 897 INTERNAL EXON; p-value: NaN. FT GENSCAN 898 921 INTERNAL EXON; p-value: NaN. FT GENSCAN 922 951 INTERNAL EXON; p-value: NaN. FT GENSCAN 952 997 INTERNAL EXON; p-value: NaN. FT GENSCAN 998 1057 INTERNAL EXON; p-value: NaN. FT GENSCAN 1058 1098 INTERNAL EXON; p-value: NaN. FT GENSCAN 1099 1159 INTERNAL EXON; p-value: NaN. FT GENSCAN 1160 1160 AA on splice site: ag/g -> R. FT GENSCAN 1161 1206 INTERNAL EXON; p-value: NaN. FT GENSCAN 1207 1226 INTERNAL EXON; p-value: NaN. FT GENSCAN 1227 1261 INTERNAL EXON; p-value: NaN. FT GENSCAN 1262 1297 INTERNAL EXON; p-value: NaN. FT GENSCAN 1298 1298 AA on splice site: at/a -> I. FT GENSCAN 1299 1331 INTERNAL EXON; p-value: NaN. FT GENSCAN 1332 1332 AA on splice site: ag/g -> R. FT GENSCAN 1333 1356 INTERNAL EXON; p-value: NaN. FT GENSCAN 1357 1390 INTERNAL EXON; p-value: NaN. FT GENSCAN 1391 1473 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1474 1474 AA on splice site: a/gc -> S. FT GENSCAN 1475 1521 LAST EXON; p-value: 0.000. SQ SEQUENCE 1521 AA; 167824 MW; 3480BE1ACAE81CF9 CRC64; MDKSFTLFLT LTILVVFIIS SPPVQAGFAN DLGGVAWATT GDNGSGCHGS IAECIGAEEE EMDSEINRRI LATTKYISYQ SLKRNSVPCS RRDKTDFCSE NLFFESSGGD GGGEEEELER EIWLIRDVAL LLPSVFVLQQ VHPHAAAEPA SSSSASEVPI DNQAPVSDPG SISGDPELRT SDPQSNDAER PVTTTDVPAM ETDTNPELEG LVTPTPAGEV VVEAEKSKSS KKRIAKAPWA KLLSQFPQNP HLVMRGSVFT VGRRACDLCI RDHSMPNVLC ELRQSEHGGP SVASLEIIGN GVLVQVNGKI YQRSTCVHLR GGDEIIFTTP GKHAYVSFYK FFEKLSTVQI FQPLKDENLA APDRTSSLSL FEAQSAPLKG LHVETRARDS SSVDGTASLL ASISKLQNVP FLPPTAKSVK RQQNSEVPVL PSSCDDFILD VDLNDADSNN DHAAIASMEK TVASTSCAAN DDHDADGNGM DPFQEPEAGN IPDPAYEIRP ILSLLGDPSE FDLRGSISKI LVDERREVRE MPKEYERPSA SVLTRRQAHK DSLRGGILNP QDIEVSFENF PYFLSGTTKD VLMISTYAHI KYGKEYAEYA SDLPTACPRI LLSGPSGKLW TSIVYESFVS HFHFPNKFSY GIFEGSEIYQ EMLAKALAKQ CGAKLMIVDS LLLPGGSTPK EADTTKESSR RERLSVLAKR AVQAAQAAVL QHKKPISSVE AGITGGSTLS SQAVRRQEVS TATSKSYTFK AGDRQQVFKE KYSLRLKAMD LQKSGLDLID RYQMAMILVV YAKKTMVFFV LVIFVLISTT HLRLKASSLR LESSSSDDAD KLAINEIFEV AFNESERGSL ILFLKDIEKS VSGNTDVYIT LKSKLENLPE NIVVIASQTQ LDNRKEKSHP GGFLFTKFGS NQTALLDLAF PDEASLVDWK DKLERDTEIL KAQANITSIR AHLVICLIEN HMINRCGESG WLCFQSPSYE LLRTYSQGQQ AYHLGRKDVV TENEFEKKLL SDVIPPSDIG VSFSDIGALE NVKDTLKELV MLPLQRPELF GKGQLTKPTK GILLFGPPGT GKTMLAKAVA TEAGANFINI SMSSITSKVD SMLGRRENPG EHEAMRKMKN EFMINWDGLR TKDKERVLVL AATNRPFDLD EAVIRRLPRR LMVNLPDSAN RSKILSVILA KEEMAEDVDL EAIANMTDGY SGSDLKNLCV TAAHLPIREI LEKEKKERSV AQAENRAMPQ LYSSTDVRPL NMNDFKTAHD QVCASVASDS SNMNELQQWN ELYGEGGSRK KTSLSYFIAA KLRLCADGGA NRIYDELPLF FPNEDALAIR NRYKPDVIKG DMDSIRRDVL DFYINLGTKV IDESHDQDTT DLDKCILYIR HSTLNQETSG LQILATGALG GRFDHEAGNL NVLYRYPDTR IVLLSDDCLI QLLPKTHRHE IHIQSSLEGP HCGLIPIGTP SAKTTTSGLQ WDLSNTEMRF GGLISTSNLV KEEKITVESD SDLLWTISIK KTGLSIQDHT P // ID NC003070_134 HYPOTHETICAL; PRT; 453 AA. AC NC003070_134; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[658125...657856, 657495...657392, DE 657277...657142, 657058...656827, 656745...656588, 656505...656317, DE 656234...656134, 656029...655933, 655823...655749]; Length: 1362. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 90 FIRST EXON; p-value: NaN. FT GENSCAN 91 124 INTERNAL EXON; p-value: NaN. FT GENSCAN 125 125 AA on splice site: cg/g -> R. FT GENSCAN 126 170 INTERNAL EXON; p-value: NaN. FT GENSCAN 171 247 INTERNAL EXON; p-value: NaN. FT GENSCAN 248 248 AA on splice site: g/gt -> G. FT GENSCAN 249 300 INTERNAL EXON; p-value: NaN. FT GENSCAN 301 363 INTERNAL EXON; p-value: NaN. FT GENSCAN 364 396 INTERNAL EXON; p-value: NaN. FT GENSCAN 397 397 AA on splice site: ag/g -> R. FT GENSCAN 398 429 INTERNAL EXON; p-value: NaN. FT GENSCAN 430 453 LAST EXON; p-value: NaN. SQ SEQUENCE 453 AA; 50740 MW; 8E48AFF237AE57DD CRC64; MAVATAPSLN RHFPRRISNL YSRVKQRRPW LPPGDATLFN SRRNWDSHLF VYASSSSSPS SSPPSPNSPT DDLTAELCVN TGLDLFKRGR VKDALVQFET ALSLAPNPIE SQAAYYNKAC CHAYRGEGKK AVDCLRIALR DYNLKFATIL NDPDLASFRA LPEFKELQEE ARLGGEDIGD NFRRDLKLIS EVRAPFRGVR KFFYFAFAAA AGISMFFTVP RLVQAIRGGD GAPNLLETTG NAAINIGGIV VMVSLFLWEN KKEEEQMVQI TRDETLSRLP LRLSTNRVVE LVQLRDTVRP VILAGKKETV TLAMQKADRF RTELLRRGVL LVPVVWGERK TPEIEKKGFG ASSKAATSLP SIGEDFDTRA QSVVAQSKLK GEIRFKAETV SPGEWERWIR DQQISEGVNP GDDVYIILRL DGRVRRSGRG MPDWAEISKE LPPMDDVLSK LER // ID NC003070_135 HYPOTHETICAL; PRT; 209 AA. AC NC003070_135; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[659705...659556, 659471...659423, DE 659316...658886]; Length: 630. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 50 FIRST EXON; p-value: NaN. FT GENSCAN 51 66 INTERNAL EXON; p-value: NaN. FT GENSCAN 67 67 AA on splice site: g/aa -> E. FT GENSCAN 68 209 LAST EXON; p-value: NaN. SQ SEQUENCE 209 AA; 23599 MW; FD2CC0FD28A31ACC CRC64; MAGIKVFGHP ASTATRRVLI ALHEKNLDFE FVHIELKDGE HKKEPFIFRN PFGKVPAFED GDFKLFESRA ITQYIAHFYS DKGNQLVSLG SKDIAGIAMG IEIESHEFDP VGSKLVWEQV LKPLYGMTTD KTVVEEEEAK LAKVLDVYEH RLGESKYLAS DKFTLVDLHT IPVIQYLLGT PTKKLFDERP HVSAWVADIT SRPSAKKVL // ID NC003070_136 HYPOTHETICAL; PRT; 208 AA. AC NC003070_136; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[662191...662042, 661949...661901, DE 661790...661363]; Length: 627. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 50 FIRST EXON; p-value: NaN. FT GENSCAN 51 66 INTERNAL EXON; p-value: NaN. FT GENSCAN 67 67 AA on splice site: g/aa -> E. FT GENSCAN 68 208 LAST EXON; p-value: NaN. SQ SEQUENCE 208 AA; 23487 MW; 113CED008A11902F CRC64; MAGIKVFGHP ASTATRRVLI ALHEKNVDFE FVHVELKDGE HKKEPFILRN PFGKVPAFED GDFKIFESRA ITQYIAHEFS DKGNNLLSTG KDMAIIAMGI EIESHEFDPV GSKLVWEQVL KPLYGMTTDK TVVEEEEAKL AKVLDVYEHR LGESKYLASD HFTLVDLHTI PVIQYLLGTP TKKLFDERPH VSAWVADITS RPSAQKVL // ID NC003070_137 HYPOTHETICAL; PRT; 968 AA. AC NC003070_137; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[670851...670707, 669877...669784, DE 669569...669461, 669266...669129, 668987...668856, 668745...668613, DE 668468...668173, 668082...667957, 667868...667735, 667641...667541, DE 667464...667373, 667281...667151, 666472...666351, 666262...666114, DE 665910...665862, 665777...665397, 663902...663795, 663545...663079]; DE Length: 2907. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 48 FIRST EXON; p-value: NaN. FT GENSCAN 49 49 AA on splice site: a/aa -> K. FT GENSCAN 50 79 INTERNAL EXON; p-value: NaN. FT GENSCAN 80 80 AA on splice site: cg/a -> R. FT GENSCAN 81 116 INTERNAL EXON; p-value: NaN. FT GENSCAN 117 162 INTERNAL EXON; p-value: NaN. FT GENSCAN 163 206 INTERNAL EXON; p-value: NaN. FT GENSCAN 207 250 INTERNAL EXON; p-value: NaN. FT GENSCAN 251 251 AA on splice site: g/gt -> G. FT GENSCAN 252 349 INTERNAL EXON; p-value: NaN. FT GENSCAN 350 391 INTERNAL EXON; p-value: NaN. FT GENSCAN 392 435 INTERNAL EXON; p-value: NaN. FT GENSCAN 436 436 AA on splice site: gg/t -> G. FT GENSCAN 437 469 INTERNAL EXON; p-value: NaN. FT GENSCAN 470 470 AA on splice site: g/at -> D. FT GENSCAN 471 500 INTERNAL EXON; p-value: NaN. FT GENSCAN 501 543 INTERNAL EXON; p-value: NaN. FT GENSCAN 544 544 AA on splice site: tg/g -> W. FT GENSCAN 545 584 INTERNAL EXON; p-value: NaN. FT GENSCAN 585 585 AA on splice site: g/tt -> V. FT GENSCAN 586 634 INTERNAL EXON; p-value: NaN. FT GENSCAN 635 650 INTERNAL EXON; p-value: NaN. FT GENSCAN 651 651 AA on splice site: g/aa -> E. FT GENSCAN 652 777 INTERNAL EXON; p-value: NaN. FT GENSCAN 778 778 AA on splice site: c/ag -> Q. FT GENSCAN 779 813 INTERNAL EXON; p-value: NaN. FT GENSCAN 814 814 AA on splice site: g/aa -> E. FT GENSCAN 815 968 LAST EXON; p-value: NaN. SQ SEQUENCE 968 AA; 110189 MW; 70473A1C20CEE516 CRC64; MTGDGGFADV ILDLIENGFK VVVVHPTGDS TSQLFTLSPD YHRLIDSKKE VNISDKGPYP SILTGKRRYK GEHRQDRFER SLCFIVYRVI LKLLFGLLEE EEENDISPKE NAKEQERGLL LQTKLEKLIG TYNISILRSS LFYDYLTSVK FHKELLCCFQ VVLRKISSSL EDVVEVPDSP EESYISSSRR TGLACISAQE NGSDDEITHD EQVATWSAIS KETKSLIHLN GVASVASSHL SGFRAKKSSN GLKDHGRPKF SFNSHTHGET SSKISDMAEI FEPDVEDQAI EEDPIIECPN SFDERSENRQ GVSVAESREV LHECTKDAVP KLQEIPLDKI RLIKRSSELD SRHEAKSRKF THKGNSSNFQ DSDTDDELPG PMDSGSSSDD EPSYQSSVPN ISNQKKQFVG DRFDEAIKAS SLSKEGLLFG SPKLSGGSSL YGKLQQIMKQ EKETEMEIMR KLQSGIGEAD SSGYVDVKVM SRHLEGKLVV CKCSVIDLSG DSLLLKNTQA LAAKETETTI IFSPKVCADV DIEIGNFIRV YAPWVYGGYM QKLNKCIYIV YMDCLQMVFK LFPNWKREAE VKKLVAGYKV HGDPFSTNTR RVLAVLHEKR LSYEPITVKL QTGEHKTEPF LSLNPFGQVP VFEDGSVKLY ESRAITQYIA YVHSSRGTQL LNLRSHETMA TLTMWMEIEA HQFDPPASKL TWEQVIKPIY GLETDQTIVK ENEAILEKVL NIYEKRLEES RFLACNSFTL VDLHHLPNIQ YLLGTPTKKL FEKRSKVQYK IYGYPYSTNT RRVLAVLHEK GLSYDPITVN LIAESRAISE YIATVHKSRG TQLLNYKSYK TMGTQRMWMA IESFEFDPLT STLTWEQSIK PMYGLKTDYK VVNETEAKLE KVLDIYEERL KNSSFLASNS FTMADLYHLP NIQYLMDTHT KRMFVNRPSV RRWVAEITAR PAWKRACDVK AWYHKKKN // ID NC003070_138 HYPOTHETICAL; PRT; 137 AA. AC NC003070_138; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[672814...672651, 672548...672518, DE 672145...672106, 671974...671919, 671783...671661]; Length: 414. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 54 FIRST EXON; p-value: NaN. FT GENSCAN 55 55 AA on splice site: cg/g -> R. FT GENSCAN 56 65 INTERNAL EXON; p-value: NaN. FT GENSCAN 66 78 INTERNAL EXON; p-value: NaN. FT GENSCAN 79 79 AA on splice site: g/gt -> G. FT GENSCAN 80 97 INTERNAL EXON; p-value: NaN. FT GENSCAN 98 137 LAST EXON; p-value: NaN. SQ SEQUENCE 137 AA; 14552 MW; 80EB902D9C470EF8 CRC64; MGIGPIKAQL LILKESKLRE GPISKHTNGK DGIKVHAGEI DLNGDGECDG EETFRGIQIP IRASLGGLIL QTKLEKLIGC GALNYARDPG DVTVEATILT STDLMPKIYQ NCVTGNAEDL PEEVLAKNKV LISAIGT // ID NC003070_139 HYPOTHETICAL; PRT; 1304 AA. AC NC003070_139; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[673408...673706, 673964...674024, DE 674099...674165, 674441...674704, 674813...674880, 674968...675077, DE 675187...675227, 675340...675494, 675592...675945, 676044...676122, DE 677350...677434, 677508...677592, 677851...678002, 678094...678193, DE 678289...678522, 678592...678873, 678971...679060, 679148...679288, DE 679368...679490, 679578...679730, 679821...679997, 680103...680264, DE 680341...680484, 680559...680777, 680858...681025, 681111...681212]; DE Length: 3915. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 99 FIRST EXON; p-value: NaN. FT GENSCAN 100 100 AA on splice site: tg/c -> C. FT GENSCAN 101 120 INTERNAL EXON; p-value: NaN. FT GENSCAN 121 142 INTERNAL EXON; p-value: NaN. FT GENSCAN 143 143 AA on splice site: g/ag -> E. FT GENSCAN 144 230 INTERNAL EXON; p-value: NaN. FT GENSCAN 231 231 AA on splice site: a/gt -> S. FT GENSCAN 232 253 INTERNAL EXON; p-value: NaN. FT GENSCAN 254 289 INTERNAL EXON; p-value: NaN. FT GENSCAN 290 290 AA on splice site: ag/a -> R. FT GENSCAN 291 303 INTERNAL EXON; p-value: NaN. FT GENSCAN 304 304 AA on splice site: g/gg -> G. FT GENSCAN 305 355 INTERNAL EXON; p-value: NaN. FT GENSCAN 356 473 INTERNAL EXON; p-value: NaN. FT GENSCAN 474 499 INTERNAL EXON; p-value: NaN. FT GENSCAN 500 500 AA on splice site: g/gc -> G. FT GENSCAN 501 527 INTERNAL EXON; p-value: NaN. FT GENSCAN 528 528 AA on splice site: tg/t -> C. FT GENSCAN 529 556 INTERNAL EXON; p-value: NaN. FT GENSCAN 557 606 INTERNAL EXON; p-value: NaN. FT GENSCAN 607 607 AA on splice site: ac/g -> T. FT GENSCAN 608 640 INTERNAL EXON; p-value: NaN. FT GENSCAN 641 718 INTERNAL EXON; p-value: NaN. FT GENSCAN 719 812 INTERNAL EXON; p-value: NaN. FT GENSCAN 813 842 INTERNAL EXON; p-value: NaN. FT GENSCAN 843 889 INTERNAL EXON; p-value: NaN. FT GENSCAN 890 930 INTERNAL EXON; p-value: NaN. FT GENSCAN 931 981 INTERNAL EXON; p-value: NaN. FT GENSCAN 982 1040 INTERNAL EXON; p-value: NaN. FT GENSCAN 1041 1094 INTERNAL EXON; p-value: NaN. FT GENSCAN 1095 1142 INTERNAL EXON; p-value: NaN. FT GENSCAN 1143 1215 INTERNAL EXON; p-value: NaN. FT GENSCAN 1216 1271 INTERNAL EXON; p-value: NaN. FT GENSCAN 1272 1304 LAST EXON; p-value: NaN. SQ SEQUENCE 1304 AA; 149599 MW; D71E2F58004A565B CRC64; MFEKNGRTLL AKRKTQGTIK TRASKKIRKM EGTLERHSLL QFGQLSKISF ENRPSSNVAS SAFQGLLDSD SSELRNQLGS ADSDANCGEK DFILSQDFFC TPDYITPDNQ NLMSGLDISK DHSPCPRSPV KLNTVKSKRC RQESFTGNHS NSTWSSKHRV DEQENDDIDT DEVMGDKLQA NQTERTGYVS QAAVALRCRA MPPPCLKNPY VLNQSETATD PFGHQRSKCA SFLPVSTSGD GLSRYLTDFH EIRQIGAGHF SRVFKVLKRM DGCLYAVKHS TRKLYLDSER RKAMMEVQAL AALGFHENIV GYYSSWFENE QLYIQLELCD HSLSALPKKS SLKVSEREIL VIMHQIAKAL HFVHEKGIAH LDVKPDNIYI KNGVCKLGDF GCATRLDKSL PVEEGDARYM PQEILNEDYE HLDKVDIFSL GVTVYELIKG SPLTESRNQS LNIKEGKLPL LPGHSLQLQQ LLKTMMDRDP KRRPSARELL DHPMFDRIRG FKTPTSRFSE LFQIPVKSDS SNTLGTFCLS LISSVFVTVS QYLLEGYVPF LYERFRNLVH LIMAKKDSVL EAGWSVMEAG VAKLQKILEE VPDEPPFDPV QRMQLYTTVH NLCTQKPPND YSQQIYDRYG GVYVDYNKQT VLPAIREKHG EYMLRELVKR WANQKILVRW LSHFFEYLDR FYTRRGSHPT LSAVGFISFR DLVYQELQSK AKDAVLALIH KEREGEQIDR ALLKNVIDVY CGNGMGELVK YEEDFESFLL EDSASYYSRN ASRWNQENSC PDYMIKAEES LRLEKERVTN YLHSTTEPKL VAKVQNELLV VVAKQLIENE HSGCRALLRD DKMDDLARMY RLYHPIPQGL DPVADLFKQH ITVEGSALIK QATEAATDKA ASTSGLKVQD QVLIRQLIDL HDKFMVYVDE CFQKHSLFHK ALKEAFEVFC NKTVAGVSSA EILATYCDNI LKTGGGIEKL ENEDLELTLE KVVKLLVYIS DKDLFAEFFR KKQARRLLFD RNGNDYHERS LLTKFKELLG AQFTSKMEGM LTDMTLAKEH QTNFVEFLSV NKTKKLGMDF TVTVLTTGFW PSYKTTDLNL PIEMVNCVEA FKAYYGTKTN SRRLSWIYSL GTCQLAGKFD KKTIEIVVTT YQAAVLLLFN NTERLSYTEI LEQLNLGHED LARLLHSLSC LKYKILIKEP MSRNISNTDT FEFNSKFTDK MRRIRVPLPP MDERKKIVED VDKDRRYAID AALVRIMKSR KVLGHQQLVS ECVEHLSKMF KPDIKMIKKR IEDLISRDYL ERDTDNPNTF KYLA // ID NC003070_140 HYPOTHETICAL; PRT; 2237 AA. AC NC003070_140; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[692453...691443, 691364...691121, DE 690898...690757, 690579...690399, 690173...689985, 689813...689532, DE 689409...689251, 689158...689036, 688950...688679, 688602...688458, DE 688364...688281, 687340...687224, 686996...686844, 686751...686679, DE 686605...686505, 686424...686128, 685965...684864, 684777...683402, DE 683317...683175, 682892...682732, 682403...682243, 681921...681724]; DE Length: 6714. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 337 FIRST EXON; p-value: NaN. FT GENSCAN 338 418 INTERNAL EXON; p-value: NaN. FT GENSCAN 419 419 AA on splice site: g/gg -> G. FT GENSCAN 420 465 INTERNAL EXON; p-value: NaN. FT GENSCAN 466 466 AA on splice site: ag/g -> R. FT GENSCAN 467 526 INTERNAL EXON; p-value: NaN. FT GENSCAN 527 589 INTERNAL EXON; p-value: NaN. FT GENSCAN 590 683 INTERNAL EXON; p-value: NaN. FT GENSCAN 684 736 INTERNAL EXON; p-value: NaN. FT GENSCAN 737 777 INTERNAL EXON; p-value: NaN. FT GENSCAN 778 867 INTERNAL EXON; p-value: NaN. FT GENSCAN 868 868 AA on splice site: ag/g -> R. FT GENSCAN 869 916 INTERNAL EXON; p-value: NaN. FT GENSCAN 917 944 INTERNAL EXON; p-value: NaN. FT GENSCAN 945 983 INTERNAL EXON; p-value: NaN. FT GENSCAN 984 1034 INTERNAL EXON; p-value: NaN. FT GENSCAN 1035 1058 INTERNAL EXON; p-value: NaN. FT GENSCAN 1059 1059 AA on splice site: g/gt -> G. FT GENSCAN 1060 1092 INTERNAL EXON; p-value: NaN. FT GENSCAN 1093 1191 INTERNAL EXON; p-value: NaN. FT GENSCAN 1192 1558 INTERNAL EXON; p-value: NaN. FT GENSCAN 1559 1559 AA on splice site: g/ga -> G. FT GENSCAN 1560 2017 INTERNAL EXON; p-value: NaN. FT GENSCAN 2018 2064 INTERNAL EXON; p-value: NaN. FT GENSCAN 2065 2065 AA on splice site: ga/g -> E. FT GENSCAN 2066 2118 INTERNAL EXON; p-value: NaN. FT GENSCAN 2119 2119 AA on splice site: g/gt -> G. FT GENSCAN 2120 2172 INTERNAL EXON; p-value: NaN. FT GENSCAN 2173 2237 LAST EXON; p-value: NaN. SQ SEQUENCE 2237 AA; 246112 MW; BD2C5C81D4F32148 CRC64; MVERRNPLVL SSTRSTLRSV LNSSQPSSAD GDRVLNKDGD LLRGNARLSA GILRWRKDGE NVSDAKLDSL DDSALVGLST QLLKRLSINS GSLVCPSHLV QSTKLDLFWI ILLLLMLTIE FLKCKVVVKN IEIGIQRVAQ VVVLDPPKTT LEDASLTQVP VSDSLHTMLV FPTYDLMGQQ LLDQEVAYLS PMLAFNLSLH ISCLKSLVHR GNGVLEKYFE AKCDEEFIGK SAEDGSKIGL DLEPVSQVPG YASHLRVSFV KIPECGTIPS LKVNSSFEAE ERQGLIDSAL QKYFGTDRQL SRGDIFRIYI DWNCGSSICN PCSQRLCSES DDYIYFKVIA MEPSNERFLR VNHSQTALVL GGTVSSGLPP DLLVYRSKVP MPLQEETVNI LASVLSPPLC PSALASKLRV AVLLHGIPGC GKRTVVKYVA RRLGLHVVEF SCHSLLASSE RKTSTALAQT FNMARRYSPT ILLLRHFDVF KNLGSQDGSL GDRVGVSFEI ASVIRELTEP VSNGDSSMEE KSNSNFSENE VGKFRGHQVL LIASAESTEG ISPTIRRCFS HEIRMGSLND EQRSEMLSQS LQGVSQFLNI SSDEFMKGLV GQTSGFLPRD LQALVADAGA NLYISQESET KKINSLSDDL HGVDIHQASQ IDNSTEKLTA KEDFTKALDR SKKRNASALG APKVPNVKWD DVGGLEDVKT SILDTVQLPL LHKDLFSSGL RKRSGVLLYG PPGTGKTLLA KAVATECSLN FLSVKGPELI NMYIGESEKN VRDIFEKARS ARPCVIFFDE LDSLAPARGA SGDSGGVMDR VVSQMLAEID GLSDSSQDLF IIGASNRPDL IDPALLRPGR FDKLLYVGVN ADASYRERVL KALTRKFKLS EDVSLYSVAK KCPSTFTGAD MYALCADAWF QAAKRKVSKS DSGDMPTEED DPDSVVVEYV DFIKKTENEE SKNGILNLQR EEPPLSFRSS SFFLKLSLFK ALHMILVSSS DARSELGLGF GGLGEEIMQD CDFEEESTHS YVSCVDPDVA LSYIDEKLEN VLGHFQKDFE GGVSAENLGA KFGGYGSFLS MYQRSPVCSR PKTSPEVQQN QLGGRSNCSA SSLVPQLSIS GSASKPPASD VLVKLNKFVK SSHIGTPDSK HMSDAKTSSS APSNHKTLRF RIKVGSSDLS SLKNVSTFTK EGLNMLPSAS RAMVSFPLHK DQLLSPLSDD LIQLGSKEKI LKDAGYGSTN KTDAKSTPDG LVVSDSQKRA GKFPIGKKEK LRDRVKYRPP SNKLDRNHTV SNTEKEADKE SCEELVSKTM KLPLLSCLSP SYIHPAKEID NVSDSNVESI LRGTNKDAAL MGSKPELEDN VVAFSDRSVK ETESINVRKD VYLIKGEPLN SLESNPKREK APSIEHVDYS SVVKGTQSET RNEEQILKSK LPKVQRSQKG SSSIVTMNSQ RGKDAAVNII KKNVPDKLQE DIEESEHMCK GFFGDSKESK EEKQISPVLK AEKEKLSEEN ALGESFNSVK NDEEACDHLN LVCEPDLKHL IKPSDLNEDR HTTKQSVRRE VKNKHSLEGG MENMGMESER ELSGVSKKPK TGKSRFSAVD QPGSNKSNQI LEVLDTNKTM ITQALAENVK DFAKASSHGE RDDRKRKLKE NEESGDCMRL REAAVMESSG ENVRKRKRLK GSSCDEKELP FSSESCDKER SVSQENGRDS ASHLPSTLSS PSLCKDLGSE IIKNNVRESK GSLVESVAPL ALRVLDSGEL KSGRISERDE YHDTDYNAGE TLKRCRDGEA YSTIDRPGTT KKAAEDSKDR ERAYGEDCSI ENLKPKKSGR YPGENCIEGD SKQKSREEES SAPSKDNNWG LVNNVQDLGT AVKVKTKESR SKKRPARKVS MECNKEDSRE YQDPNTKLDR SGSHFSSRQK PDTANTSRGK SNPLEVTTEQ LKNKSASPAQ VEVLGHDTEI SNTKKQRLRN DNHSVTHDEG SRNQKQNGSR HKDHVGLSPF KKESTSQTAS NSIKEATDLK HMADRLKNAV SNHESTGVYF QAALKFLHGA SLLESSGTTI ARSKDIYGST AKLCEFCAHE YEKNKDMGAA ALAYKCMEVA YLRITYTSHG NIRRCRYELQ AALQVIPSGE SPSFASDGEN SNHTLTAEKF ALSNTVRSSP SVTGNHVISS GNNSSLSQLL AFSKNVNYAM EASRKAQIAL AAAKGKSFET RYSSNGITCI KRALDFNFQD MEKLLHVVRL AMESINR // ID NC003070_141 HYPOTHETICAL; PRT; 541 AA. AC NC003070_141; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[693480...693571, 694117...694183, DE 694280...695476, 695547...695793, 695836...695858]; Length: 1626. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 30 FIRST EXON; p-value: NaN. FT GENSCAN 31 31 AA on splice site: tg/g -> W. FT GENSCAN 32 53 INTERNAL EXON; p-value: NaN. FT GENSCAN 54 452 INTERNAL EXON; p-value: NaN. FT GENSCAN 453 534 INTERNAL EXON; p-value: NaN. FT GENSCAN 535 535 AA on splice site: g/ta -> V. FT GENSCAN 536 541 LAST EXON; p-value: NaN. SQ SEQUENCE 541 AA; 60439 MW; 16CA874736F63784 CRC64; MGLVTVGELK PAFTGKRGFR LNSTIRHASE WPISDVSSDL TVQVGSSSFC LHKFPLVSRS GKIRKLLADP KISNVCLSNA PGGSEAFELA AKFCYGINIE INLLNIAKLR CASHYLEMTE DFSEENLASK TEHFLKETIF PSILNSIIVL HHCETLIPVS EDLNLVNRLI IAVANNACKE QLTSGLLKLD YSFSGTNIEP QTPLDWWGKS LAVLNLDFFQ RVISAVKSKG LIQDVISKIL ISYTNKSLQG LIVRDPKLEK ERVLDSEGKK KQRLIVETIV RLLPTQGRRS SVPMAFLSSL LKMVIATSSS ASTGSCRSDL ERRIGLQLDQ AILEDVLIPI NLNGTNNTMY DIDSILRIFS IFLNLDEDDE EEEHHHLQFR DETEMIYDFD SPGSPKQSSI LKVSKLMDNY LAEIAMDPNL TTSKFIALAE LLPDHARIIS DGLYRAVDIY LKVHPNIKDS ERYRLCKTID SQKLSQEACS HAAQNERLPV QMAVQVLYFE QIRLRNAMSS SIGPTQFLFN SNCHQFPQRS GSGAVTSSDV V // ID NC003070_142 HYPOTHETICAL; PRT; 102 AA. AC NC003070_142; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[698515...698207]; Length: 309. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 102 SINGLE EXON; p-value: NaN. SQ SEQUENCE 102 AA; 11069 MW; 25E6C5C5A19D5B09 CRC64; MEKISNLLED KPVVIFSKTS CCMSHSIKSL ISGYGANSTV YELDEMSNGP EIERALVELG CKPTVPAVFI GQELVGGANQ LMSLQVRNQL ASLLRRAGAI WI // ID NC003070_143 HYPOTHETICAL; PRT; 333 AA. AC NC003070_143; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[699205...699216, 701400...701560, DE 701791...701875, 701976...702019, 702106...702183, 702276...702468, DE 702567...702597, 702679...702737, 702828...702901, 703012...703098, DE 703195...703256, 703353...703381, 703482...703568]; Length: 1002. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 4 FIRST EXON; p-value: NaN. FT GENSCAN 5 57 INTERNAL EXON; p-value: NaN. FT GENSCAN 58 58 AA on splice site: tt/g -> L. FT GENSCAN 59 86 INTERNAL EXON; p-value: NaN. FT GENSCAN 87 100 INTERNAL EXON; p-value: NaN. FT GENSCAN 101 101 AA on splice site: ag/t -> S. FT GENSCAN 102 126 INTERNAL EXON; p-value: NaN. FT GENSCAN 127 127 AA on splice site: aa/a -> K. FT GENSCAN 128 191 INTERNAL EXON; p-value: NaN. FT GENSCAN 192 201 INTERNAL EXON; p-value: NaN. FT GENSCAN 202 202 AA on splice site: g/ct -> A. FT GENSCAN 203 221 INTERNAL EXON; p-value: NaN. FT GENSCAN 222 245 INTERNAL EXON; p-value: NaN. FT GENSCAN 246 246 AA on splice site: ca/g -> Q. FT GENSCAN 247 274 INTERNAL EXON; p-value: NaN. FT GENSCAN 275 275 AA on splice site: tg/g -> W. FT GENSCAN 276 295 INTERNAL EXON; p-value: NaN. FT GENSCAN 296 296 AA on splice site: g/gt -> G. FT GENSCAN 297 305 INTERNAL EXON; p-value: NaN. FT GENSCAN 306 333 LAST EXON; p-value: NaN. SQ SEQUENCE 333 AA; 37574 MW; CEAF8B69423A06CC CRC64; MLPFVKNSTP RSGSQRPFKA KEHDTCLSSL SGDAKLFFLD RENTAPLRLN RTFESELLGF KVHLWDQSLV PLHFSIRKRK NTPRYLISCS QKKDVTVVDG SCMDEIYDKL AERLVPTAAA MFSPNLKRLV GLAGPPGAGK STVANEVVRR VNKLWPQKAA SFDAEVNPPD VAIVLPMDGF HLYRSQLDAM EDPKEAHARR GAPWTFDPAL LLNCLKKLKN EGSVYVPSFD HGVGDPVEDD IFVSLQHKVV IVEGNYILLE EGSWKDISDM FDEKWFIDVN LDTAMQRVEN RHISTGKPPD VAKWRVDYND RPNAELIIKS KTNADLLIRS MNI // ID NC003070_144 HYPOTHETICAL; PRT; 302 AA. AC NC003070_144; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[706457...706101, 705656...705534, DE 705367...705302, 705129...705064, 704668...704588, 704494...704279]; DE Length: 909. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 119 FIRST EXON; p-value: NaN. FT GENSCAN 120 160 INTERNAL EXON; p-value: NaN. FT GENSCAN 161 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 204 INTERNAL EXON; p-value: NaN. FT GENSCAN 205 231 INTERNAL EXON; p-value: NaN. FT GENSCAN 232 302 LAST EXON; p-value: NaN. SQ SEQUENCE 302 AA; 32705 MW; 2D63EACD453AF945 CRC64; MANNNNIPHD SISDPSPTDD FFEQILGLSN FSGSSGSGLS GIGGVGPPPM MLQLGSGNEG NHNHMGAIGG GGPVGFHNQM FPLGLSLDQG KGHGFLKPDE TGKRFQDDVL DNRCSSMKPI FHGQPMSQPA PPMPHQQSTI RPRVRARRGQ ATDPHSIAER LRRERIAERI RSLQELVPTV NKTDRAAMID EIVDYVKFLR LQVKVLSMSR LGGAGAVAPL VTEMPLSSSV EDETQAVWEK WSNDGTERQV AKLMEENVGA AMQLLQSKAL CIMPISLAMA IYHSQPPDTS SSIVKPEMNP PP // ID NC003070_145 HYPOTHETICAL; PRT; 599 AA. AC NC003070_145; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[707726...708189, 708304...708509, DE 708601...709467, 709598...709860]; Length: 1800. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 154 FIRST EXON; p-value: NaN. FT GENSCAN 155 155 AA on splice site: ag/g -> R. FT GENSCAN 156 223 INTERNAL EXON; p-value: NaN. FT GENSCAN 224 224 AA on splice site: g/gt -> G. FT GENSCAN 225 512 INTERNAL EXON; p-value: NaN. FT GENSCAN 513 513 AA on splice site: g/ca -> A. FT GENSCAN 514 599 LAST EXON; p-value: NaN. SQ SEQUENCE 599 AA; 66972 MW; B4190E9B926BE1B2 CRC64; MGSSKFKRAI GAVKDQTSVG LAKVNGRSAS LSELDVAIVK ATRHEEFPAE EKYIREILSL TSYSRSYINA CVSTLSRRLN KTKCWTVALK TLILIQRLLG EGDQAYEQEI FFATRRGTRL LNMSDFRDVS RSNSWDYSAF VRTYALYLDE RLDFRMQARH GKRGVYCVGG EADEEEQDQA AADLSTAIVV RSQPIAEMKT EQIFIRIQHL QQLLDRFLAC RPTGNARNNR VVIVALYPIV KESFQIYYDV TEIMGILIER FMELDIPDSI KVYDIFCRVS KQFEELDQFY SWCKNMGIAR SSEYPEIEKI TQKKLDLMDE FIRDKSALEH TKQSKSVKSE ADEDDDEART EEVNEEQEDM NAIKALPEPP PKEEDDVKPE EEAKEEVIIE KKQEEMGDLL DLGNTNGGEA GQAGDSLALA LFDGPYASGS GSESGPGWEA FKDDSADWET ALVQTATNLS GQKSELGGGF DMLLLNGMYQ HGAVNAAVKT STAYGASGSA SSMAFGSAGR PAATMLALPA PSTANGNAGN INSPVPMDPF AASLEVAPPA YVQMNDMEKK QRMLMEEQMM WDQYSRDGRQ GHMNLRQNQN QPYSYTPQY // ID NC003070_146 HYPOTHETICAL; PRT; 3580 AA. AC NC003070_146; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[726891...726710, 726347...726275, DE 726122...726037, 725824...725689, 725608...725552, 725409...725236, DE 725081...724890, 724746...724681, 724389...724275, 724119...722966, DE 722815...722116, 722007...721905, 721795...721609, 721545...720681, DE 720585...718935, 718662...715660, 715315...715007, 714864...714558, DE 714463...714317, 714206...712971]; Length: 10743. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 60 FIRST EXON; p-value: NaN. FT GENSCAN 61 61 AA on splice site: ag/c -> S. FT GENSCAN 62 85 INTERNAL EXON; p-value: NaN. FT GENSCAN 86 113 INTERNAL EXON; p-value: NaN. FT GENSCAN 114 114 AA on splice site: at/g -> M. FT GENSCAN 115 159 INTERNAL EXON; p-value: NaN. FT GENSCAN 160 178 INTERNAL EXON; p-value: NaN. FT GENSCAN 179 236 INTERNAL EXON; p-value: NaN. FT GENSCAN 237 300 INTERNAL EXON; p-value: NaN. FT GENSCAN 301 322 INTERNAL EXON; p-value: NaN. FT GENSCAN 323 360 INTERNAL EXON; p-value: NaN. FT GENSCAN 361 361 AA on splice site: g/ag -> E. FT GENSCAN 362 745 INTERNAL EXON; p-value: NaN. FT GENSCAN 746 978 INTERNAL EXON; p-value: NaN. FT GENSCAN 979 979 AA on splice site: g/gt -> G. FT GENSCAN 980 1012 INTERNAL EXON; p-value: NaN. FT GENSCAN 1013 1013 AA on splice site: ag/a -> R. FT GENSCAN 1014 1075 INTERNAL EXON; p-value: NaN. FT GENSCAN 1076 1363 INTERNAL EXON; p-value: NaN. FT GENSCAN 1364 1364 AA on splice site: g/ga -> G. FT GENSCAN 1365 1913 INTERNAL EXON; p-value: NaN. FT GENSCAN 1914 1914 AA on splice site: ag/g -> R. FT GENSCAN 1915 2914 INTERNAL EXON; p-value: NaN. FT GENSCAN 2915 2915 AA on splice site: at/g -> M. FT GENSCAN 2916 3017 INTERNAL EXON; p-value: NaN. FT GENSCAN 3018 3018 AA on splice site: cg/a -> R. FT GENSCAN 3019 3120 INTERNAL EXON; p-value: NaN. FT GENSCAN 3121 3169 INTERNAL EXON; p-value: NaN. FT GENSCAN 3170 3580 LAST EXON; p-value: NaN. SQ SEQUENCE 3580 AA; 398529 MW; B41E9D09D110512C CRC64; MKWATLLKDI KEKVGLAQSS DSDPFPVDLT APPSSSSSSS SPSFTYPSSS SLHHFNFSPS SRDNHELELD FKRLWEEFRS SSSEKEKEAA LNLTVDIFCR LVKRHANVDQ LVTMLVETHI FSFVIGRAFV TDIEKLKIGS KTRSLNVEKV LRFFSDVTKE GFSPGANLLT AVEVLVSGPI DKQSLLDSGI FCCLIHVLIA LLAYDELSKS KITGDLEVVS AEKDAGYIVL QTRRLEVEGS VVHIMKALAS NPSAAQSLIE DDSLESLFNM VANGSITVFS QYKEGLVPLH NIQLHRHAMQ ILGLLLVNDN GSTARYIRKH HLIKVLLMAV KEFDPSCGDS AYTMGIVDLL LECVELSYRP EAGGVRLRED IRNAHGYHFL VQFALVLSSL PKNPIFVSSN HDSGSDDPEV FHDGENTNST ENADFSSQNF APSLSRLLDV LVTLAQTGPA EPSVGRASRS SQTKPTGHSR SRTSSVDSIY DETWEQGSGK VKDLEAVQML QDIFLKAENK DLQAEVLNRM FKIFSSHVEN YRLCQELRTV PLLVLNMAGF PSSLQDIILK ILEYAVTVVN CVPEQELLSL CCLLQQPITS QLKHTILSFF VKLISFDQQY KKVLREVGVL EVLQDDLKQH KLLIGPDQYS GVSSHSDRKP SSGSFRKNLD TKDAIISSPK LMESGSGKLP VFEVDNTITV GWDCLISLLK KAEANQSSFR AANGVAIILP FLISDAHRSG VLRILSCLIT EDTKQVHHDE LGAVVDLLKS GMVTGISGHQ YKLHDDAKCD TMGALWRIVG VNGSAQRVFG EATGFSLLLT TLHTFQGKRE HMDESDLTVY IKLFKYLFRL MTAAVCENAV NRMKLHAVIT SQTFFELLAE SGLLCVELER QVIQLLLELA LEVVVPPFLT SESTALATIP ENENTTFVVT TPSGQFNPDK ERIYNAGAVR VLIRSLLLFS PKMQLEFLRL LESLARASPF NQENLTSIGC VELLLEIIYP FLAGSSPFLS YALKIVEILG AYRLSPSELR MLFRYVLQMR IMNSGHAIVG MMEKLILMED TALEHLSLAP FVELDMSKTG HASVQFRNFL TTQGKESEAS KAGGSSKTRM TSAQQHEQNI FRMFSVGAVS NESPFYAELY FQEDGILTLA TSNSHSLSFS GLEIEEGRWH HLAVVHSKPN ALAGLFQASV AYVYLDGKLR HTGKLGYSPS PVGKSLQVTV GTPATCARVS DLTWKTRSCY LFEEVLTSGC IGFMYILGRG YKGLFQDADL LRFVPNQACG GGSMAILDSL DTDMTSSSNG QKFDGSNRQG DSKADGSGIV WDLERLGNLA FQLPGKKLIF AFDGTCSEFI RASGNFSLLN LVDPLSAAAS PIGGIPRFGR LVGNVSICRQ SVIGDTIRPV GGMTVVLALV EAAESRNMLH MALSLLACAL HQNPQNVKDM QTIRGYHLLA LFLRPKMTLF DMQSLEIFFQ IAACEALFSE PKKLESVQSN ITMPPTETIF ENSYEDLSLS RFRYDSSSVG SHGDMDDFSV PKDSFSHLSE LETDIPVETS NCIVLSNADM VEHVLLDWTL WVTSPVSIQI ALLGFLENLV SMHWYRNHNL TILRRINLVE HLLVTLQRGD VEVPVLEKLV VLLGCILEDG FLTSELENVV RFVIMTFNPP EVKSRSSLLR ESMGKHVIVR NMLLEMLIDL QVTIKAEDLL ELWHKIVSSK LITYFLDEAV HPTSMRWIMT LLGVCLASSP NFSLKFRTSG GYQGLLRVLQ NFYDSPDIYY ILFCLIFGKP VYPRLPEVRM LDFHALVPND GSYVELKFIE LLDSVVAMAK STYDRLIMQS MLAHQSGNLS QVSASLVAEL IEGAEMTGEL QGEALMHKTY AARLMGGEAS APAAATSVLR FMVDLAKMCP QFSTACRRAE FVENCADLYF SCVRAAYAVK MAKQLSVKAE EKHINDADDS GSQGSLPHDQ DQSTKTSISV GSFPQGQVSL GSEDMSLPAN YVVNDKMENI LPPPTQDTSK SLQGVEDVKK QDDHHVGPSA SSERDFQDFT GNPVQVQATD SQSSASFPMI ESPLLSEKSS LKVSFTPSPS PVVALASWLG SNYNESKSST LGSPSLESYV SVNEVDASSE RKSGSQGSSA ANAFFTVSPK LLLETDETGY GGGPCSAGAS AVLDFMAEAL ADLVTEQIKA VPVLESILEM VPFYVDPESV LVFQGLCLSR VMNYLERRLL RDDEEDEKKL DKAKWSVNLD AFCWMIVDRV YMGAFSQPAG VLRALEFLLS MLQLANKDGR VEEVTPSGKG LLSLGRATRQ LDAYVHSILK NTNRMVLYCF LPSFLITIGE EDLLSQLGLL VESKKRPSPN PATDESGIDI STVLQLLVAN RRIIFCPSNL DTDLNCCLCV NLISLLLDQR KSVQNMSLDI VKYLLVHRRS ALEDLLVTKP NQGQNFDVLH GGFDKLLTGN LPEFFKWLES SDKIINKVLE QCAAIMWVQY IAGSAKFPGV RIKGMEGRRK REMGRKSRDM SKLDLKHWDQ LNERRYALEV LRDAMSTELR VVRQNKYGWI LHAESEWQTH LQQLVHERGI FPMRKSKGTE DPEWQLCPIE GPYRMRKKLE RCKLKIDSIQ NVLDGKLELG EIELPKVKNE DGPVISDTDS EPPFLLSELY DESFLKESDD FKDVASARNG WNDDRASSTN EASLHSALDF GGKSSIASVP ITDTTHVKSE TGSPRHSSSA KMDETNGREE KSEKELNDDG EYLIRPYLEH LEKIRFRYNC ERVVDLDKHD GIFLIGEFCL YVIENFYIDE DGCICEKECE DELSVIDQAL GVKKDVSGSS DFHSKSSTSW TTTVKTGAVG GRAWAYGGGA WGKEKMCMTG NLPHPWRMWK LNNVHEILKR DYQLRPVAIE IFSMDGCNDL LVFHKKEREE VFKNLVAMNL PRNSMLDTTI SGSAKQESNE GGRLFKLMAK SFSKRWQNGE ISNFQYLMHL NTLAGRGYSD LTQYPVFPWV LADYDSESLD FSDPKTFRKL HKPMGCQTPE GEEEFRKRYE SWDDPEVPKF HYGSHYSSAG IVLFYLIRLP PFSSENQKLQ GGQFDHADRL FNSIKDTWLS AAGKGNTSDV KELIPEFFYM PEFLENRFSL DLGEKQSGEK VGDVFLPPWA RGSVREFILK HREALESDYV SENLHHWIDL IFGYKQRGKA AEEAVNVFYH YTYEGNVDID AVTDPAMKAS ILAQINHFGQ TPKQLFPKAH VKRRTDRKIP LHPLKHSMHL VPHEIRKCSS SISQIITFHD KVLVAGANCF LKPRGYTKYI TWGFPDRSLR FMSYDQDKLL STHENLHESN QIQCAGVSHD GRIVVTGAED GLVCVWRVSK DGPRGSRRLR LEKALCAHTA KVTCLRVSQP YMMIASGSDD CTVIIWDLSS LSFVRQLPDF PVPISAIYIN DLTGEIVTAA GTVLAVWSIN GDCLAVANTS QLPSDSVLSV TGSTSSDWLE TSWYVTGHQS GAVKVWRMIH CTDPVSAESK TSSSNRTGGL NLGDQVPEYK LILHKVLKFH KQPVTALHLT SDLKQLLSGD SAGQLLSWTV PDETLRASMK QASLKQASLK QASLKQASSV // ID NC003070_147 HYPOTHETICAL; PRT; 301 AA. AC NC003070_147; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[729446...729475, 730118...730415, DE 730497...730610, 730916...731379]; Length: 906. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 10 FIRST EXON; p-value: NaN. FT GENSCAN 11 109 INTERNAL EXON; p-value: NaN. FT GENSCAN 110 110 AA on splice site: g/ta -> V. FT GENSCAN 111 147 INTERNAL EXON; p-value: NaN. FT GENSCAN 148 148 AA on splice site: g/gg -> G. FT GENSCAN 149 301 LAST EXON; p-value: NaN. SQ SEQUENCE 301 AA; 34351 MW; 4555DEE3CD9DEBA5 CRC64; MEVYIIVVVR NPLLRFTVFR MYKWNLPYRK DDVETGREGG ERSLYPTMLE SPELRWGFIR KVYSIIAFQL LATIAVASTV VFVRPIAVFF ATTSAGLALW IVLIITPLIV MCPLYYYHQK HPVNYLLLGI FTVALAFAVG LTCAFTSGKV ILEAAILTTV VVLSLTVYTF WAAKKGYDFN FLGPFLFGAL IVLMVFALIQ VPVPYNCLFW FNIRNSGLKL SLIVVLWVLF VFLQIFFPLG RISVMIYGCL AAIIFCGYIV YDTDNLIKRY SYDEYIWAAV SLYLDIINLF LALLTIFRAA E // ID NC003070_148 HYPOTHETICAL; PRT; 1744 AA. AC NC003070_148; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[737332...737233, 737081...732928, DE 732774...731794]; Length: 5235. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 33 FIRST EXON; p-value: NaN. FT GENSCAN 34 34 AA on splice site: g/at -> D. FT GENSCAN 35 1418 INTERNAL EXON; p-value: NaN. FT GENSCAN 1419 1744 LAST EXON; p-value: NaN. SQ SEQUENCE 1744 AA; 199778 MW; 268A543613BD714D CRC64; MTAVVNGNSK RYSWWWDSHI SPKNSKWLQE NLTDMDSKVK QMIKVIEEDA DSFARRAEMY YKKRPELMKL VEEFYRAYRA LAERYDHATG VIRHAQQTMA EAFPNQDPMM FGEESPLGSS TDGFDPQTPD SYPPIRAPVY PDDLRKGAFG ISSSHLSTVK RNIAFMEDPQ SVSSGKGFKT AKARKGLNFN NVDGKEINAK VLSESERASK AEAEIVALKD ALSKVQAEKE ASLAQFDQNL EKLSNLESEV SRAQEDSRVL IERATRAEAE VETLRESLSK VEVEKESSLL QYQQCLQNIA DLEDRISLAQ KEAGEVDERA NRAEAETLAL KQSLVSSETD KEAALVQYQQ CLKTISNLEE RLHKAEEDSR LTNQRAENAE GEVESLKQKV SKLIEENEAY ELQYQQCLDT IADLKLKLFH AQEETQRLSR EIEDGVAKLK FAEEKCVVLE RSNQNLHSEL DGLLEKLGNQ SHELTEKQKE LGRLWTCVQE ENLRFMEAET AFQTLQQLHS QSQEELSTLA LELQNRSQIL KDMEARNNGL QEEVQEAKDQ SKSLNELNLS SAASIKSLQE EVSKLRETIQ KLEAEVELRV DQRNALQQEI YCLKEELSQI GKKHQSMVEQ VELVGLHPES FGSSVKELQE ENSKLKEIRE RESIEKTALI EKLEMMEKLV QKNLLLENSI SDLNAELETI RGKLKTLEEA SMSLAEEKSG LHSEKDMLIS RLQSATENSK KLSEENMVLE NSLFNANVEL EELKSKLKSL EESCHLLNDD KTTLTSERES LLSHIDTMRK RIEDLEKEHA ELKVKVLELA TERESSLQKI EELGVSLNAK DCEYASFVQF SESRMNGMES TIHHLQDENQ CRVREYQVEL DRAHDAHIEI IVLQKCLQDW LEKSSSLIAE NQDIKEASKL LEKLVSELEE ENIGKQVQID SSINCIKILR TGIYQVLMKL EIIPGIGSGD ENSRDQRNMH DILNRLEDMQ TMLLSIRDEN QHSAIENLVL IEFLRQLKSE AVGIETEKKI LEEELESQCQ QLSFSRDETQ KLIFVNGELT TKVNQGVNRE KVLMVEIEDF HRQVLQLRDD YTILQGDNNK TLDEKAYLTK STLQLEEEKC KLEDDISLLL SETIYQSNLI ILLEDVILEK LSGAMKLNED LDRLSIVKCK LEEEVRELGD KLKSADIANF QLQVVLEKSN AELLSARSAN VHLEHEIANV KVQKEKELLE AMLMISIMQN EKSELSKAVE GLECRYKEAK AIEEDRDKQV LRLRGDYDEQ VKKNSHSNEA NLKLEADLMN LLMELEEIKV EKENLNQELF TERNEIELWE SQSATLFGEL QISAVHETLL EGLTNELVEA CKNLESRSTL KDREIEQLKG RVNNLEDANK GQNDLMCKYA QAIFLLKESI QSLEKHAMLH EFENGPATTN QSFVGISYQE TASLVDNSDG FLEIQELHLR IKAIEEAITK KLAMEELKTS SARRSRRRNG SLRKQNHEIY SEETEMITKD IVLDQVSDCS SYGISTRDIL KIEDDHSLEA KSQNPPKGKS LSEESLVVDK LEISDRFTDP NKDANKRKVL ERLNSDLQKL SNLHVAVEDL KIKVETEEKD EKGKENEYET IKGQINEAEE ALEKLLSINR KLVTKVQNGF ERSDGSKSSM DLDENESSRR RRISEQARRG SEKIGRLQLE IQRLQFLLLK LEGDREDRAK AKISDSKTRI LLRDYIYSGV RGERRKRIKK RFAFCGCVQP PPSP // ID NC003070_149 HYPOTHETICAL; PRT; 712 AA. AC NC003070_149; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[739456...739513, 739608...740050, DE 740124...740245, 740355...740502, 740620...740760, 740970...741065, DE 741684...741795, 741946...742050, 742181...742317, 742502...742736, DE 743010...743203, 743301...743546, 743620...743716, 743815...743819]; DE Length: 2139. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: NaN. FT GENSCAN 20 20 AA on splice site: g/tt -> V. FT GENSCAN 21 167 INTERNAL EXON; p-value: NaN. FT GENSCAN 168 207 INTERNAL EXON; p-value: NaN. FT GENSCAN 208 208 AA on splice site: ag/t -> S. FT GENSCAN 209 257 INTERNAL EXON; p-value: NaN. FT GENSCAN 258 304 INTERNAL EXON; p-value: NaN. FT GENSCAN 305 336 INTERNAL EXON; p-value: NaN. FT GENSCAN 337 373 INTERNAL EXON; p-value: NaN. FT GENSCAN 374 374 AA on splice site: g/gt -> G. FT GENSCAN 375 408 INTERNAL EXON; p-value: NaN. FT GENSCAN 409 409 AA on splice site: g/tt -> V. FT GENSCAN 410 454 INTERNAL EXON; p-value: NaN. FT GENSCAN 455 532 INTERNAL EXON; p-value: NaN. FT GENSCAN 533 533 AA on splice site: g/at -> D. FT GENSCAN 534 597 INTERNAL EXON; p-value: NaN. FT GENSCAN 598 679 INTERNAL EXON; p-value: NaN. FT GENSCAN 680 711 INTERNAL EXON; p-value: NaN. FT GENSCAN 712 712 AA on splice site: g/gg -> G. FT GENSCAN 713 712 LAST EXON; p-value: NaN. SQ SEQUENCE 712 AA; 77996 MW; 6DD15F7EFE01115A CRC64; MLRADRPINK TIKPILVKSV FTKLTNIPSL FGFIYNAVAE SHLFSSLGKF ENRDQMSMMT VWALRRNVRR KNHSMLVRYI SGSASMKPKE QCIEKILVAN RGEIACRIMR TAKRLGIQTV AVYSDADRDS LHVKSADEAV RIGPPSARLS YLSGVTIMEA AARTGAQAIH PGYGFLSESS DFAQLCEDSG LTFIGPPASA IRDMGDKSAS KRIMGAAGVP LVPGYHGHEQ DIDHMKSEAE KIGYPIIIKP THGGGGKGMR IVQSGKDFAD SFLGAQREAA ASFGVNTILL EKYITRPRHI EVQVIFGDKH GNVLHLYERD CSVQRRHQKI IEEAPAVEHP VTEMIVGQDL VEWQIRVANG EPLPLSQSEV PMSGHAFEAR IYAENVPKGF LPATGVLNHY RPVAVSPSVR VETGVEQGDT VSMHYDPMIA KLVVWGGNRG EALVKLKDCL SNFQVAGVPT NINFLQKLAS HKEFAVGNVE THFIEHHKSD LFADESNPAA TEVAYKAVKH SAALVAACIS TIEHSTWNES NHDSLLDSYT SFMSLIIGIL MVCQHSTRQE GNDSPSLELR VTRAGKCDFR VEAAGLSMNV SLAAYLKDGY KHIHIWHGSE HHQFKQKVGI EFSEDEEGVQ HRTSSETSSH PPGTIVAPMA GLVVKVLVEN EAKVDQGQPI LVLEAMKMEH VVKAPSSGSI QDLKVKAGQQ VSDGSALFRI KG // ID NC003070_150 HYPOTHETICAL; PRT; 806 AA. AC NC003070_150; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[747723...747637, 746359...744026]; Length: DE 2421. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 29 FIRST EXON; p-value: NaN. FT GENSCAN 30 806 LAST EXON; p-value: NaN. SQ SEQUENCE 806 AA; 90889 MW; CA4C52B64F08679F CRC64; MVYSPETWYL TVKEHFEVNL QEYSPATWLC RLSSLFRGVL IHAIEVTGTQ SQASLQHGLA GKAVLSNAKY LSVLTRFRTS TRFDVWSIHR REAISSISGS ILLQARDPAK LNEEIQIAVD EHRCDEAWRL FEQHMQMEGF PRKSVVNNVV VCFAESLDSN WLQKGYSLVE QAYEEGKQNL LEKEPLLYLS LALAKSGMAV PASTILRKLV ETEEYPHVSA WSAVLAHMSL AGSGSYLSAE LVLEIGYLFH NNRVDPRKKS NAPLLAMKPN TQVLNVALAG CLLFGTTRKA EQLLDMIPKI GVKADANLLV IMAHIYERNG RREELRKLQR HIDEACNLNE SQFWQFYNCL LMCHLKFGDL ESASKMVLEM LRRGKVARNS LGAAILEFDT ADDGRLYTKR VSGKGSEVKE HDNPETRVVS IHSMIPYDEF SRDRKFLKLE AEAKDVLGAL LAKLHVQVEL ITSERGVLQP TEEIYVKLAK AFLESGKMKE LAKFLLKAEH EDSPVSSDNS MLINVINACI SLGMLDQAHD LLDEMRMAGV RTGSSVYSSL LKAYCNTNQT REVTSLLRDA QKAGIQLDSS CYEALIQSQV IQNDTHGALN VFKEMKEAKI LRGGNQKFEK LLKGCEGNAE AGLMSKLLRE IREVQSLDAG VHDWNNVIHF FSKKGLMQDA EKALKRMRSL GHSPNAQTFH SMVTGYAAIG SKYTEVTELW GEMKSIAAAT SSMKFDQELL DAVLYTFVRG GFFSRANEVV EMMEKKNMFV DKYKYRMLFL KYHKTAYKGK APKVQSESQL KKREAGLVFK KWLGLS // ID NC003070_151 HYPOTHETICAL; PRT; 324 AA. AC NC003070_151; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[749359...749495, 749785...749949, DE 750118...750298, 750416...750472, 750557...750627, 750796...750915, DE 751553...751796]; Length: 975. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 45 FIRST EXON; p-value: NaN. FT GENSCAN 46 46 AA on splice site: at/a -> I. FT GENSCAN 47 100 INTERNAL EXON; p-value: NaN. FT GENSCAN 101 101 AA on splice site: gt/a -> V. FT GENSCAN 102 161 INTERNAL EXON; p-value: NaN. FT GENSCAN 162 180 INTERNAL EXON; p-value: NaN. FT GENSCAN 181 203 INTERNAL EXON; p-value: NaN. FT GENSCAN 204 204 AA on splice site: ga/g -> E. FT GENSCAN 205 243 INTERNAL EXON; p-value: NaN. FT GENSCAN 244 244 AA on splice site: ca/g -> Q. FT GENSCAN 245 324 LAST EXON; p-value: NaN. SQ SEQUENCE 324 AA; 35804 MW; 1E632A3823A0A6E2 CRC64; MQEESHIEEV EIQNKLEVAP ALISVHPSQK SVAVAVGSDL RIFDLIENCP VSLVDESDGP IRKESIRAIR YSTSGKLFVS AGDDKLVKIW SADSWRCLNT VCSEKRVSAV AISSDDSHVC YADKFGVVWV IELDGINDGK TLPSKKGALL LSHYCSIITS LEFSPDGRYI LSADRDFKIR VTVFPKKPLE GAHEIQSFCL GHSEFITCTA FVSTPELTQG YLMSGSGDST VSDLKVYISS SYDQVRVISC LETESSSILE DEQIPGGTKL LEQLQGKVTI EESVMSAAAE AVRAAMSSLL MKKQYSEEKR EFRKRTRNDK KTTR // ID NC003070_152 HYPOTHETICAL; PRT; 182 AA. AC NC003070_152; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[752271...752660, 752982...753140]; Length: DE 549. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 130 FIRST EXON; p-value: NaN. FT GENSCAN 131 182 LAST EXON; p-value: NaN. SQ SEQUENCE 182 AA; 18942 MW; 33F1827C33CBE4E7 CRC64; MAQHQHSPQR PRDQDNTRPH DQYGIVFSVS GDDVARKQGD SFSQPDPTVA TMGSVDTVTI GEALEATALS LGDKPVDRRD AAAIQAAETR ATGESKGRPG GLAVAAQAAA TTNEQTVSEE DKVNIADILT DAAERLPGDK VVTSEDAEAV VGAELRSSSE MKTTPGGVAD SMSAGARLNQ QL // ID NC003070_153 HYPOTHETICAL; PRT; 614 AA. AC NC003070_153; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[756223...755381, 754890...754533, DE 754171...753528]; Length: 1845. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 281 FIRST EXON; p-value: NaN. FT GENSCAN 282 400 INTERNAL EXON; p-value: NaN. FT GENSCAN 401 401 AA on splice site: g/ag -> E. FT GENSCAN 402 614 LAST EXON; p-value: NaN. SQ SEQUENCE 614 AA; 69034 MW; FA76E31041414FAF CRC64; MDLLREEILK KRKSLAEESG GKKFFKRSEI EQKKIQKLRE EERREHELKA QRRAAAAASG GDGKSSGSAP GSSNAATSAS SKSSASDAAA IADSKALTDE NLILPRQEVI RRLRFLKQPM TLFGEDDQSR LDRLKYVLKE GLFEVDSDMT EGQTNDFLRD IAELKKRQKS GMMGDRKRKS RDERGRDEGD RGETREDELS GGESSDVDAD KDMKRLKANF EDLCDEDKIL VFYKKLLIEW KQELDAMENT ERRTAKGKQM VATFKQCARY LVPLFNLCRK KGLPADIRQA LMVMVNHCIK RDYLAAMDHY IKLAIGNAPW PIGVTMVGIH ERSAREKIYT NSVAHIMNDE TTRKYLQSVK RLMTFCQRRY PTMPSKAVEF NSLANGSDLQ SLLAEERFFG EKDSEKTEET MATQAAGIFS PAITTTTSAV KKLHLFSSSH RPKSLSFTKT AIRAEKTESS SAAPAVKEAP VGFTPPQLDP NTPSPIFAGS TGGLLRKAQV EEFYVITWNS PKEQIFEMPT GGAAIMREGP NLLKLARKEQ CLALGTRLRS KYKITYQFYR VFPNGEVQYL HPKDGVYPEK ANPGREGVGL NMRSIGKNVS PIEVKFTGKQ SYDL // ID NC003070_154 HYPOTHETICAL; PRT; 1304 AA. AC NC003070_154; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[756651...756735, 757554...757686, DE 757937...758006, 761231...761810, 761886...762391, 762510...762578, DE 763013...763210, 763299...763430, 763980...764066, 764157...764597, DE 764699...764866, 765025...765255, 765485...765600, 765753...765985, DE 767167...767298, 769794...770527]; Length: 3915. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 28 FIRST EXON; p-value: NaN. FT GENSCAN 29 29 AA on splice site: a/tt -> I. FT GENSCAN 30 72 INTERNAL EXON; p-value: NaN. FT GENSCAN 73 73 AA on splice site: at/t -> I. FT GENSCAN 74 96 INTERNAL EXON; p-value: NaN. FT GENSCAN 97 289 INTERNAL EXON; p-value: NaN. FT GENSCAN 290 290 AA on splice site: g/gt -> G. FT GENSCAN 291 458 INTERNAL EXON; p-value: NaN. FT GENSCAN 459 481 INTERNAL EXON; p-value: NaN. FT GENSCAN 482 547 INTERNAL EXON; p-value: NaN. FT GENSCAN 548 591 INTERNAL EXON; p-value: NaN. FT GENSCAN 592 620 INTERNAL EXON; p-value: NaN. FT GENSCAN 621 767 INTERNAL EXON; p-value: NaN. FT GENSCAN 768 823 INTERNAL EXON; p-value: NaN. FT GENSCAN 824 900 INTERNAL EXON; p-value: NaN. FT GENSCAN 901 938 INTERNAL EXON; p-value: NaN. FT GENSCAN 939 939 AA on splice site: gg/g -> G. FT GENSCAN 940 1016 INTERNAL EXON; p-value: NaN. FT GENSCAN 1017 1017 AA on splice site: a/gc -> S. FT GENSCAN 1018 1060 INTERNAL EXON; p-value: NaN. FT GENSCAN 1061 1061 AA on splice site: a/aa -> K. FT GENSCAN 1062 1304 LAST EXON; p-value: NaN. SQ SEQUENCE 1304 AA; 145838 MW; 593BA63562376600 CRC64; MSFYMTYLAR WPDYFHVAEG PGNRVMGYIM GKVEGQGESW HGHVTAVTVS PEYRRQQLAK KLMNLLEDIS DKIDKAYFVD LFVRASNTPA IKMYEKLWDG PIGYLRDKKT NQKHRRRYLT LSLSLSMRTL ISHRQCVTSP FLISAASPPF PGRCFKLSSF TPPRHRRFSS LSIRNISHES ADQTSSSRPR TLYPGGYKRP ELAVPGLLLR LDADEVMSGN REETLDLVDR ALAKSVQIVV IDGGATAGKL YEAACLLKSL VKGRAYLLIA ERVDIASAVG ASGVALSDEG LPAIVARNTL MGSNPDSVLL PLVARIVKDV DSALIASSSE GADFLILGSG EEDTQVADSL LKSVKIPIYV TCRGNEEAKE ELQLLKSGVS GFVISLKDLR SSRDVALRQS LDGAYVVNNH ETQNMNELPE KKNSAGFIKL EDKQKLIVEM EKSVLRETIE IIHKAAPLME EVSLLIDAVS RIDEPFLMVI VGEFNSGKST VINALLGKRY LKEGVVPTTN EITFLCYSDL ESEEQQRCQT HPDGQYVCYL PAPILKDINI VDTPGTNVIL QRQQRLTEEF VPRADLLVFV LSADRPLTES EVAFLRYTQQ WKKKFVFILN KSDIYRDARE LEEAISFVKE NTRKLLNTEN VILYPVSARS ALEAKLSTAS LVGRDDLEIA DPGSNWRVQS FNELEKFLYS FLDSSTATGM ERIRLKLETP MAIAERLLSS VEALVRQDCL AAREDLASAD KIISRTKEYA LKMEYESISW RRQALSLIDN ARLQVVDLIG TTLRLSSLDL AISYVFKGEK SASVAATSKV QGEILAPALT NAKELLGKYA EWLQSNTARE GSLSLKSFEN KWPTYVNSKT QLGIDTYDLL QKTDKVSLKT IQNLSAGTTS KRLEQDIREV FFVTVGGLGA AGLSASLLTS VLPTTLEDLL ALGLCSAGGY VAIANFPYRR QAIIGKVNKV ADALAQQLED AMQKDLSDAT SNLVNFVNIV AKPYREEAQL RLDRLLGIQK ELSDIRSKSK TKKGENTRKH SDWRMRNSDG QGSLEREKIR YRFLLIALLT KHRKMSLVVC QPQINESFYQ KDDMGGLSFL QSMSDITSIA QTKEDKAYVH PMEKRSVSKL NEKSLEMCTE SLGTETGSES GDELSLLAFE ATTTPRAPPR QLKPQEDTNL PDKTPPMSRN NSFPPPIKFV EDSKYNRMVR WLGEDGRIVV QAIRVSSPPS CFVSERGEGR LRLILTSESS LLSHNHEEEE EEETEEGIDE ETSENLEGKS GNKKFSRFSR RCKENGREPK PMLTWKQQQF WVAT // ID NC003070_155 HYPOTHETICAL; PRT; 958 AA. AC NC003070_155; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[773543...773691, 774096...774153, DE 774231...774539, 774652...774700, 775808...776355, 776415...776497, DE 776571...776762, 777116...777237, 777336...777739, 777872...778065, DE 778153...778333, 778486...778610, 778807...778896, 779002...779080, DE 779174...779250, 779538...779646, 779744...779840, 780513...780523]; DE Length: 2877. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 49 FIRST EXON; p-value: NaN. FT GENSCAN 50 50 AA on splice site: tc/g -> S. FT GENSCAN 51 69 INTERNAL EXON; p-value: NaN. FT GENSCAN 70 172 INTERNAL EXON; p-value: NaN. FT GENSCAN 173 188 INTERNAL EXON; p-value: NaN. FT GENSCAN 189 189 AA on splice site: g/gt -> G. FT GENSCAN 190 371 INTERNAL EXON; p-value: NaN. FT GENSCAN 372 398 INTERNAL EXON; p-value: NaN. FT GENSCAN 399 399 AA on splice site: tg/c -> C. FT GENSCAN 400 462 INTERNAL EXON; p-value: NaN. FT GENSCAN 463 463 AA on splice site: ag/t -> S. FT GENSCAN 464 503 INTERNAL EXON; p-value: NaN. FT GENSCAN 504 504 AA on splice site: g/ga -> G. FT GENSCAN 505 638 INTERNAL EXON; p-value: NaN. FT GENSCAN 639 702 INTERNAL EXON; p-value: NaN. FT GENSCAN 703 703 AA on splice site: ag/t -> S. FT GENSCAN 704 763 INTERNAL EXON; p-value: NaN. FT GENSCAN 764 804 INTERNAL EXON; p-value: NaN. FT GENSCAN 805 805 AA on splice site: ag/g -> R. FT GENSCAN 806 834 INTERNAL EXON; p-value: NaN. FT GENSCAN 835 835 AA on splice site: aa/g -> K. FT GENSCAN 836 861 INTERNAL EXON; p-value: NaN. FT GENSCAN 862 886 INTERNAL EXON; p-value: NaN. FT GENSCAN 887 887 AA on splice site: ag/a -> R. FT GENSCAN 888 923 INTERNAL EXON; p-value: NaN. FT GENSCAN 924 955 INTERNAL EXON; p-value: NaN. FT GENSCAN 956 956 AA on splice site: g/gt -> G. FT GENSCAN 957 958 LAST EXON; p-value: NaN. SQ SEQUENCE 958 AA; 108359 MW; EB1BD80296DCB33C CRC64; MEMAEGEGTT EENYDVDIAT TASSLGGSGV FHIINDIVGF VLYMHQQIPS VLQDMSLEFE GLQTEFMDLE TNLAEPQVKP LVRRKLMSRK REVKNEIKKL EKLMKTISSL RSALQLMIRE APGIQKVVLI LGGSPLRPQN AYELLFTQRR DHVLGYEGDF AKSKAAEALS KKTIRALIST GAGSTSYPGR RKKMIFKIED VTVYFPYDNI YPEQYEYMVE LKRALDAKGH CLLEMPTGTG KTIALLSLIT SYRLSRPDSP IKLVYCTRTV HEMEKTLGEL KLLHDYQVRH LGTQAKILAL GLSSRKNLCV NTKVLAAENR DSVDAACRKR TASWVRALST ENPNVELCDF FENYEKAAEN ALLPPGVYTL ELECLFRGYL HILPDGICVF RFLGSKGVCY QYLLDPKVAG FISKELQKES VVVFDEAHNI DNVCIEALSV SVRRVTLEGA NRNLNKIRQE IDSSYYIIEI VQLVGRFKAT DAGRLRAEYN RLVEGLALRG DLSGGDQWLA NPALPHDILK EAVPGNIRRA EHFVHVLRRL LQYLGVRLDT ENVEKESPVS FVSSLNSQAG IEQKTLKFCY DRLQSLMLTL EITDTDEFLP IQTVCDFATL VGTYARGFSI IIEPYDERMP HIPDPILQLS CHDASLAIKP VFDRFQSVVI TSGTLSPIDL YPRLLNFTPV VSRSFKMSMT RDCICPMVLT RGSDQLPVST KFDMRSDPGV VRNYGKLLVE MVSIVPDGVV CFFVSYSYMD GIIATWNETG ILKEIMQQKL VFIETQDVVE TTLALDNYRR ACDCGRGAVF FSVARGKVAE GIDFDRHYGR LVVMYGVPFQ YTLSKILRAR LEYLHDTFQI KEGDFLTFDA LRQAAQCVGR VIRSKADYGM MIFADKRYSR HDKRSKLPGW ILSHLRDAHL NLSTDMAIHI AREFLRKMAQ PYDKAGTMGR KTLLTQEDLE KMAETGFH // ID NC003070_156 HYPOTHETICAL; PRT; 268 AA. AC NC003070_156; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[782948...783175, 783318...783596, DE 783681...783842, 783943...784080]; Length: 807. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 76 FIRST EXON; p-value: NaN. FT GENSCAN 77 169 INTERNAL EXON; p-value: NaN. FT GENSCAN 170 223 INTERNAL EXON; p-value: NaN. FT GENSCAN 224 268 LAST EXON; p-value: NaN. SQ SEQUENCE 268 AA; 29860 MW; 32C4C669FAD9EE08 CRC64; MAEKKRVKYF IVDAFAESAF KGNPAAVCFL EDDNERDDAW LQSLATEFNI SETCFLTPII GDLPRFRLRW FTPVAEVDLC GHATLASAHV LFSTGLVGSE TVEFDTLSGI LTAKHLKNDD DGEESSIELD FPVVPTYEVN YIDDDLSLFS KALNGATILD VKATKKDLLV VLSSWEAVID LKPRLDEISK CPCEGMMVTA AASDGSTYDF CSRYFAPRFG INEDPVTGSA HCALAHYWSL RMNKCDFFAY QVLSLFLSTQ TLKRQILK // ID NC003070_157 HYPOTHETICAL; PRT; 91 AA. AC NC003070_157; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[784447...784722]; Length: 276. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 91 SINGLE EXON; p-value: NaN. SQ SEQUENCE 91 AA; 10600 MW; EBCB45170E6D1CA4 CRC64; MANATHKASC ELHIRKVFED VDPFKHTQNA SKEVENYELD HIDFISSYED SVVSHIHERC NIRGYHKDVY HNVVDCGFAL LAFTRQVNHI D // ID NC003070_158 HYPOTHETICAL; PRT; 433 AA. AC NC003070_158; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[787143...788444]; Length: 1302. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 433 SINGLE EXON; p-value: NaN. SQ SEQUENCE 433 AA; 45718 MW; 7214FC4B8BA72962 CRC64; MAPSPIIFSV LLLFIFSLSS SAQTPFRPKA LLLPVTKDQS TLQYTTVINQ RTPLVPASVV FDLGGRELWV DCDKGYVSST YQSPRCNSAV CSRAGSTSCG TCFSPPRPGC SNNTCGGIPD NTVTGTATSG EFALDVVSIQ STNGSNPGRV VKIPNLIFDC GATFLLKGLA KGTVGMAGMG RHNIGLPSQF AAAFSFHRKF AVCLTSGKGV AFFGNGPYVF LPGIQISSLQ TTPLLINPVS TASAFSQGEK SSEYFIGVTA IQIVEKTVPI NPTLLKINAS TGIGGTKISS VNPYTVLESS IYNAFTSEFV KQAAARSIKR VASVKPFGAC FSTKNVGVTR LGYAVPEIEL VLHSKDVVWR IFGANSMVSV SDDVICLGFV DGGVNARTSV VIGGFQLEDN LIEFDLASNK FGFSSTLLGR QTNCANFNFT STA // ID NC003070_159 HYPOTHETICAL; PRT; 434 AA. AC NC003070_159; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[790110...791414]; Length: 1305. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 434 SINGLE EXON; p-value: NaN. SQ SEQUENCE 434 AA; 46149 MW; 17DD7684008FDAFC CRC64; MASSRIIIFS VLLLSIFSLS SSAQPSFRPK ALLLPVTKDP STLQYTTVIN QRTPLVPASV VFDLGGREFW VDCDQGYVST TYRSPRCNSA VCSRAGSIAC GTCFSPPRPG CSNNTCGAFP DNSITGWATS GEFALDVVSI QSTNGSNPGR FVKIPNLIFS CGSTSLLKGL AKGAVGMAGM GRHNIGLPLQ FAAAFSFNRK FAVCLTSGRG VAFFGNGPYV FLPGIQISRL QKTPLLINPG TTVFEFSKGE KSPEYFIGVT AIKIVEKTLP IDPTLLKINA STGIGGTKIS SVNPYTVLES SIYKAFTSEF IRQAAARSIK RVASVKPFGA CFSTKNVGVT RLGYAVPEIQ LVLHSKDVVW RIFGANSMVS VSDDVICLGF VDGGVNPGAS VVIGGFQLED NLIEFDLASN KFGFSSTLLG RQTNCANFNF TSTA // ID NC003070_160 HYPOTHETICAL; PRT; 447 AA. AC NC003070_160; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[798093...798010, 797789...797728, DE 797600...797546, 797411...797313, 796892...796776, 796631...796536, DE 796018...795988, 795188...795102, 794969...794899, 794650...794542, DE 794410...794328, 794180...794022, 793930...793829, 793789...793601]; DE Length: 1344. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 28 FIRST EXON; p-value: NaN. FT GENSCAN 29 48 INTERNAL EXON; p-value: NaN. FT GENSCAN 49 49 AA on splice site: tt/g -> L. FT GENSCAN 50 67 INTERNAL EXON; p-value: NaN. FT GENSCAN 68 100 INTERNAL EXON; p-value: NaN. FT GENSCAN 101 139 INTERNAL EXON; p-value: NaN. FT GENSCAN 140 171 INTERNAL EXON; p-value: NaN. FT GENSCAN 172 181 INTERNAL EXON; p-value: NaN. FT GENSCAN 182 182 AA on splice site: g/cg -> A. FT GENSCAN 183 210 INTERNAL EXON; p-value: NaN. FT GENSCAN 211 211 AA on splice site: g/ga -> G. FT GENSCAN 212 234 INTERNAL EXON; p-value: NaN. FT GENSCAN 235 270 INTERNAL EXON; p-value: NaN. FT GENSCAN 271 271 AA on splice site: g/gt -> G. FT GENSCAN 272 298 INTERNAL EXON; p-value: NaN. FT GENSCAN 299 351 INTERNAL EXON; p-value: NaN. FT GENSCAN 352 385 INTERNAL EXON; p-value: NaN. FT GENSCAN 386 447 LAST EXON; p-value: NaN. SQ SEQUENCE 447 AA; 49989 MW; 20DA0D280562E068 CRC64; MSFTPSTFRI AISLLLLVAI VSAVIFLPKL KDFLLWIKED LGPFGPLALA LAYIPLTIVA VPASVLTLGG GYLFGLPVGF VADSLGATLG ATAAFLLGRT IVLLLRVVPI LPFNMLNYLL SVTPVRLGEY MLATWLGMMQ PITFALVYVG TTLKDLSDIT HGWHEVSVFR WVIMMVGVAL AADFLTFNCS IMAATNVMRR DESLLIDPQR GDTSVSRGLS LEKKIEALES LAGQVSNRRS RRWLNDRILM ELVPRLDAQE IRGLFAPPPW GDDVPPSAFS LTNVGEWDKF RNIDMDKEAN IMDSLNRSSV RQKGSVDGDK IAVLNAWRRI DCRTRDALRR SFLPELIEGY ESCISHFIEE GGEGDVLELK VQDPFHRLLL HGVCEMLNED IESNNLKETV FFGAVSQFGI NNCNRTDRKD SNEDNEHQME EEWRFRGETK HQPSAFP // ID NC003070_161 HYPOTHETICAL; PRT; 1338 AA. AC NC003070_161; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[799191...799431, 799641...799717, DE 799827...799910, 799999...800112, 800382...800465, 800629...800700, DE 801356...801429, 801517...801889, 802012...802058, 802163...802223, DE 802320...802377, 803543...803687, 803835...803957, 804247...804367, DE 804961...805070, 805167...805350, 805860...806141, 806196...806409, DE 806661...806737, 807951...808017, 808185...808394, 808628...808822, DE 808913...809214, 809409...809582, 809911...810090, 810188...810234, DE 810380...810680]; Length: 4017. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 80 FIRST EXON; p-value: NaN. FT GENSCAN 81 81 AA on splice site: g/ct -> A. FT GENSCAN 82 106 INTERNAL EXON; p-value: NaN. FT GENSCAN 107 134 INTERNAL EXON; p-value: NaN. FT GENSCAN 135 172 INTERNAL EXON; p-value: NaN. FT GENSCAN 173 200 INTERNAL EXON; p-value: NaN. FT GENSCAN 201 224 INTERNAL EXON; p-value: NaN. FT GENSCAN 225 248 INTERNAL EXON; p-value: NaN. FT GENSCAN 249 249 AA on splice site: ag/g -> R. FT GENSCAN 250 373 INTERNAL EXON; p-value: NaN. FT GENSCAN 374 388 INTERNAL EXON; p-value: NaN. FT GENSCAN 389 389 AA on splice site: ag/g -> R. FT GENSCAN 390 409 INTERNAL EXON; p-value: NaN. FT GENSCAN 410 428 INTERNAL EXON; p-value: NaN. FT GENSCAN 429 429 AA on splice site: g/tt -> V. FT GENSCAN 430 476 INTERNAL EXON; p-value: NaN. FT GENSCAN 477 477 AA on splice site: aa/g -> K. FT GENSCAN 478 517 INTERNAL EXON; p-value: NaN. FT GENSCAN 518 518 AA on splice site: ag/g -> R. FT GENSCAN 519 558 INTERNAL EXON; p-value: NaN. FT GENSCAN 559 594 INTERNAL EXON; p-value: NaN. FT GENSCAN 595 595 AA on splice site: aa/a -> K. FT GENSCAN 596 656 INTERNAL EXON; p-value: NaN. FT GENSCAN 657 750 INTERNAL EXON; p-value: NaN. FT GENSCAN 751 821 INTERNAL EXON; p-value: NaN. FT GENSCAN 822 822 AA on splice site: g/ct -> A. FT GENSCAN 823 847 INTERNAL EXON; p-value: NaN. FT GENSCAN 848 869 INTERNAL EXON; p-value: NaN. FT GENSCAN 870 870 AA on splice site: g/tc -> V. FT GENSCAN 871 939 INTERNAL EXON; p-value: NaN. FT GENSCAN 940 940 AA on splice site: g/aa -> E. FT GENSCAN 941 1004 INTERNAL EXON; p-value: NaN. FT GENSCAN 1005 1005 AA on splice site: g/at -> D. FT GENSCAN 1006 1105 INTERNAL EXON; p-value: NaN. FT GENSCAN 1106 1163 INTERNAL EXON; p-value: NaN. FT GENSCAN 1164 1223 INTERNAL EXON; p-value: NaN. FT GENSCAN 1224 1238 INTERNAL EXON; p-value: NaN. FT GENSCAN 1239 1239 AA on splice site: ca/a -> Q. FT GENSCAN 1240 1338 LAST EXON; p-value: NaN. SQ SEQUENCE 1338 AA; 149629 MW; 54D93CDAEB147016 CRC64; MVVLSTLALV RAAYSLNSFV FEAEDIRFGS PWWFVVVGVA CFLVLFAGIM SGLTLGLMSL GLVELEILQQ SGSSAEKKQA AAILPVVKKQ HQLLVTLLLC NAAAMEALPI CLDKIFHPFV AVLLSVTFVL AFGEIIPQAI CSRYGLAVGA NFLWLVRILM IICYPIAYPI GKVLDAVIGH NDTLFRRAQL KALVSIHSQE AGKGGELTHE ETMIISGALD LSQKVKSLLT VRAETEAPVS SVSIRKIPRV PSDMPLYDIL NEFQKGSSHM AAVVKVKDKD KKNNMQLLSN GETPKENMKF YQSSNLTAPL LKHESHDVVV DIDKVPKHVK NRGRNFQQNG TVTRDLPCLL EDNEDAEVIG IITLEDVFEE LLQAEIVDET DVYIDVHKRV RVAAAAAAAV SSITRASPAE IQSKVGQTVK KLVGKEARVY SLYQCRFCIK HEPNTASLTG NVSIWVVNSM EKSGPVQKAV VLQPFVKLVR LVARAFYDDY TTKSDNQQKS ARSDNRGIAA VVLDALARRQ WVREEDLAKD LQLHAKQLRK IIRLFEEEKL IMRDHRKEIC DVVRFRLHRM KKRLKDELED KNTVQEYGCP NCQRKYNALD ALRLISMVDD SFHCENCNGE LVVECNKLTS EEVVDGDDNA RRRRRENLKN MLQKLEVEVN LGDGNEDVKS KGGDSSLKVL PPWMIKEGMN LTEEQRGEMR QEAKVDGGAG AAAKLSDDKK SAIGNGDEKD LKACFWGQIL YIYLDSHSSF DEYLKAYYAE LMKQQELAAR RNQQESAGEP TSGIQSGTVY SGRQVSMKAK REEDEDEDEE EVEWEEKAPV TANGNYKVDL NVEAEASGGE EEEEEDDSLL TFTDGFRFCL SISNGNISSV LLKPHYIQYY VPLQIDARIL RAVAIEHPKD ADEAAAVVLS EIIPSFSSNL FHNFTQSSYK SSGSISEREE TLPLVVTRDH NTRALSTDLV SNMNELTTLQ PNVDPDVCHK DLESEEIQSV KKARGKENGN YDLFDNVKLA SNFWEDLGFD ITWNQAENAV SKLVDSTPGD TMTTTQQGSC FEVGHGSTNL VDETSNRSLF SENGDTEIGD AFSTSTHVCS VDQLEDIIED AKSNKKNLLT EMETVTNIMR EVELKEKDAE KSKEEAARGG LDTLQKVEEL KKMLEHAKEA NDMMRGSLEI RLAAALELKK TAEKEKKDKE DSALKALAEQ EANMEKVVQE SKLLQQEAEE NSKLRDFLMD RGQIVDTLQG EISVICQDVK LLKEKFENRV PLTKSISSSF TSSCGSSMKS LVLENPSERL NGVTETSNNN KFPEAAAFFM NKEKDDCRDL LEDGWDIFDK ETEQVVWY // ID NC003070_162 HYPOTHETICAL; PRT; 670 AA. AC NC003070_162; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[813086...812030, 811988...811033]; Length: DE 2013. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 352 FIRST EXON; p-value: NaN. FT GENSCAN 353 353 AA on splice site: g/ca -> A. FT GENSCAN 354 670 LAST EXON; p-value: NaN. SQ SEQUENCE 670 AA; 76291 MW; 7022A17E1CDCD90A CRC64; MAIFKDCEVE IFSEEDGFRN AWYRAILEET PTNPTSESKK LRFSYMTKSL NKEGSSSPPT VEQRFIRPVP PENLYNGVVF EEGTMVDADY KHRWRTGVVI NKMENDSYLV LFDCPPDIIQ FETKHLRAHL DWTGSEWVQP EVRELSKSMF SPGTLVEVSC VIDKVEVSWV TAMIVKEIEE SGEKKFIVKV CNKHLSCRVD EAKPNMTVDS CCVRPRPPLF FVEEYDLRDC VEVFHGSSWR QGVVKGVHIE KQYTVTLEAT KDKLVVKHSD LRPFKVWEDG VWHNGPQQKP VKESPSNAIK QKPMCSSSGA RPMTPKMATK HARISFNPEE NVEELSVAET VAATGKLEKM GIAEESVSCV TPLKQTEANA EGNKLEPMRN QNCLRNDSTQ QMLPEEENSK DGSTKRKREE KHNSASSVMD EIDGTCNGSE SEISNTGKSI CNNDDVDDQP LSTELPYYQS LSVVNSFAAD AEETPAKSAR TISPFAKKLP FWKSYETDEL YKSLPQSPHF SPLFKAKEDI REWSAVGMMV TFYCLLKEVK DLQLDDSSSK LSSLSSSLAE LEKHGFNVTD PLSRISKVLP LQDKRAKKAE ERKCLEKKIE CEEIERKRFE EEFADFERII IEKKRQALVA KEKKEAADKR IGEMKTCAET IDQEIKDEEL EFQTTVSTPW // ID NC003070_163 HYPOTHETICAL; PRT; 882 AA. AC NC003070_163; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[813975...816623]; Length: 2649. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 882 SINGLE EXON; p-value: NaN. SQ SEQUENCE 882 AA; 98885 MW; 277940176F03B06D CRC64; MAAWSPSVGI GSCCLNNGIT RTWKFPSARL FTGRKNKIKL GSETLMFTRK RFMGDLVTSA LQSYQFSKIC ASKTSIELRE ALSSRRAEAD DLKKVTSYSF RTKAGALVKV KVEKKREKYS ILVYVSSLEL SGDDKSRLVM VWGVYRSDSS CFLPLDFENS SQDSQTHTTE TTFVKSSLSE LMLGLEFDGK ESPFYLSFHL KLVSGRDPDG QEMLTHRDTD FCIPVGFTAG HPLPLGLSSG PDDDSWNFSF FSRSSTNVVL CLYDDSTTDK PALELDLDPY VNRTGDVWHA SVDNTWDFVR YGYRCKETAH SKEDVDVEGE PIVLDPYATV VGKSVSQKYL GSLSKSPSFD WGEDVSPNIP LEKLLVYRLN VKGFTQHRSS KLPSNVAGTF SGVAEKVSHL KTLGTNAVLL EPIFSFSEQK GPYFPFHFFS PMDIYGPSNS LESAVNSMKV MVKKLHSEGI EVLLEVVFTH TADSGALRGI DDSSYYYKGR ANDLDSKSYL NCNYPVVQQL VLESLRYWVT EFHVDGFCFI NASSLLRGVH GEQLSRPPLV EAIAFDPLLA ETKLIADCWD PLEMMPKEVR FPHWKRWAEL NTRYCRNVRN FLRGRGVLSD LATRICGSGD VFTDGRGPAF SFNYISRNSG LSLVDIVSFS GPELASELSW NCGEEGATNK SAVLQRRLKQ IRNFLFIQYI SLGVPVLNMG DECGISTRGS PLLESRKPFD WNLLASAFGT QITQFISFMT SVRARRSDVF QRRDFLKPEN IVWYANDQTT PKWEDPASKF LALEIKSESE EEETASLAEP NEPKSNDLFI GFNASDHPES VVLPSLPDGS KWRRLVDTAL PFPGFFSVEG ETVVAEEPLQ QLVVYEMKPY SCTLFETINT TA // ID NC003070_164 HYPOTHETICAL; PRT; 220 AA. AC NC003070_164; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[817027...817096, 817175...817767]; Length: DE 663. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 23 FIRST EXON; p-value: NaN. FT GENSCAN 24 24 AA on splice site: g/at -> D. FT GENSCAN 25 220 LAST EXON; p-value: NaN. SQ SEQUENCE 220 AA; 24685 MW; E75ECBBAE396D5A7 CRC64; MNDGCCSKCQ VKATKKKKSK KNEDEPDEED VKQKQKVSKN ENPKKDSNGE KGSDEKDKKN KNVLRWPQDS STKKEKGDGK NLSSEENGGK NLPSEKNGVN PPPPCYTAQP MMYHGGHNPL HRWPISAASR SHYPPFSMGA MHGPYGGCGP CGGPIDPYQS MPRPRMFDGV HISQYPPMAA AYPYMAAAGP FPYWQTRPYM DANPMKRYTT YAENYSYFYS // ID NC003070_165 HYPOTHETICAL; PRT; 333 AA. AC NC003070_165; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[819712...819783, 819897...820077, DE 820159...820501, 820595...820666, 820738...820964, 821012...821080, DE 821723...821760]; Length: 1002. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 24 FIRST EXON; p-value: NaN. FT GENSCAN 25 84 INTERNAL EXON; p-value: NaN. FT GENSCAN 85 85 AA on splice site: g/aa -> E. FT GENSCAN 86 198 INTERNAL EXON; p-value: NaN. FT GENSCAN 199 199 AA on splice site: cg/a -> R. FT GENSCAN 200 222 INTERNAL EXON; p-value: NaN. FT GENSCAN 223 223 AA on splice site: ag/t -> S. FT GENSCAN 224 298 INTERNAL EXON; p-value: NaN. FT GENSCAN 299 299 AA on splice site: g/gt -> G. FT GENSCAN 300 321 INTERNAL EXON; p-value: NaN. FT GENSCAN 322 322 AA on splice site: t/ta -> L. FT GENSCAN 323 333 LAST EXON; p-value: NaN. SQ SEQUENCE 333 AA; 38032 MW; 8D4C5BF3BFCC6EA2 CRC64; MASHSYSTLR FSLLQELRSN EIIKTSRAYL VNPGGRKYEI LPESSFNLKS QLLEPLKPFS SFSKSNHFLE FDSTMMKHRL MDVHETGPDP VCLSLGITQQ YARKEEVLEF LLSRSEEELK EEGFDLSLLS ELMGLDALRS SSQQPYAKPL LDLMVDANIL FSSSRAELND LVSTAAEFHR LRNSTRWRKL SRLVPQFQRF DSEVPIDTLQ LPEDAVTLAP PKSPKKTRLK PSPKKQNPKI RDKEYDLYER NRLHACESLL SLMIGNEQHR KTTMLSLKKS RGELFELLTQ CSIGFAGTGT LLRKPILRGC NEFEFGIALV VLNSQRTSIV VLH // ID NC003070_166 HYPOTHETICAL; PRT; 470 AA. AC NC003070_166; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[824246...822834]; Length: 1413. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 470 SINGLE EXON; p-value: NaN. SQ SEQUENCE 470 AA; 51896 MW; B14F987CF1F3DB7C CRC64; MNFFKSVFTE DLDPPETESE SDSPKHSEEH EHPEQEHPEQ SESNDDGGWS FGGLMKTLAT RSESVIETYR RDLEEFGTGL KKEIEVAQGS LGTVGHAIDE LGNTVLKGTA EIIAQGKEAI LAAGNESDSS DNNSSQSFGR RDSFSSKPYS RFDAQIRAVQ GDLNTYCEEP EDSDDYKKWE SAFSLDGKAE EMEKLLEENG DMKGVYKRVV PSMVDHETFW FRYFYRVNKL KQAEDLRANL VKRAISLDDE EELSWDIDDE EESSEKVVEA TKDVSRLKLE GNDGMGGGDV SETVKDEVES TYSVAKVSTQ DEVTSADSVT EVSNVGLKTD KDSEEKKETD SEEVPEEKSF VDAAPPASDE APIQDSVKPT SDEAPIQDSV KPKSDEAAPS QDSAKPDVAA SSSTQQPSEE DLGWDEIEDM SSIDGKETSR SGGSPNRAEL RKRLSAAEED EDLSWDIDED DEEESSSSKA // ID NC003070_167 HYPOTHETICAL; PRT; 322 AA. AC NC003070_167; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[824653...824813, 824891...824992, DE 825102...825147, 825253...825342, 825422...825487, 825587...825655, DE 825745...826179]; Length: 969. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 53 FIRST EXON; p-value: NaN. FT GENSCAN 54 54 AA on splice site: aa/g -> K. FT GENSCAN 55 87 INTERNAL EXON; p-value: NaN. FT GENSCAN 88 88 AA on splice site: ag/g -> R. FT GENSCAN 89 103 INTERNAL EXON; p-value: NaN. FT GENSCAN 104 133 INTERNAL EXON; p-value: NaN. FT GENSCAN 134 155 INTERNAL EXON; p-value: NaN. FT GENSCAN 156 178 INTERNAL EXON; p-value: NaN. FT GENSCAN 179 322 LAST EXON; p-value: NaN. SQ SEQUENCE 322 AA; 36575 MW; A6B87D8C0BF17786 CRC64; MVMRKLQLPL SQTQKVRFER AIERLQSLSS SANSDASVIV TDSIPVNHDD AFLKGHGTSE VDGELLATVC GVVERVDKLV YVRTLRARYK PEVGDIVVGR VIEVAQKRWR VELNFNQDGV LMLSSMNMPD GIQRRRTSVD ELNMRNIFVE HDVVCAEVRN FQHDGSLQLQ ARSQKYGKLE KGQLLKVDPY LVKRSKHHFH YVESLGIDLI IGCNGFIWVG EHVEVRDPMA IDDQKDEEMI SSSSTGKEQS HIPLETRQTI CRIGNAIRVL SNLGFTVTLE VIMETVNLSN SKNIDIHDML GSEFHVVVAE NEAERRRTKR KK // ID NC003070_168 HYPOTHETICAL; PRT; 2766 AA. AC NC003070_168; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[827182...829182, 829721...829958, DE 830861...831313, 831420...832640, 832753...832935, 833084...833367, DE 833415...833570, 833694...833956, 834254...834419, 834514...834987, DE 836209...836328, 836724...837233, 837471...837908, 838069...838313, DE 838497...839226, 839311...839468, 839677...839804, 839919...840328, DE 840507...840629]; Length: 8301. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 667 FIRST EXON; p-value: NaN. FT GENSCAN 668 746 INTERNAL EXON; p-value: NaN. FT GENSCAN 747 747 AA on splice site: a/tt -> I. FT GENSCAN 748 897 INTERNAL EXON; p-value: NaN. FT GENSCAN 898 898 AA on splice site: g/gt -> G. FT GENSCAN 899 1304 INTERNAL EXON; p-value: NaN. FT GENSCAN 1305 1305 AA on splice site: g/gc -> G. FT GENSCAN 1306 1365 INTERNAL EXON; p-value: NaN. FT GENSCAN 1366 1366 AA on splice site: g/aa -> E. FT GENSCAN 1367 1460 INTERNAL EXON; p-value: NaN. FT GENSCAN 1461 1512 INTERNAL EXON; p-value: NaN. FT GENSCAN 1513 1599 INTERNAL EXON; p-value: NaN. FT GENSCAN 1600 1600 AA on splice site: aa/g -> K. FT GENSCAN 1601 1655 INTERNAL EXON; p-value: NaN. FT GENSCAN 1656 1813 INTERNAL EXON; p-value: NaN. FT GENSCAN 1814 1853 INTERNAL EXON; p-value: NaN. FT GENSCAN 1854 2023 INTERNAL EXON; p-value: NaN. FT GENSCAN 2024 2169 INTERNAL EXON; p-value: NaN. FT GENSCAN 2170 2250 INTERNAL EXON; p-value: NaN. FT GENSCAN 2251 2251 AA on splice site: ct/c -> L. FT GENSCAN 2252 2494 INTERNAL EXON; p-value: NaN. FT GENSCAN 2495 2546 INTERNAL EXON; p-value: NaN. FT GENSCAN 2547 2547 AA on splice site: ag/g -> R. FT GENSCAN 2548 2589 INTERNAL EXON; p-value: NaN. FT GENSCAN 2590 2590 AA on splice site: g/gt -> G. FT GENSCAN 2591 2726 INTERNAL EXON; p-value: NaN. FT GENSCAN 2727 2766 LAST EXON; p-value: NaN. SQ SEQUENCE 2766 AA; 306871 MW; CBA15697BE6A9CC3 CRC64; MSIVQKQEEM NGCGLNVDKV EAFTVSPQEK GRKNKRKLAD PSQPNASSLT EFPPYELPSL KPQNHLSGNG SVGEVSNQLQ VEVSESVEWD DPFACHLEEL LSSNLLTLFL DTMKQLIDLG YTDDEVLKAV SRCRLYCGGN NLLSNIVNNT LSALKTGDEG AGSGDYVFED LQQLVSYTLV EMISLIKEVR PSLSTVEAMW RLLMCDLNVL QAFEAEGDGL VSSSKLSDSE SLGAESNPPK SSDPDNPKPP QSDPQSNRNE PLKFGNFPNT PNSKKTQSSG TTPGKEVCSG STVSCQGMRS TSFTLVSDEK LVSCRKGRTK KEIAMLRQKS CVEKIRTYSK GSGYKAAKFA SVGSFLLEKR VKSSSEFVPR NSSSKITAEI GVKVSLAEDS GCFVRKNSKL DSPVVVVDAK GYITALPARS VKSASKKKTG SESVTLIPSA SEKKSDSSIP STSEKKSGSE SEEKASVSAK LAPDYYAGIP YDAALGIYVP RDKKDELILK LVPRVNDLQN ELQVWTDWAN QKVKEATGRL LKDQPELKAL RKEREEAEQY KKEKQLLEEN TRKRLSEMDF ALKNATSQLE KAFNTAHRLE LEQSILKKEM EAAKIKAVES AESFREAKER GERSLKDIHS WEGQKIMLQE ELKGQREKVT VLQKEVTKAK NRQNQIEAAL KQERTAKGKL SAQASLIRKE TKELEALGKV EEERIKGKAE TDVKYYIDNI KRLEREISEL KLKSDYSRII ALKKGSIFFS IRRISGTVLP YTRHRQFDFV SLPLREFISV GEMKLQVRVV EARNLPAMDL NGFSDPYVRL QLGKQRSRTK VVKKNLNPKW TEDFSFGVDD LNDELVVSVL DEDKYFNDDF VGQVRVSVSL VFDAENQSLG TVWYPLNPKK KGSKKDCGEI LLKICFSQKN SVLDLTSSGD QTSASRSPDL RLESPIDPST CASPSRSDDA SSIPQTTFAG RFTQIFQKNA ITATPTQSSS RSIDASDLSE ISKPVFSLEL SEDESSSTSF EELLKAMESK DQGSEPPSNL SGGVVVDQLF MISPSDLNIV LFASDSSFYA SLTELQGTTE VQIGPWKAEN DGESVKRVVS YLKAATKLIK AVKGTEEQTY LKADGEVYAV LASVATPDVP FGGTFKVEVL YCISPGPELP SGEQCSRLVV SWRLNFLQST MMRGLIENGA RQGLKDNFEQ YANLLAQSVK PVDSKDIGLN KEQALSSLQA EPQSDWKLAV QYFANFTVLS TFLIGIYVFV HIVFAIPSAI QGLEFNGLDL PDSIGEFVVS GVLVLQCERV LQLISRFMQA RKQKGSDHGI KAHGDGWLLT VALIEGVDLA AVDPSGHCDP YIVFTSNGKT RTSSIKFQKS NPQWNEIFEF DAMADPPSVL NVEVFDFDGP FDEAVSLGHA EVNFVRSNIS DLADVWVPLQ GKLAQACQSK LHLRIFLDHT GGGDVVRDYL NKMEKEVGKK CCCAFLSAEW KFQINVRSPQ TNSAFQKLFG LPQEEFLIND FTCHLKRKMP LQGRLFLSAR IVGFYASIFG NKTKFFFLWE DIEEIQVLPP TLASMGSPIV VMTLRPNRGL DARIGAKTHD EEGRLKFHFH SFVSFNVAQK TIMALWKAKS LTPEQKVQAV EEESEQKLQS EESGLFLGVD DVRFSEVFSL TLPVPVSFFM ELFGGGEVDR KAMERAGCQS YSCSPWESEK DDVYERQTYY RDKRISRYRG EVTSTQQKSL VPEKNGWLVE EVMTLHGVPL GDYFNLHLRY QMEESTSKPK TTYVRVYFGI EWLKSTRHQK RVTKNILVNL QDRLKMTFGF LEKEYSSRQQ QQQIISSCLK TVSANATNVA SSVRSAGASV AASISAAEDD KDQRLLCPQT SELRFHDQVT WAGFGILELG QHVTRHVLLL GYQNGFQVFD VEDASNFNEL VSKRGGPVSF LQMQPLPARS GDHEGFWNSH PLLLVVAGDE TNGTGLGHSF SQNGSLARDG SSDSKAGDAI NYPTTVRFYS LRSHSYVYVL RFRSSVCMIR CSSRVVAVGL ANQIYCVDAL TLENKFSVLT YPVPQPVRQG TTRVNVGYGP MAVGPRWLAY ASKSSMTMKT GRLSPQTFTS SPSLSPSSSS GGSSFMARYA MESSKQLANG LINLGDMGYK TLSKYCQDML PDGSTSPASP NAIWKVGGVS GSDAENAGMV AVKDLVSGAL VSQFKAHTSP ISALCFDPSG TLLVTASVCG NNINVFQIMP SRSHNAPGDL SYEWDLLMCI SSNCIEGSLQ LDAAFQPCEG EEPTRLPASS LPWWFTQSLS SNQQSLSPPT AVALSVVSRI KYSSFGWLNT VSNATTAATG KVFVPSGAVA AVFHKSVTHD LQLNSRTNAL EHILVYTPSG HVVQHELLPS VCTESPENGL RVQKTSHVQV QEDDLRVKVE PIQWWDVCRR SDWLETEERL PKSITEKQYD LETVSNHLTS HEDACLSLDM NSHFSEDKYL KSCSEKPPER SHCYLSNFEV KVTSGMLPVW QNSKISFHVM DSPRDSSSTG GEFEIEKVPA HELEIKQKKL LPVFDHFHST KATLEDRFSM KCYHTSATGS HQVNGKICQD IINCHSKPGS IESAESSEEG STKQMENLHD SDHMSNSIKS SLPLYPTVNG IYKEIEKNNA NGWMEKPVTA KLSTLKETRI TNGFTTPPIL TDSVNEQMLS TGKPPMGFGF ALHEEHCKAV ADPKEEHLKK KLDEVTNVHH LNVNNNNTEK LQGDKMVNSQ VLNAFGVCIL YKDFLRVGGN NFSVPFLLLY KIWLRK // ID NC003070_169 HYPOTHETICAL; PRT; 744 AA. AC NC003070_169; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[844157...843676, 843551...843230, DE 843145...842982, 842298...841032]; Length: 2235. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 160 FIRST EXON; p-value: NaN. FT GENSCAN 161 161 AA on splice site: gg/g -> G. FT GENSCAN 162 268 INTERNAL EXON; p-value: NaN. FT GENSCAN 269 322 INTERNAL EXON; p-value: NaN. FT GENSCAN 323 323 AA on splice site: aa/t -> N. FT GENSCAN 324 744 LAST EXON; p-value: NaN. SQ SEQUENCE 744 AA; 83439 MW; DDEC12B68860C00B CRC64; MESTDRSSQA KAFDEAKIGV KGLVDSGITE IPALFRATPA TLASLKSPPP PKHLTIPTVD LKGASVVEKI GEAAEKWGLF HLVNHGIPVE VLERMIQGIR GFHEQEPEAK KRFYSRDHTR DVLYFSNHDL QNSEAASWRD TLGCYTAPEP PRLEDLPAVC GEIMLEYSKE IMSLGERLFE LLSEALGLNS HHLKDMDCAK SQYMVGQHYP PCPQPDLTIG INKHTDISFL TVLLQDNVGG LQVFHEQYWI DVTPVPGALV INIGDFLQLI TNDKFISAEH RVIANGSSEP RTSVAIVFST FMRAYSRVYG PIKDLLSAEN PANLYLSNLD DIIGARVFTP SVYFYPSTNN RESFVLKRLQ DALSEVLVPY YPLSGRLREV ENGKLEVFFG EEQGVLMVSA NSSMDLADLG DLTVPNPAWL PLIFRNPGEE AYKILEMPLL IAQVTFFTCG GFSLGIRLCH CICDGFGAMQ FLGSWAATAK TGKLIADPEP VWDRETFKPR NPPMVKYPHH EYLPIEERSN LTNSLWDTKP LQKCYRISKE FQCRVKSIAQ GEDPTLVCST FDAMAAHIWR SWVKALDVKP LDYNLRLTFS VNVRTRLETL KLRKGFYGNV VCLACAMSSV ESLINDSLSK TTRLVQDARL RVSEDYLRSM VDYVDVKRPK RLEFGGKLTI TQWTRFEMYE TADFGWGKPV YAGPIDLRPT PQVCVLLPQG GVESGNDQSM VVCLCLPPTA VHTFTRLLSL NDHK // ID NC003070_170 HYPOTHETICAL; PRT; 361 AA. AC NC003070_170; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[846462...845951, 845434...845113, DE 845032...844781]; Length: 1086. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 170 FIRST EXON; p-value: NaN. FT GENSCAN 171 171 AA on splice site: gg/g -> G. FT GENSCAN 172 278 INTERNAL EXON; p-value: NaN. FT GENSCAN 279 361 LAST EXON; p-value: NaN. SQ SEQUENCE 361 AA; 40596 MW; D538420E2F85879D CRC64; MESSDRSSQA KAFDETKTGV KGLVASGIKE IPAMFHTPPD TLTSLKQTAP PSQQLTIPTV DLKGGSMDLI SRRSVVEKIG DAAERWGFFQ VVNHGISVEV MERMKEGIRR FHEQDPEVKK RFYSRDHTRD VLYYSNIDLH TCNKAANWRD TLACYMAPDP PKLQDLPAVC GEIMMEYSKQ LMTLGEFLFE LLSEALGLNP NHLKDMGCAK SHIMFGQYYP PCPQPDLTLG ISKHTDFSFI TILLQDNIGG LQVIHDQCWV DVSPVPGALV INIGDLLQLI SNDKFISAEH RVIANGSSEP RISMPCFVST FMKPNPRIYG PIKELLSEQN PAKYRDLTIT EFSNTFRSQT ISHPALHHFR I // ID NC003070_171 HYPOTHETICAL; PRT; 291 AA. AC NC003070_171; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[848120...848118, 848004...847789, DE 847672...847582, 847521...846956]; Length: 876. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 1 FIRST EXON; p-value: NaN. FT GENSCAN 2 73 INTERNAL EXON; p-value: NaN. FT GENSCAN 74 103 INTERNAL EXON; p-value: NaN. FT GENSCAN 104 104 AA on splice site: g/gc -> G. FT GENSCAN 105 291 LAST EXON; p-value: NaN. SQ SEQUENCE 291 AA; 32812 MW; 971156CBB3AA4DD9 CRC64; MIQQQLGEKT KEQTSNLTKR FQIQNRSDTE RDDDESDRSV ESYNLFHVPL TDFDLNIKWR VIFVNDYIVG INQVQSQLGL GPELQSSTVT KQEKSGHRKK RTRGPLLCNT TTGTRSLAEV YESCRKSLTN ELAGPATNLT EKTQFPTLRH PIWSSPQPRK TTTHFQASDL QNLRSENFPK AKQPSIRIDG DKKRRLKKGD GSDNPSATGT RNSHLRSSLL TIKHALKTHQ STASSATATA SNQVVEPRKT SPISTDVHNQ EETEEKPLIH RERATAATLT GKTTPLQCRR R // ID NC003070_172 HYPOTHETICAL; PRT; 397 AA. AC NC003070_172; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[852680...853873]; Length: 1194. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 397 SINGLE EXON; p-value: NaN. SQ SEQUENCE 397 AA; 43596 MW; EDF62B5EF80D5C1B CRC64; MMMRFTKLVW CLMFLLRFGF FTEAILDPVD FLALQAIRKS LDDLPGSKFF ESWDFTSDPC GFAGVYCNGD KVISLNLGDP RAGSPGLSGR IDPAIGKLSA LTELSIVPGR IMGALPATIS QLKDLRFLAI SRNFISGEIP ASLGEVRGLR TLDLSYNQLT GTISPSIGSL PELSNLILCH NHLTGSIPPF LSQTLTRIDL KRNSLTGSIS PASLPPSLQY LSLAWNQLTG SVYHVLLRLN QLNYLDLSLN RFTGTIPARV FAFPITNLQL QRNFFFGLIQ PANQVTISTV DLSYNRFSGG ISPLLSSVEN LYLNSNRFTG EVPASFVERL LSANIQTLYL QHNFLTGIQI SPAAEIPVSS SLCLQYNCMV PPLQTPCPLK AGPQKTRPTT QCTEWRG // ID NC003070_173 HYPOTHETICAL; PRT; 1009 AA. AC NC003070_173; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[864019...863860, 863541...863459, DE 862831...862753, 862640...862488, 862393...862314, 862212...862042, DE 861909...861831, 861511...861368, 861278...861113, 859761...859397, DE 859291...859231, 859045...859000, 858849...858776, 858692...858598, DE 858039...858011, 857798...857703, 857627...857546, 857436...857278, DE 856839...856758, 856243...856055, 855942...855858, 855785...855687, DE 855600...855404, 855329...855271, 855140...855030, 854833...854802, DE 854705...854652]; Length: 3030. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 53 FIRST EXON; p-value: NaN. FT GENSCAN 54 54 AA on splice site: g/ga -> G. FT GENSCAN 55 81 INTERNAL EXON; p-value: NaN. FT GENSCAN 82 107 INTERNAL EXON; p-value: NaN. FT GENSCAN 108 108 AA on splice site: g/at -> D. FT GENSCAN 109 158 INTERNAL EXON; p-value: NaN. FT GENSCAN 159 159 AA on splice site: g/gc -> G. FT GENSCAN 160 185 INTERNAL EXON; p-value: NaN. FT GENSCAN 186 242 INTERNAL EXON; p-value: NaN. FT GENSCAN 243 268 INTERNAL EXON; p-value: NaN. FT GENSCAN 269 269 AA on splice site: g/ga -> G. FT GENSCAN 270 316 INTERNAL EXON; p-value: NaN. FT GENSCAN 317 317 AA on splice site: g/ga -> G. FT GENSCAN 318 371 INTERNAL EXON; p-value: NaN. FT GENSCAN 372 372 AA on splice site: tc/c -> S. FT GENSCAN 373 493 INTERNAL EXON; p-value: NaN. FT GENSCAN 494 494 AA on splice site: a/gt -> S. FT GENSCAN 495 513 INTERNAL EXON; p-value: NaN. FT GENSCAN 514 514 AA on splice site: ag/g -> R. FT GENSCAN 515 529 INTERNAL EXON; p-value: NaN. FT GENSCAN 530 553 INTERNAL EXON; p-value: NaN. FT GENSCAN 554 554 AA on splice site: aa/g -> K. FT GENSCAN 555 585 INTERNAL EXON; p-value: NaN. FT GENSCAN 586 586 AA on splice site: g/gt -> G. FT GENSCAN 587 595 INTERNAL EXON; p-value: NaN. FT GENSCAN 596 627 INTERNAL EXON; p-value: NaN. FT GENSCAN 628 654 INTERNAL EXON; p-value: NaN. FT GENSCAN 655 655 AA on splice site: g/tg -> V. FT GENSCAN 656 707 INTERNAL EXON; p-value: NaN. FT GENSCAN 708 708 AA on splice site: g/at -> D. FT GENSCAN 709 734 INTERNAL EXON; p-value: NaN. FT GENSCAN 735 735 AA on splice site: ag/a -> R. FT GENSCAN 736 797 INTERNAL EXON; p-value: NaN. FT GENSCAN 798 798 AA on splice site: ac/c -> T. FT GENSCAN 799 826 INTERNAL EXON; p-value: NaN. FT GENSCAN 827 859 INTERNAL EXON; p-value: NaN. FT GENSCAN 860 924 INTERNAL EXON; p-value: NaN. FT GENSCAN 925 925 AA on splice site: tg/g -> W. FT GENSCAN 926 944 INTERNAL EXON; p-value: NaN. FT GENSCAN 945 945 AA on splice site: g/aa -> E. FT GENSCAN 946 981 INTERNAL EXON; p-value: NaN. FT GENSCAN 982 982 AA on splice site: g/aa -> E. FT GENSCAN 983 992 INTERNAL EXON; p-value: NaN. FT GENSCAN 993 1009 LAST EXON; p-value: NaN. SQ SEQUENCE 1009 AA; 113579 MW; C66EEDC01574A726 CRC64; MAEETMENEE RVKLFVGQVP KHMTEIQLLT LFREFSIVNE VNIIKEKTTR APRGCCFLTC PTREDADKVI NSFHNKKTLP GNSLSYDMFQ ASSPLQVKYA DGELERLDVL DCSCNPEHKL FVGMLPKNVS ETEVQSLFSE YGTIKDLQIL RGSLQTSKGC LFLKYESKEQ AVAAMEALNG RHIMEGANVP LIVKWADTEK ERQARRLLKV QSHVSRLDPQ NPSMFGALPM SYVPPYNGYG YHVPGTYGYM LPPIQTQHAF HNVISPNQGP AGANLFIYNI PREFEDQELA ATFQPFGKVL SAKVFVDKAT GISKCFGFIS YDSQAAAQNA INTMNGCQLS GKKLKVQLKR DNGQQQQQQQ SKNPLFNGLL NSRWTPMPFK KFRRSLNFII CLPKLKECAR GQILGYKMET KAIGFHHLSP PQNIVLWLLI NLINIHLLRM NRYRPFTIPD EDWPGPRCGH TLTAVFVNNS HQLILFGGST TAVANHNSSL PEISLDGVTN SVHSFDVLTR KWTRACHAAA LYGTLILIQG GIGPSGPSDG DVYMLDMTNN KWIKFLVGGE TPSPRYGHVM DIAAQRWLVI FSGNNGMLQV VLEKMTLGDT YGLKMDSDNV WTPVPAVAPS PRYQHTAVFG GSKLHVIGGI LNRARLIDGE AVVAVLDTET GEWVDTNQPE TSASGANRQN QYQLMRRCHH AAASFGSHLY VHGGIREDML LDDYLMDEPK PLSSEPEASS FIMRRDFFLS YLEVKHLCDE VEKIFMNEPT LLQLKVPIKV FGDIHGQYGD LMRLFHEYGH PSVEGDITHI DYLFLGDYVD RGQHSLEIIM LLFALKIEYP KNIHLIRGNH ESLAMNRIYG FLTECEERMG ESYGFEAWLK INQVFDYLPL AALLEKKVLC VHGGIGRAVT IEEIENIERP AFPDTGSMVL KDILWSDPTM NDTVLGIVDN ARGEERNGLE MILRAHECVI DGFERFADGR LITVFSATNY CEEDYTDKAW MQELNIEMPP TPARGESSE // ID NC003070_174 HYPOTHETICAL; PRT; 645 AA. AC NC003070_174; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[871176...870652, 870572...870477, DE 870228...870145, 869963...869876, 869783...869659, 869428...869251, DE 867710...867630, 866977...866217]; Length: 1938. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 175 FIRST EXON; p-value: NaN. FT GENSCAN 176 207 INTERNAL EXON; p-value: NaN. FT GENSCAN 208 235 INTERNAL EXON; p-value: NaN. FT GENSCAN 236 264 INTERNAL EXON; p-value: NaN. FT GENSCAN 265 265 AA on splice site: g/aa -> E. FT GENSCAN 266 306 INTERNAL EXON; p-value: NaN. FT GENSCAN 307 365 INTERNAL EXON; p-value: NaN. FT GENSCAN 366 366 AA on splice site: t/gc -> C. FT GENSCAN 367 392 INTERNAL EXON; p-value: NaN. FT GENSCAN 393 393 AA on splice site: g/gt -> G. FT GENSCAN 394 645 LAST EXON; p-value: NaN. SQ SEQUENCE 645 AA; 73574 MW; 14E7395AD02B9F52 CRC64; MASHSSTLLS SPTFAPFSSH RLHYSPNPST LRFSRPIRNK PNLALRCSVS IEKEVPETER PFTFLRDSDD VTPSSSSSSV RARFETMIRA AQDSVCDAIE AIEGGPKFKE DVWSRPGGGG GISRVLQDGN VFEKAGVNVS VVYGVMPPEA YRAAKGSASD QKPGPVPFFA AGVSSVLHPK NPFAPTLHFN YRYFETDAPK GKKAKVSIQK QACDKFDPSF YPRFKKWCDD YFYIKHRDER RGLGGIFFDD LNDYDQEMLL SFATECANSV VPAYIPIVEK RKDMEFTEQH KAWQQLRRGR YVEFNLKLGL NSIEYGNVVM VFVSAETGRG DRRVEAIGCL HQPEGVDLAC VSHCCEKSEF ISIVSCGEEE EEEEEEETAP FIYCLSWFST HRGFDMVNFD CVIEELDEKT KEMLRVIDED ADSFAARAEM YYKKRPELIA MVEEFYRSHR SLAERYDLLR PSSVHKHGSD SESHEKSSTC DESSWSEACE THEEYAESEI DNGESKWVDE SEIDGIVEEI EPSEVVYSEG NGNSEMMKIE IERLREENKV YSEMVREKDE EKREAIRQMS VAIQMLKEEN SELKKRVTNT VVARRNKEGG DSQRKQQMWK PFEFKKIKLE GLWGKGFGNW ALPNTDSTSK ELMTL // ID NC003070_175 HYPOTHETICAL; PRT; 247 AA. AC NC003070_175; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[871875...872222, 872293...872430, DE 872473...872730]; Length: 744. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 116 FIRST EXON; p-value: NaN. FT GENSCAN 117 162 INTERNAL EXON; p-value: NaN. FT GENSCAN 163 247 LAST EXON; p-value: NaN. SQ SEQUENCE 247 AA; 28249 MW; 3085E6E2AA3C0DB6 CRC64; MKILPVGSRF CPTDLGLVRL YLRNKVERNQ SSFITTMDIH QDYPWLLPHV NNPLFNNNEW YYFVPLTERG GKILSVHRKV AARGGSEGGT WRSNDGKKEI KDGHMQKGDG LRAVFHSDDL QKVVLCRIRY KKEANVNEFG LVNHQAHQTQ VAENTNNILR NQLEMMLEGQ EDREQKEEAD LTGFADSLET MLEGQEDHEQ PEDADLTGFA DSLETMLEGH EDREQPEEAE LTGFANDDLE TMMLDGE // ID NC003070_176 HYPOTHETICAL; PRT; 436 AA. AC NC003070_176; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[873437...874297, 874384...874833]; Length: DE 1311. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 287 FIRST EXON; p-value: NaN. FT GENSCAN 288 436 LAST EXON; p-value: NaN. SQ SEQUENCE 436 AA; 48272 MW; 58BECE2A55E4F3A9 CRC64; MAAQLQPYNI IETCHISPPK GTVASTTLPL TFFDAPWLSL PLADSLFFFS YQNSTESFLQ DFVPNLKHSL SITLQHFFPY AGKLIIPPRP DPPYLHYNAG EDSLVFTVAE STETDFDQLK SDSPKDISVL HGVLPKLPPP HVSPEGIQMR PIMAMQVTIF PGAGICIGNS ATHVVADGVT FSHFMKYWMS LTKSSGKDPA TVLLPSLPIH SCRNIIKDPG EVAAGHLERF WSQNSAKHSS HVTPENMVRA TFTLSRKQID NLKSWVTEQS ENQSPVSTFV VTLAFIWESS QVHTTNTTNI LWQLYGSWYR ISQEHDLLGE KCVMAASDAI TARIKDMLSS DLLKTAPRWG QGVRKWVMSH YPTSIAGAPK LGLYDMDFGL GKPCKMEIVH IETGGSIAFS ESRDGSNGVE IGIALEKKKM DVFDSLLQKG IKKFAT // ID NC003070_177 HYPOTHETICAL; PRT; 429 AA. AC NC003070_177; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[877548...876259]; Length: 1290. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 429 SINGLE EXON; p-value: NaN. SQ SEQUENCE 429 AA; 47578 MW; 0332BB7B5322657E CRC64; MSSSYASSCT KLISLTKQLS SYANQGNHEQ ALNLFLQMHS SFALPLDAHV FSLALKSCAA AFRPVLGGSV HAHSVKSNFL SNPFVGCALL DMYGKCLSVS HARKLFDEIP QRNAVVWNAM ISHYTHCGKV KEAVELYEAM DVMPNESSFN AIIKGLVGTE DGSYRAIEFY RKMIEFRFKP NLITLLALVS ACSAIGAFRL IKEIHSYAFR NLIEPHPQLK SGLVEAYGRC GSIVYVQLVF DSMEDRDVVA WSSLISAYAL HGDAESALKT FQEMELAKVT PDDIAFLNVL KACSHAGLAD EALVYFKRMQ GDYGLRASKD HYSCLVDVLS RVGRFEEAYK VIQAMPEKPT AKTWGALLGA CRNYGEIELA EIAARELLMV EPENPANYVL LGKIYMSVGR QEEAERLRLK MKESGVKVSP GSSWCLFKD // ID NC003070_178 HYPOTHETICAL; PRT; 1137 AA. AC NC003070_178; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[883338...882433, 882421...882251, DE 881877...880698, 879430...878912, 878811...878736, 878707...878541, DE 878343...877949]; Length: 3414. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 302 FIRST EXON; p-value: NaN. FT GENSCAN 303 359 INTERNAL EXON; p-value: NaN. FT GENSCAN 360 752 INTERNAL EXON; p-value: NaN. FT GENSCAN 753 753 AA on splice site: g/ct -> A. FT GENSCAN 754 925 INTERNAL EXON; p-value: NaN. FT GENSCAN 926 926 AA on splice site: g/at -> D. FT GENSCAN 927 950 INTERNAL EXON; p-value: NaN. FT GENSCAN 951 951 AA on splice site: ct/a -> L. FT GENSCAN 952 1006 INTERNAL EXON; p-value: NaN. FT GENSCAN 1007 1007 AA on splice site: g/ga -> G. FT GENSCAN 1008 1137 LAST EXON; p-value: NaN. SQ SEQUENCE 1137 AA; 127498 MW; 26D3E1D495F9FA82 CRC64; MVGFPANQAK TVEEELPLQV VADQDPKVKT FKDSFDFPSI DSFPDFDSIT WSESNRNVED FSIEDTDFDF FEYHMEIPQE DVVTVSDVMC PENQTTVNSE SIEGKLEVFN DDMSGEVGVS SVRSESMVEV KPVLCDVMSE EVGVSSVEVE PNVCVEMSAN GGDEPAVKDS EPVVSENSRS MEGETEPVVV SDASVACPTM DLDESGLKKT DEGLACSIEV GLEKVSLAVD DDEKSDEAKG EMDSAESESE TSSSSASSSD SSSSEEEESD EDESDKEENK KEEKFEHMVV GKEDDLAGDL KRNLDEENGD DDIEDEDDDD DDDDDDDDDV NEMVAWSNDE DDDLGLQTKE PIRSKNELKV MSAQVIVEGM EKHSPLTEGS ILWITEKRTP LGLVDEIFGP VKCPYYIVRF NSESEVPEGV CQGTPVSFVA DFAQHILNIK ELQKKGYDAS GDNDEEIPDE LEFSDDEKEA EYRRMQKLEK RGMMSDQKTG NTRNKKKKNR DPGRPTSSYS GEWTKNQGSS SLSSNRSDPQ MGGPRPQMDG FPPNNAAWRP QSNQQNPYQL PPIPNQMGMQ IPFMAMQNQN QMMFQPQFNG GQMPMPGGPG GLNFFPGQAS APWPAMVGQN CFNQQFGMGR GIQQQPLPNE LSFNMFSQGL QMHPPQSQMH RPQSQMNPQF QMPPQFQPHQ QSPMNPQYQM MHRPQSPANP QFQMQAQSDV RPLQSQIPQS PSDLQSPMEP QSQGFSSGQS SERGRGFHGR GRASLIMSIT LLILLISGQF DNFFGEEDQL PVDVVSESND YFVESDFKQS MNSTADVNPE PPRLAYLISG TKGDSHRMMR TLQAVYHPRN QYVLHLDLEA PPRERMELAM SVKTDPTFRE MENVRVMAQS NLVTYKGPTM IACTLQAVSI LLRESLHWDW FLNLSASDYP LVTQDDLLYV FSNLSRNVNF IENMQLTGWK LVTPESQSIC TYLEFLSRNQ RAKSIIVDPA LYLSKKSDIA WTTQRRSLPN SFRLFTGYFH TVICNSKEFI NTAIGHDLHY IAWDSPPKQH PRSLSLKDFD NMVKSKAPFA RKFHKNDPAL DKIDKELLGR THRFAPGGWC VGSSANGNDQ CSVQGDDSVL KPGPGSERLQ ELVQTLSSEE FRRKQCS // ID NC003070_179 HYPOTHETICAL; PRT; 609 AA. AC NC003070_179; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[883782...885611]; Length: 1830. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 609 SINGLE EXON; p-value: NaN. SQ SEQUENCE 609 AA; 67403 MW; 3B04DC4A544ECA92 CRC64; MNLIILKRHF SQHASLCLTP SISSSAPTKQ SRILELCKLG QLTEAIRILN STHSSEIPAT PKLYASLLQT CNKVFSFIHG IQFHAHVVKS GLETDRNVGN SLLSLYFKLG PGMRETRRVF DGRFVKDAIS WTSMMSGYVT GKEHVKALEV FVEMVSFGLD ANEFTLSSAV KACSELGEVR LGRCFHGVVI THGFEWNHFI SSTLAYLYGV NREPVDARRV FDEMPEPDVI CWTAVLSAFS KNDLYEEALG LFYAMHRGKG LVPDGSTFGT VLTACGNLRR LKQGKEIHGK LITNGIGSNV VVESSLLDMY GKCGSVREAR QVFNGMSKKN SVSWSALLGG YCQNGEHEKA IEIFREMEEK DLYCFGTVLK ACAGLAAVRL GKEIHGQYVR RGCFGNVIVE SALIDLYGKS GCIDSASRVY SKMSIRNMIT WNAMLSALAQ NGRGEEAVSF FNDMVKKGIK PDYISFIAIL TACGHTGMVD EGRNYFVLMA KSYGIKPGTE HYSCMIDLLG RAGLFEEAEN LLERAECRND ASLWGVLLGP CAANADASRV AERIAKRMME LEPKYHMSYV LLSNMYKAIG RHGDALNIRK LMVRRGVAKT VGQSWIDAH // ID NC003070_180 HYPOTHETICAL; PRT; 199 AA. AC NC003070_180; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[887778...887725, 887635...887531, DE 887430...887368, 887048...887015, 886789...886696, 886599...886554, DE 886460...886393, 886244...886196, 885937...885851]; Length: 600. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 18 FIRST EXON; p-value: NaN. FT GENSCAN 19 53 INTERNAL EXON; p-value: NaN. FT GENSCAN 54 74 INTERNAL EXON; p-value: NaN. FT GENSCAN 75 85 INTERNAL EXON; p-value: NaN. FT GENSCAN 86 86 AA on splice site: g/gt -> G. FT GENSCAN 87 116 INTERNAL EXON; p-value: NaN. FT GENSCAN 117 117 AA on splice site: ag/a -> R. FT GENSCAN 118 132 INTERNAL EXON; p-value: NaN. FT GENSCAN 133 154 INTERNAL EXON; p-value: NaN. FT GENSCAN 155 155 AA on splice site: ac/a -> T. FT GENSCAN 156 171 INTERNAL EXON; p-value: NaN. FT GENSCAN 172 199 LAST EXON; p-value: NaN. SQ SEQUENCE 199 AA; 22192 MW; 13F76011661EC1EE CRC64; MARHDPNPFA DEEINPFANH TSVPPASNSY LKPLPPEPYD RGATVDIPLD SGNDLRAKEM ELQAKENELK RKEQKIQYVA FTTLLGPTIW LLSIIYFLAG VPGAYVLWYR PLYRATRTDS ALKFGAFFFF YVFHIAFCGF AAVAPPVIFQ GKSLTGFLPA IELLTTNAAV GQVYAYFRGS GKAAEMKREA TKSTLMRAL // ID NC003070_181 HYPOTHETICAL; PRT; 660 AA. AC NC003070_181; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[892410...890428]; Length: 1983. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 660 SINGLE EXON; p-value: NaN. SQ SEQUENCE 660 AA; 74630 MW; 5823AD8D3BB9A3F8 CRC64; MRRFYRKPFS VPLLNRPHCS SQSHCLYKNG DFLSDDSKCS PLSSSRTSVR WVFNSSSLPP PEWIEPFNDV SDLVKSNRNL LPSPWVSQIL NLLDGSASME SNLDGFCRKF LIKLSPNFVS FVLKSDEIRE KPDIAWSFFC WSRKQKKYTH NLECYVSLVD VLALAKDVDR IRFVSSEIKK FEFPMTVSAA NALIKSFGKL GMVEELLWVW RKMKENGIEP TLYTYNFLMN GLVSAMFVDS AERVFEVMES GRIKPDIVTY NTMIKGYCKA GQTQKAMEKL RDMETRGHEA DKITYMTMIQ ACYADSDFGS CVALYQEMDE KGIQVPPHAF SLVIGGLCKE GKLNEGYTVF ENMIRKGSKP NVAIYTVLID GYAKSGSVED AIRLLHRMID EGFKPDVVTY SVVVNGLCKN GRVEEALDYF HTCRFDGLAI NSMFYSSLID GLGKAGRVDE AERLFEEMSE KGCTRDSYCY NALIDAFTKH RKVDEAIALF KRMEEEEGCD QTVYTYTILL SGMFKEHRNE EALKLWDMMI DKGITPTAAC FRALSTGLCL SGKVARACKI LDELAPMGVI LDAACEDMIN TLCKAGRIKE ACKLADGITE RGREVPGRIR TVMINALRKV GKADLAMKLM HSKIGIGYER MGSVKRRVKF TTLLETCFDS // ID NC003070_182 HYPOTHETICAL; PRT; 385 AA. AC NC003070_182; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[896170...896015, 895921...895583, DE 895253...895224, 894944...894545, 894005...893773]; Length: 1158. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 52 FIRST EXON; p-value: NaN. FT GENSCAN 53 165 INTERNAL EXON; p-value: NaN. FT GENSCAN 166 175 INTERNAL EXON; p-value: NaN. FT GENSCAN 176 308 INTERNAL EXON; p-value: NaN. FT GENSCAN 309 309 AA on splice site: g/ac -> D. FT GENSCAN 310 385 LAST EXON; p-value: NaN. SQ SEQUENCE 385 AA; 42778 MW; 2C982016749BB23A CRC64; MGKRGFSDRM VSLHNLVSIP NRIIGNGKSR SSCIFTQQGR KGINQDAMIV WEDFMSKDVT FCGVFDGHGP HGHLVARKVR DSLPVKLLSL LNSIKSKQNG PIGTRASKSD SLEAEKEEST EEDKLNFLWE EAFLKSFNAM DKELRSHPNL ECFCSGCTAV TIIKQGKLRG SNSVKVWDVL SNEEVVEVVA SATSRASAAR LVVDSAVREW KLKYPTSKMD DCAVVCLFLD GRMDSETSDN EEQCFSSATN AVESDESQGA EPCLQRNVTV RSLSTDQENN SYGKVIAEAD NAEKEKTREG EQNWSGLEDR EARENESLDK KTNEPKGRNS SSSSSIFDPV LVQFNLEIGQ SQFHIFDPFV EDMTSIGNCF LPLQGKLLEL RLQPV // ID NC003070_183 HYPOTHETICAL; PRT; 414 AA. AC NC003070_183; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[898915...899340, 901485...901705, DE 901890...902076, 902169...902254, 902347...902671]; Length: 1245. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 142 FIRST EXON; p-value: NaN. FT GENSCAN 143 215 INTERNAL EXON; p-value: NaN. FT GENSCAN 216 216 AA on splice site: ag/g -> R. FT GENSCAN 217 278 INTERNAL EXON; p-value: NaN. FT GENSCAN 279 306 INTERNAL EXON; p-value: NaN. FT GENSCAN 307 307 AA on splice site: tg/g -> W. FT GENSCAN 308 414 LAST EXON; p-value: NaN. SQ SEQUENCE 414 AA; 46978 MW; 5AA93E1ED95FDBDB CRC64; MASASATATL LKPNLPPHKP TIIASSVSPP LPPPRRNHLL RRDFLSLAAT STLLTQSIQF LAPAPVSAAE DEEYIKDTSA VISKVRSTLS MQKTDPNVAD AVAELREASN SWVAKYRKEK ALLGKASFRD IYSALNAVSG HYTEIRTLNR LWHPWERQKV EFFRLSDLWD CYDEWSAYGA SVPIHVTNGE SLVQYYVPYL SAIQIFTSHS SLIRLREESE DGECEGRDPF SDSGSDESVS EEGLENNTLL HPSDRLGYLY LQYFERSAPY TRVPLMDKIN ELAQRYPGLM SLRSVDLSPA SWMSVAWCRF SLSVYYPSTR CDRSGLTLCC ITDMEPEENG GDKERVRREG EDITLLPFGM ATYKMQGDVW LSQDHDDQER LASLYSVADS WLKQLRVQHH DFNYFCNMSM THRG // ID NC003070_184 HYPOTHETICAL; PRT; 153 AA. AC NC003070_184; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[906012...905942, 905783...905666, DE 905382...905267, 905172...905118, 904419...904318]; Length: 462. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 23 FIRST EXON; p-value: NaN. FT GENSCAN 24 24 AA on splice site: ac/t -> T. FT GENSCAN 25 63 INTERNAL EXON; p-value: NaN. FT GENSCAN 64 101 INTERNAL EXON; p-value: NaN. FT GENSCAN 102 102 AA on splice site: ag/g -> R. FT GENSCAN 103 120 INTERNAL EXON; p-value: NaN. FT GENSCAN 121 153 LAST EXON; p-value: NaN. SQ SEQUENCE 153 AA; 17206 MW; 8D010237EB4015F1 CRC64; MKRGKGEKKA TKSRDGSGQV VPLTEPVVTA TGMVGTRSWI GGLFTRSNRR QDKAVDYTLS PLQESLKALW NVAFPNVHLT GLVTEQWKEM GWQGPNPSTD FRGCGFIALE NLLFSARTYP EVLQATRNQL ERELSLDDIH RIQDLPAYNL LFQ // ID NC003070_185 HYPOTHETICAL; PRT; 432 AA. AC NC003070_185; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[907698...907754, 907848...907946, DE 908032...908569, 908654...909009, 909089...909214, 909517...909639]; DE Length: 1299. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: NaN. FT GENSCAN 20 52 INTERNAL EXON; p-value: NaN. FT GENSCAN 53 231 INTERNAL EXON; p-value: NaN. FT GENSCAN 232 232 AA on splice site: g/ga -> G. FT GENSCAN 233 350 INTERNAL EXON; p-value: NaN. FT GENSCAN 351 392 INTERNAL EXON; p-value: NaN. FT GENSCAN 393 432 LAST EXON; p-value: NaN. SQ SEQUENCE 432 AA; 47796 MW; 29DE4EF7889907C2 CRC64; MALQAAYSLL PSTISIQKEG KFNASLKETT FTGSSFSNHL RAEKISTLLT IKEQRRQKPR FSTGIRAQTV TATPPANEAS PEQKKTERKG TAVITGASSG LGLATAKALA DTGKWHVIMA CRNFLKAEKA ARSVGMSKED YTVMHLDLAS LESVKQFVEN FRRTEQPLDV LVCNAAVYQP TAKEPSFTAE GFEISVGTNH LGHFLLSRLL LDDLKKSDYP SKRMIIVGSI TGNTNTLAGN VPPKANLGDL RGLASGLNGQ NSSMIDGGEF DGAKAYKDSK VCNMLTMQEL HRRYHEETGV TFASLYPGCI ATTGLFREHI PLFRLLFPPF QKYITKGYVS EEEAGKRLAQ VVSDPSLGKS GVYWSWNNNS SSFENQLSKE ASDAEKAKKL WENLTSFFSL RELPDDRNYK TVNTGENWWD IMSWIFSSVP FL // ID NC003070_186 HYPOTHETICAL; PRT; 865 AA. AC NC003070_186; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[917204...917063, 916256...915537, DE 915455...914908, 914763...914413, 913235...913165, 911813...911555, DE 911219...910859, 909881...909736]; Length: 2598. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 47 FIRST EXON; p-value: NaN. FT GENSCAN 48 48 AA on splice site: g/ga -> G. FT GENSCAN 49 287 INTERNAL EXON; p-value: NaN. FT GENSCAN 288 288 AA on splice site: g/at -> D. FT GENSCAN 289 470 INTERNAL EXON; p-value: NaN. FT GENSCAN 471 587 INTERNAL EXON; p-value: NaN. FT GENSCAN 588 610 INTERNAL EXON; p-value: NaN. FT GENSCAN 611 611 AA on splice site: at/a -> I. FT GENSCAN 612 697 INTERNAL EXON; p-value: NaN. FT GENSCAN 698 817 INTERNAL EXON; p-value: NaN. FT GENSCAN 818 818 AA on splice site: g/cg -> A. FT GENSCAN 819 865 LAST EXON; p-value: NaN. SQ SEQUENCE 865 AA; 95590 MW; B7215D4AE315A62A CRC64; MIDPIVNELA QKYAGQFKFY KLNTDESPAT PGQYGVRSIP TIMIFVNGDF KVKVALNKSM KCDSKSETVS SSSTSGSLSD PDQWTIFKDK DESEIMNPAI LCAVRAGDKV SLLKRINDDV KVTQRLVDNQ GNSILHIAAA LGHVHIVEFI ISTFPNLLQN VNLMGETTLH VAARAGSLNI VEILVRFITE SSSYDAFIAA KSKNGDTALH AALKGKHVEV AFCLVSVKHD VSFDKNNDEA SPLYMAVEAG YHELVLKMLE SSSSPSILAS MFSGKSVIHA AMKANRRDIL GIVLRQDPGL IELRNEEGRT CLSYGASMGC YEGIRYILAE FDKAASSLCY VADDDGFTPI HMAAKEGHVR IIKEFLKHCP DSRELLNNQC QNIFHVAAIA GKSKVVKYLL KLDEGKRMMN EQDINGNTPL HLATKHRYPI VVNMLTWNDG INLRALNNEG FTALDIAETM KDNNAYVLYK SSKQSPERYK DSVNTLMVTA TLVATVTFAA GLTLPGGYMS SAPHLGMAAL VNKLNFKVFL LLNNIAMCTS VVTVMALIWA QLGDALLTKK AFRLALPLLL TAVVSMMMAS VAGLTLVPIE SYRLLDLWTN TVHQQFNTYS ITYLIAYGGT TIVAVVTFVM GFTFTKIYTS SAPDLGITGL VKKAVFIVFL VCNSIPVLLS VIATMNLIWP QHLGDIPLIQ QAMKRAMSSP PSLRTFIKIK SQFPWLTRSH RLRNQREIPQ NRIFCTMDKG VVVELIRGST SWAKVVEDIV KLEKKTFPKH ESLAQTFDAE LRKKNAGLLY VDAEGDTVGY AMYSWPSSLS ASITKLAAGI AQLGERQTED LKVACSIHAH RIFLKPFNRF KRHRVKKLGR KKGPL // ID NC003070_187 HYPOTHETICAL; PRT; 679 AA. AC NC003070_187; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[917520...917525, 917668...917852, DE 918433...918601, 918868...919008, 919267...919393, 919503...919609, DE 919737...919817, 919913...919995, 920087...920263, 920313...920391, DE 920990...921062, 921363...921401, 921611...921758, 923470...923755, DE 923828...923971, 924093...924287]; Length: 2040. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 2 FIRST EXON; p-value: NaN. FT GENSCAN 3 63 INTERNAL EXON; p-value: NaN. FT GENSCAN 64 64 AA on splice site: gt/g -> V. FT GENSCAN 65 120 INTERNAL EXON; p-value: NaN. FT GENSCAN 121 167 INTERNAL EXON; p-value: NaN. FT GENSCAN 168 209 INTERNAL EXON; p-value: NaN. FT GENSCAN 210 210 AA on splice site: g/ga -> G. FT GENSCAN 211 245 INTERNAL EXON; p-value: NaN. FT GENSCAN 246 272 INTERNAL EXON; p-value: NaN. FT GENSCAN 273 299 INTERNAL EXON; p-value: NaN. FT GENSCAN 300 300 AA on splice site: cg/g -> R. FT GENSCAN 301 358 INTERNAL EXON; p-value: NaN. FT GENSCAN 359 359 AA on splice site: ag/a -> R. FT GENSCAN 360 385 INTERNAL EXON; p-value: NaN. FT GENSCAN 386 409 INTERNAL EXON; p-value: NaN. FT GENSCAN 410 410 AA on splice site: g/ga -> G. FT GENSCAN 411 422 INTERNAL EXON; p-value: NaN. FT GENSCAN 423 423 AA on splice site: g/at -> D. FT GENSCAN 424 471 INTERNAL EXON; p-value: NaN. FT GENSCAN 472 472 AA on splice site: cg/g -> R. FT GENSCAN 473 567 INTERNAL EXON; p-value: NaN. FT GENSCAN 568 615 INTERNAL EXON; p-value: NaN. FT GENSCAN 616 679 LAST EXON; p-value: NaN. SQ SEQUENCE 679 AA; 76854 MW; 144575F363310EFA CRC64; MEITPRRNRE TLEFFESEDR ETRVLNPLDS GNTENICRVE KEPVGEEAIL ISDRIEIGGR EVHVEFDVCD EGIISVEEWR KWGPVSPFPS AVKQIVDDLK VLECKLDSPI DFGGNGGKLQ GPFGAYEDKK HRATYEALDD PEKKFRFFSA RQVACRLLGS RGYLCQKDFL RQNNTGKLLW QIFGVQSATL CVFGIAEDEE IMWNEFKRAG KSQVRCLYPN HNSEVTFSVK DAFGSSASEN HVSSTTDEDK TLHFILLDGT WNNSAAMLKR LKDHAKSVWG DEDLPCISLA TGASAMHKLR PQPSWDRTCT AAAAIGLLSE LSLLPQLSSY ELDKQADAVE EALVILLDSL TGRRLRMGRT SWDSIVTTHN YVMNASSMRI NGSKELNSTV LNSLEIQETK LMVKLTQRLG EFIMEVRSCV GPDGDNAADV KLISGSGSGP SGKKRKRKRR MVANMRSSSE ILRSDNRSSN RRVFEIGEKM DPKSSDVDTV EPEVMDADLS LKRKAETIEP ADSDEGCEEE ERNSESDDQV WGFDSFEGSD YESPDEPPED DEELEFRRYT RYYHESKGFK VDKLPKHVCY GIRPLDLDAP FFKPNMTGRD YMEIMANVAI DKYNQNKTLK LDHIVRVTVQ LSYGYKAYIT FMAKESPHGE LVEYQAKAER KAWQRNIHPI FCRPARSGA // ID NC003070_188 HYPOTHETICAL; PRT; 326 AA. AC NC003070_188; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[932831...932728, 931612...931468, DE 931353...931164, 926345...926247, 925780...925551, 925208...924996]; DE Length: 981. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 34 FIRST EXON; p-value: NaN. FT GENSCAN 35 35 AA on splice site: aa/t -> N. FT GENSCAN 36 83 INTERNAL EXON; p-value: NaN. FT GENSCAN 84 146 INTERNAL EXON; p-value: NaN. FT GENSCAN 147 147 AA on splice site: g/ga -> G. FT GENSCAN 148 179 INTERNAL EXON; p-value: NaN. FT GENSCAN 180 180 AA on splice site: g/ag -> E. FT GENSCAN 181 256 INTERNAL EXON; p-value: NaN. FT GENSCAN 257 326 LAST EXON; p-value: NaN. SQ SEQUENCE 326 AA; 36427 MW; 2210AA2F606D1E51 CRC64; MRTMRCCQFA IATVLYGKIG LRFRSWDYEL EDSLNSSAQI SNHGDFNFQH KERKNPSIKN LNQIADDGSN RIAKNIEWLS SKILVQAPVH VIAVSVKITG GLRVCGVRSC VWSHVDGCGG CGCGSGGYGD LDIGGCVACV FWEIEKGNGE KRAIHWWQTY KAPKVPNEIK REAATAKKKE ICWAIALTRL LQVIYNITQE YIAGRLRFDH DDLVVHLKMK KTRGKRPGSM KLKNLKDAIN HIAVKGLLKK RESKSKSIYN VSGVDMEDGD AGGHVVLIVG YGYTKENKLF FLIQNSWGED WGVKGFGRIF IDDESKTTLV YPDGPV // ID NC003070_189 HYPOTHETICAL; PRT; 1485 AA. AC NC003070_189; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[934055...934708, 934785...935069, DE 935164...935481, 935570...935797, 936188...936571, 937064...937120, DE 937813...937823, 937864...938226, 938446...938529, 938633...938700, DE 938719...939799, 939875...940001, 940091...940366, 940452...940677, DE 940772...941067]; Length: 4458. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 218 FIRST EXON; p-value: NaN. FT GENSCAN 219 313 INTERNAL EXON; p-value: NaN. FT GENSCAN 314 419 INTERNAL EXON; p-value: NaN. FT GENSCAN 420 495 INTERNAL EXON; p-value: NaN. FT GENSCAN 496 623 INTERNAL EXON; p-value: NaN. FT GENSCAN 624 642 INTERNAL EXON; p-value: NaN. FT GENSCAN 643 645 INTERNAL EXON; p-value: NaN. FT GENSCAN 646 646 AA on splice site: aa/t -> N. FT GENSCAN 647 766 INTERNAL EXON; p-value: NaN. FT GENSCAN 767 767 AA on splice site: gg/a -> G. FT GENSCAN 768 794 INTERNAL EXON; p-value: NaN. FT GENSCAN 795 795 AA on splice site: at/g -> M. FT GENSCAN 796 817 INTERNAL EXON; p-value: NaN. FT GENSCAN 818 818 AA on splice site: g/ag -> E. FT GENSCAN 819 1177 INTERNAL EXON; p-value: NaN. FT GENSCAN 1178 1178 AA on splice site: ag/g -> R. FT GENSCAN 1179 1220 INTERNAL EXON; p-value: NaN. FT GENSCAN 1221 1312 INTERNAL EXON; p-value: NaN. FT GENSCAN 1313 1387 INTERNAL EXON; p-value: NaN. FT GENSCAN 1388 1388 AA on splice site: g/gc -> G. FT GENSCAN 1389 1485 LAST EXON; p-value: NaN. SQ SEQUENCE 1485 AA; 169716 MW; 70D98557910F4823 CRC64; MGCVNSRHRP FRRKSTTLKE SSEEKRSSRI DSSRRIDDWI QPEDGFDRLS NSGDAKVRLI ESEMFSTSRC HDHQIGKILE NPATVAHMDR VVHDQELRRA SSAVVDSDLD IDPKVVKAKL DRWNSKDSKV RLIESEKLSS SMFSEHHQIE KGVEKPEVEA SVRVVHRELK RGSSIVSPKD AERKQVAAGW PSWLVSVAGE SLVDWAPRRA NTFEKLEKIG QGTYSSVYRA RDLLHNKIVA LKKVRFDLND MESVKFMARE IIVMRRLDHP NVLKLEGLIT APVSSSLYLV FEYMDHDLLG LSSLPGVKFT EPQVKCYMRQ LLSGLEHCHS RGVLHRDIKG SNLLIDSKGV LKIADFGLAT FFDPAKSVSL TSHVVTLWYR PPELLLGASH YGVGVDLWST GCILGELYAG KPILPGKTEV EQLHKIFKLC GSPTENYWRK QKLPSSAGFK TAIPYRRKVS EMFKDFPASV LSLLETLLSI DPDHRSSADR ALESEKQYKD LRSRNDSFKS FKEERTPHGP VPDYQNMQHN RNNQTGVRIS HSGPLMSNRN MAKSTMHVKE NALPRYPPAR VNPKMLSGSV SSKTLLERQD QPVTNQRRRD RRAYNRADTM DSRHMTAPID PSWLKNIRRF KLNYPLNKPR IKNNSNFSKK ASLFPLLSQR DRLAMSLLHT FKETLKPCGS FPSSSSLRVS STQELEPSRK PPKSSLSQQL LRLDDSYFLP SKHESKISKT QVEDFDHNED DHKRNIKFDE EEVDEDDERS IEFGRPGLLE HQREGVKFMY NLYKNNHGGI LGDDMGLGKT IQTIAFLAAV YGKDGDAESD KGPVLIICPS SIIHNWESEF SRWASFFKVS VYHGSNRDMI LEKLKARGVE VLVTSFDTFR IQGPVLSGIN WEIVIADEAH RLKNEKSKLY EACLEIKTKK RIGLTGTVMQ NKISELFNLF EWVAPGSLGT REHFRDFYDE PLKLGQRATA PERFVQIADK RKQHLGSLLR KYMLRRTKEE TIGHLMMGKE DNVVFCQMSQ LQRRVYQRMI QLPEIQCLVN KDNPCACGSP LKQSECCRRI VPDGTIWSYL HRDNHDGCDS CPFCLVLPCL MKLQQISNHL ELIKPNPKDE PEKQKKDAEF VSTVFGTDID LLGGISASKS FMDLSDVKHC GKMRALEKLM ASWISKGDKI LLFSYSVRML DILEKFLIRK GYSFARLDGS TPTNLRQSLV DDFNASPSKQ VFLISTKAGG LGLNLVSANR VVIFDPNWNP SHDLQAQDRS FRYGQKRHVV VFRLLSAGSL EELVYTRQVY KQQLSNIAVA GKMETRYFEG VQDCKEFQGE LFGISNLFRD LSDKLFTSDI VELHRDSNID ENKKRSLLET GVSEDEKEEE VMCSYKPEME KPILKDLGIV YAHRNEDIIN IGETTTSTSQ RLNGDGNSAD RKKKKRKGCS EEEDMSSSNR EQKREKYKML AEFKGMEILE FSRWVLSASP FDREKLLQDF LERVK // ID NC003070_190 HYPOTHETICAL; PRT; 291 AA. AC NC003070_190; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[943319...943122, 943038...942979, DE 942819...942652, 942566...942287, 941793...941693, 941571...941503]; DE Length: 876. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 66 FIRST EXON; p-value: NaN. FT GENSCAN 67 86 INTERNAL EXON; p-value: NaN. FT GENSCAN 87 142 INTERNAL EXON; p-value: NaN. FT GENSCAN 143 235 INTERNAL EXON; p-value: NaN. FT GENSCAN 236 236 AA on splice site: g/aa -> E. FT GENSCAN 237 269 INTERNAL EXON; p-value: NaN. FT GENSCAN 270 291 LAST EXON; p-value: NaN. SQ SEQUENCE 291 AA; 32929 MW; 71DFC22176811BF8 CRC64; MEPPAKGTVT PLGFSEEDVR KAASYMEEKI GEKRVEMNRL QQYVDENDNL INLVKKLPDQ LHHNVMVPFG KMAFFPGRLI HTNECLVLLG ENYYTDRSSK QTVDFLKRRD KTLQSQIHSL KAEIEDLQTE ASFFTTTASE VADGLVEIRE EYVEEDSSAP VVIHSSEKEP CNLSEGETEE GELEDDDFAR IMARLNELEI EEELEGEDGD SRGEDPDSSV ESVEQIQHDL VKGSREGQSR GGTPQQNAET WKDFRTNAKT NVLGPQKIEP QKPEPEFDST KVTYTSLSAK R // ID NC003070_191 HYPOTHETICAL; PRT; 276 AA. AC NC003070_191; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[944771...944981, 945779...945860, DE 945951...946071, 946168...946209, 946308...946602, 947398...947477]; DE Length: 831. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 70 FIRST EXON; p-value: NaN. FT GENSCAN 71 71 AA on splice site: g/gg -> G. FT GENSCAN 72 97 INTERNAL EXON; p-value: NaN. FT GENSCAN 98 98 AA on splice site: gg/g -> G. FT GENSCAN 99 138 INTERNAL EXON; p-value: NaN. FT GENSCAN 139 152 INTERNAL EXON; p-value: NaN. FT GENSCAN 153 250 INTERNAL EXON; p-value: NaN. FT GENSCAN 251 251 AA on splice site: g/at -> D. FT GENSCAN 252 276 LAST EXON; p-value: NaN. SQ SEQUENCE 276 AA; 31315 MW; AF7181768C377B35 CRC64; MPSLKSFSAA EEEDDQLGRN SEAERFNPEA VEKEEDPDKM DEKDESGDEE DDVKRDQVEA EDEEALGEEE GIIRKTRTVM ECLHRFCREC IDKSMRLGNN ECPTCRKHCA SRRSLRDDPN FDALIAALFK NIDKFEEEEL NFRQDDEARN KQIQASIAQV SQRQSKALVK RKSVGKGTAI LSRSRRSGGG SRRRRNCRNI EQDTSEANDD DDQNKRGKDS SSDEPCERQR KKRSATQPSS SNANNNDNCA DCSVPEKANR VEEVEASNSN SVNELD // ID NC003070_192 HYPOTHETICAL; PRT; 753 AA. AC NC003070_192; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[951650...951527, 951210...951125, DE 951040...950838, 950793...950581, 950499...950403, 950310...950197, DE 950124...949882, 949620...949540, 949451...949296, 949206...949075, DE 948815...948696, 948639...948428, 948342...948087, 948008...947784]; DE Length: 2262. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 41 FIRST EXON; p-value: NaN. FT GENSCAN 42 42 AA on splice site: c/ct -> P. FT GENSCAN 43 70 INTERNAL EXON; p-value: NaN. FT GENSCAN 71 137 INTERNAL EXON; p-value: NaN. FT GENSCAN 138 138 AA on splice site: ga/c -> D. FT GENSCAN 139 208 INTERNAL EXON; p-value: NaN. FT GENSCAN 209 209 AA on splice site: gg/g -> G. FT GENSCAN 210 241 INTERNAL EXON; p-value: NaN. FT GENSCAN 242 279 INTERNAL EXON; p-value: NaN. FT GENSCAN 280 360 INTERNAL EXON; p-value: NaN. FT GENSCAN 361 387 INTERNAL EXON; p-value: NaN. FT GENSCAN 388 439 INTERNAL EXON; p-value: NaN. FT GENSCAN 440 483 INTERNAL EXON; p-value: NaN. FT GENSCAN 484 523 INTERNAL EXON; p-value: NaN. FT GENSCAN 524 593 INTERNAL EXON; p-value: NaN. FT GENSCAN 594 594 AA on splice site: ga/g -> E. FT GENSCAN 595 679 INTERNAL EXON; p-value: NaN. FT GENSCAN 680 753 LAST EXON; p-value: NaN. SQ SEQUENCE 753 AA; 86414 MW; 1E5D11E40F878D39 CRC64; MVDETYEFLA PRWFDFVNGE TEDESRRAEL WFQSALSCAP SPSVPRIKAR RSFKVEAMCN FNEAEEETLK DKEPLEPVVP IVSLQSQPSQ AKKAEVAPSK ASTVKPSRIS SKDAEVNNKT VDARFSIVLW EIVSLLSDDP TTEPIEDKEN IAPACTPKPP MQFSLGAKSV DLKKQQTARK IASLLKNPST LRPKNQSQAK GSHQKSVKGE TNLNNIASTT NLIQENQAIK RQKLDDGKSR QILNPKPATL LHKTRNGLVN TGFNLCPSVT KHTPKENRKV YVREQIAPFV STAELMKKFQ TSTRDLFVQN RPKLTLTRPK EPEFVTSQRA RPLRVKSSAE LEEEMLAKIP KFKARPVNKK EFHLQTMARA NQHAETSSIA STEVSKQHND QKHHLTEPKS PVLQTMLRAR PTIAKTTAEL EQEELEKAPK FKAKPLNKKI FESKGEMGIF CNTKKHITIP QEFHFATDER ISRPESVLDI FDKERGAEKE KKFYMELMYK KLGDVKARVP KANPYPYTTD YPVKREMVLI NLTCSNSQVP PKPEPKQCTQ PEPFQLESLV RHEEEMRRER EERRRMETEE AQKRLFKAQP VIKEDPIPVP EKVRMPLTEI QEFNLHVEHR AVERADFDHK VSILFSHFSK QKNASITSEI DESILFWNQI KEKENQYKRY REESEAAKMV EEERALKQMR KTMVPHARPV PNFNKPFLPQ KSNKGTTKAK SPNLRVIKRT ERRTMMARPT ISAATSASAG QMR // ID NC003070_193 HYPOTHETICAL; PRT; 393 AA. AC NC003070_193; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[954588...955769]; Length: 1182. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 393 SINGLE EXON; p-value: NaN. SQ SEQUENCE 393 AA; 43615 MW; 4DF703696ED090B4 CRC64; MDVVCTEHQM RKPTVEIPPR KLLLSSKSFP SDSSSPRSPR KHNWNKSNKI TSEHEEDNED NNRENKEYCY DSDSDDPYAS DHFRMFEFKI RRCTRSRSHD WTDCPFAHPG EKARRRDPRR FQYSGEVCPE FRRGGDCSRG DDCEFAHGVF ECWLHPIRYR TEACKDGKHC KRKVCFFAHS PRQLRVLPPE NVSGVSASPS PAAKNPCCLF CSSSPTSTLL GNLSHLSRSP SLSPPMSPAN KAAAFSRLRN RAASAVSAAA AAGSMNYKDV LSELVNSLDS MSLAEALQAS SSSPVTTPVS AAAAAFASSC GLSNQRLHLQ QQQPSSPLQF ALSPSTPSYL TNSPQANFFS DDFTPRRRQM NDFTAMTAVR ENTNIEDGSC GDPDLGWVND LLT // ID NC003070_194 HYPOTHETICAL; PRT; 245 AA. AC NC003070_194; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[957997...957260]; Length: 738. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 245 SINGLE EXON; p-value: NaN. SQ SEQUENCE 245 AA; 26393 MW; 54FD48B77A9D72F6 CRC64; MTTEKENVTT AVAVKDGGEK SKEVSDKGVK KRKNVTKALA VNDGGEKSKE VRYRGVRRRP WGRYAAEIRD PVKKKRVWLG SFNTGEEAAR AYDSAAIRFR GSKATTNFPL IGYYGISSAT PVNNNLSETV SDGNANLPLV GDDGNALASP VNNTLSETAR DGTLPSDCHD MLSPGVAEAV AGFFLDLPEV IALKEELDRV CPDQFESIDM GLTIGPQTAV EEPETSSAVD CKLRMEPDLD LNASP // ID NC003070_195 HYPOTHETICAL; PRT; 143 AA. AC NC003070_195; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[958758...959007, 959427...959608]; Length: DE 432. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 83 FIRST EXON; p-value: NaN. FT GENSCAN 84 84 AA on splice site: g/tt -> V. FT GENSCAN 85 143 LAST EXON; p-value: NaN. SQ SEQUENCE 143 AA; 15670 MW; 13EC6AA08C78F969 CRC64; MSTTANTNAT TAPKRKPVFV KVDQLKPGTS GHTLIVKVLE SNPVKPAIRK SSLTQQPISS PRIAECLIGD DTGCILFTAR NDQVDLMKTG ATVILRNAKI DLFKDTMRMA VDRWGRIEIT GPVSFEVNRA NNLSLVEYEV ITA // ID NC003070_196 HYPOTHETICAL; PRT; 222 AA. AC NC003070_196; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[960675...960007]; Length: 669. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 222 SINGLE EXON; p-value: NaN. SQ SEQUENCE 222 AA; 25206 MW; 4E8A390B6611E468 CRC64; MASFALKPIF CFIAVFCFIV HNVEAREGKL FFSKFTHIDR PNNKDVALSP APAPGLAQAN GRLGNGSFGP GSGMIPQTKE SWPSSSTTTD EEFEKLMATF DEEKNTKLPE AFEEEEESED SEDLNEPKDK YNNNNNNNGY TYTTNNYNDN GRGYGNEEEK QGMSDTRVME NGKYFYDTRG RNSENTPSRG YENARGNDHT NEFETMEEYY KSLEGSQEEY EP // ID NC003070_197 HYPOTHETICAL; PRT; 1018 AA. AC NC003070_197; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[966621...966445, 966156...965995, DE 965735...965598, 965505...965452, 965308...965123, 965040...964951, DE 964872...964738, 964625...964503, 964460...964294, 964190...963638, DE 963545...963468, 963320...962127]; Length: 3057. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 59 FIRST EXON; p-value: NaN. FT GENSCAN 60 113 INTERNAL EXON; p-value: NaN. FT GENSCAN 114 159 INTERNAL EXON; p-value: NaN. FT GENSCAN 160 177 INTERNAL EXON; p-value: NaN. FT GENSCAN 178 239 INTERNAL EXON; p-value: NaN. FT GENSCAN 240 269 INTERNAL EXON; p-value: NaN. FT GENSCAN 270 314 INTERNAL EXON; p-value: NaN. FT GENSCAN 315 355 INTERNAL EXON; p-value: NaN. FT GENSCAN 356 410 INTERNAL EXON; p-value: NaN. FT GENSCAN 411 411 AA on splice site: ag/t -> S. FT GENSCAN 412 595 INTERNAL EXON; p-value: NaN. FT GENSCAN 596 621 INTERNAL EXON; p-value: NaN. FT GENSCAN 622 1018 LAST EXON; p-value: NaN. SQ SEQUENCE 1018 AA; 115607 MW; 96D981862F3CF1A3 CRC64; MRNPTDESGR AVQLVYVDEN GKLKTDPEAI GALQKLKGPV AVVSLFGKAL QGKSFIWNQL LSRSIGFEVQ TLHRPCNGDI WMWIEPVKRI SEDGTEYSLV LLDVELEDAK SIPTLGLNDI ALDLSRLLEI RKQDHVGEAK DNTFFELGQF SPMFVQLMMD INSETVEGGE DVTQNSKLKK LRPLLLYGVD ALMKFVSERV RPKQRGDTIV TGPPLAGFTK AFSENVNNNI VPKISSLWQT VEELEGRRAR DTATEVYMSS LERSETPDES MLLEAHNKAV VEALTAFCES SIGNVEVKQK YKRDLWSFFA KALEDHKRVA NVEAYSRCCN AIEDMGKKLW ALPCSQDANI GDMIKYSRHS KASCRATLSH YIINIVVICM KQALDTAVAE YEASINGPMK WQKLSSFLRE SVQDILVHRR GNQMDELMSE NSKLKLQQQS LESTMNLLKK QLEGREKMNK EYQKRYESAI DDICKLSDQF KNRINDLESK CKSIHDEHSN LMEVLGSTRL EASEWKRKYE GTLDENGVSN IRVGVDASIT RCSNKLIDWK IKYENTVSEQ KAVTEKIAAM EEKLKQASTT EDGLRAEFSR VLDEKEKIIT EKAAKLATLE QQLASTRAEL KKSALKVDEC SSEAKDVRLQ MSLLNEKYES VKSASELLET ETETLKREKD ELDKKCHIHL EELEKLVLRL TNVESEALEA KKLVDSLKLE AEAARDNENK LQTSLVERCI EIDRAKSRIE ELEKVCTLNS GEGEASASKK LVDSMKMEAE ASRKNENKLQ TLLEDKCIEI DRAKSRIEGL ERDCLKLKYA ESEAATVKEL VSSMKMEVES ARSNEKKLQL SLQEKTIEID RAKGQIEALE RQKMELSETL ETRAKQNEEE VTKWQRIINA EKSKNIRENL MEKEDSFMVW DEATPMQRVK RLKVEAAVTC SGSDFAQETE EDSVSQESRK VRTMTPRRCT SSEAGATSSS TGTGHSKYTM KKLRTEILEH GFGAELVGLK NPRKRDLVQL YERTVLRK // ID NC003070_198 HYPOTHETICAL; PRT; 506 AA. AC NC003070_198; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[970057...969907, 969710...969314, DE 968567...967595]; Length: 1521. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 50 FIRST EXON; p-value: NaN. FT GENSCAN 51 51 AA on splice site: g/at -> D. FT GENSCAN 52 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 183 AA on splice site: ag/g -> R. FT GENSCAN 184 506 LAST EXON; p-value: NaN. SQ SEQUENCE 506 AA; 55827 MW; C27006B44C77D1C7 CRC64; MTTEDQTISS SGGYVQSSST TDHVDHHHHD QHESLNPPLV KKKRNLPGNP DPEAEVIALS PKTLMATNRF LCEICGKGFQ RDQNLQLHRR GHNLPWKLKQ RTSKEVRKRV YVCPEKSCVH HHPTRALGDL TGIKKHFCRK HGEKKWKCEK CAKRYAVQSD WKAHSKTCGT REYRCDCGTI FSRRDSFITH RAFCDALAEE TARLNAASHL KSFAATAGSN LNYHYLMGTL IPSPSLPQPP SFPFGPPQPQ HHHHHQFPIT TNNFDHQDVM KPASTLSLWS GGNINHHQQV TIEDRMAPQP HSPQEDYNWV FGNANNHGEL ITTSDSLITH DNNINIVQSK ENANGATSLS VPSLFSSVDQ ITQDANAASV AVANMSATAL LQKAAQMGAT SSTSPTTTIT TDQSAYLQSF ASKSNQIVED GGSDRFFASF GSNSVELMSN NNNGLHEIGN PRNGVTVVSG MGELQNYPWK RRRVDIGNAG GGGQTRDFLG VGVQTICHSS SINGWI // ID NC003070_199 HYPOTHETICAL; PRT; 671 AA. AC NC003070_199; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[983367...982692, 981744...981695, DE 981096...980986, 980893...980777, 980354...980184, 980102...979890, DE 979798...979676, 977609...977367, 976500...976420, 976158...976033, DE 970846...970742]; Length: 2016. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 225 FIRST EXON; p-value: NaN. FT GENSCAN 226 226 AA on splice site: g/tt -> V. FT GENSCAN 227 242 INTERNAL EXON; p-value: NaN. FT GENSCAN 243 279 INTERNAL EXON; p-value: NaN. FT GENSCAN 280 318 INTERNAL EXON; p-value: NaN. FT GENSCAN 319 375 INTERNAL EXON; p-value: NaN. FT GENSCAN 376 446 INTERNAL EXON; p-value: NaN. FT GENSCAN 447 487 INTERNAL EXON; p-value: NaN. FT GENSCAN 488 568 INTERNAL EXON; p-value: NaN. FT GENSCAN 569 595 INTERNAL EXON; p-value: NaN. FT GENSCAN 596 637 INTERNAL EXON; p-value: NaN. FT GENSCAN 638 671 LAST EXON; p-value: NaN. SQ SEQUENCE 671 AA; 73453 MW; 54E8079B913127CC CRC64; MATTRLTLAP LLLIAAVLLA TKATAQPAAP APEPAGPINL TAILEKGGQF TTFIHLLNIT QVGSQVNIQV NSSSEGMTVF APTDNAFQNL KPGTLNQLSP DDQVKLILYH VSPKYYSMDD LLSVSNPVRT QASGRDNGVY GLNFTGQTNQ INVSTGYVET RISNSLRQQR PLAVYVVDMV LLPGEMFGEH KLSPIAPAPK SKSGGVTDDS GSTKKAASPS DKSGSVFTVF RCMLCLVVDS HTVSVIGGLG VYALTNSLYN VDGGHRAVMF NRLTGIKEKV YPEGTHFMVP WFERPIIYDV RARPYLVEST TGSHDLQMVK IGLRVLTRPM GDRLPQIYRT LGENYSERVL PSIIHETLKA VVAQYNASQL ITQREAVSRE IRKILTERAS NFDIALDDVS ITTLTFGKEF TAAIEAKQVA AQEAERAKFI VEKAEQDRRS AVIRAQGEAK SAQLIGQAIA NNQAFITLRK IEAAREIAQT IAQSANKNED KPSSSSSSLS WLTSGSPKPT SISNKRSSNL VVMENAVVVF ARRGCCLGHV AKRLLLTHGV NPVVVEIGEE DNNNYDNIAT PSRFEPIHGR RSQSPPSTES AGENSVTKKE ISLIMLKTKK YGNFLTTVIF VETENVPLRS RGEIRDHVFN FVVVCCWILP HEHVHGYTCI AGFVLNNKRE M // ID NC003070_200 HYPOTHETICAL; PRT; 455 AA. AC NC003070_200; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[985785...986067, 986278...986540, DE 986853...987281, 987523...987915]; Length: 1368. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 94 FIRST EXON; p-value: NaN. FT GENSCAN 95 95 AA on splice site: g/ga -> G. FT GENSCAN 96 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 325 INTERNAL EXON; p-value: NaN. FT GENSCAN 326 455 LAST EXON; p-value: NaN. SQ SEQUENCE 455 AA; 50560 MW; BE24BCBD2F69B538 CRC64; MGRVSSIISF SLTLLILFNG YTAQQWPNEC QLDQLNALEP SQIIKSEGGR IEVWDHHAPQ LRCSGFAFER FVIEPQGLFL PTFLNAGKLT FVVHGRGLMG RVIPGCAETF MESPVFGEGQ GQGQSQGFRD MHQKVEHLRC GDTIATPSGV AQWFYNNGNE PLILVAAADL ASNQNQLDRN LRPFLIAGNN PQGQEWLQGR KQQKQNNIFN GFAPEILAQA FKINVETAQQ LQNQQDNRGN IVKVNGPFGV IRPPLRRGEG GQQPHEIANG LEETLCTMRC TENLDDPSDA DVYKPSLGYI STLNSYNLPI LRLLRLSALR GSIRKNAMVL PQWNVNANAA LYVTNGKAHI QMVNDNGERV FDQEISSGQL LVVPQGFSVM KHAIGEQFEW IEFKTNENAQ VNTLAGRTSV MRGLPLEVIT NGYQISPEEA KRVKFSTIET TLTHSSPMSY GRPRA // ID NC003070_201 HYPOTHETICAL; PRT; 389 AA. AC NC003070_201; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[989435...989549, 989652...989917, DE 990048...990458, 990530...990907]; Length: 1170. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 38 FIRST EXON; p-value: NaN. FT GENSCAN 39 39 AA on splice site: g/ga -> G. FT GENSCAN 40 127 INTERNAL EXON; p-value: NaN. FT GENSCAN 128 264 INTERNAL EXON; p-value: NaN. FT GENSCAN 265 389 LAST EXON; p-value: NaN. SQ SEQUENCE 389 AA; 42691 MW; 0A15565B02C9AB47 CRC64; MSPELRCAGV TVARITLQPN SIFLPAFFSP PALAYVVQGE GVMGTIASGC PETFAEVEGS SGRGGGGDPG RRFEDMHQKL ENFRRGDVFA SLAGVSQWWY NRGDSDAVIV IVLDVTNREN QLDQVPRMFQ LAGSRTQEEE QPLTWPSGNN AFSGFDPNII AEAFKINIET AKQLQNQKDN RGNIIRANGP LHFVIPPPRE WQQDGIANGI EETYCTAKIH ENIDDPERSD HFSTRAGRIS TLNSLNLPVL RLVRLNALRG YLYSGGMVLP QWTANAHTVL YVTGGQAKIQ VVDDNGQSVF NEQVGQGQII VIPQGFAVSK TAGETGFEWI SFKTNDNAYI NTLSGQTSYL RAVPVDVIKA SYGVNEEEAK RIKFSQQETM LSMTPSSSS // ID NC003070_202 HYPOTHETICAL; PRT; 565 AA. AC NC003070_202; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[991251...991745, 992379...992690, DE 993297...993396, 993398...993630, 994353...994404, 994495...994631, DE 994827...994913, 995312...995593]; Length: 1698. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 165 FIRST EXON; p-value: NaN. FT GENSCAN 166 269 INTERNAL EXON; p-value: NaN. FT GENSCAN 270 302 INTERNAL EXON; p-value: NaN. FT GENSCAN 303 303 AA on splice site: a/ta -> I. FT GENSCAN 304 380 INTERNAL EXON; p-value: NaN. FT GENSCAN 381 397 INTERNAL EXON; p-value: NaN. FT GENSCAN 398 398 AA on splice site: g/tt -> V. FT GENSCAN 399 443 INTERNAL EXON; p-value: NaN. FT GENSCAN 444 472 INTERNAL EXON; p-value: NaN. FT GENSCAN 473 565 LAST EXON; p-value: NaN. SQ SEQUENCE 565 AA; 64767 MW; 9DCD1D4B3AA6ACD8 CRC64; MSFEEEEEEE TFEHTLLVVR EVSVYKIPPR TTSGGYKCGE WLQSDKIWSG RLRVVSCKDR CEIRLEDSNS GDLFAACFVD PGRRENSVEP SLDSSRYFVL RIDDGRGKYA FIGLGFAERN EAFDFNVALS DHEKYVRREK EKETGETSES DNHIDIHPAV NHRLKENSWH LDLVEPEKIV VLMGFKEGET IRINVKPKPT TNGTGMLSAA LSGTGKPKPL ALAPPPKAAG VTRSPLPPPP NDPVASRIAS DGCKESRRNE PLSDLSQLKR YNRYITDVAK RYWYDVDLAL NETYLKDCIF DQINNVGPPT NRYLLLVSDR LRAFGDWQKW RRKMRRRVAM TPLELAVCNS HTKLRIQYSS TSTSIFQLVL AVFWLVPMDL GEVPLQGDFS AEHMIFGVEG TDPVRREKLI DLLDINLQWR MHKVSDGQKR RVQICMGLLH PFKVLLLDEV TVDLDVVARM DLLEFFKEEC DQRGATIVYA THIFDGLETW ATHLAYIQDG ELNRLSKMTD IEELKTSPNL LSVVESWLRS EIKLVKKKKK PVAPWKPSPF DNSPFRSSRH MAYYR // ID NC003070_203 HYPOTHETICAL; PRT; 1618 AA. AC NC003070_203; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[996431...996733, 996847...996938, DE 997176...997275, 997417...997488, 997603...997665, 997754...997894, DE 998419...998514, 998732...999016, 999108...999410, 999503...999679, DE 999759...1000025, 1000111...1000217, 1001486...1001746, DE 1001808...1002066, 1002142...1002383, 1002470...1002686, DE 1002782...1002911, 1003407...1003501, 1003606...1003653, DE 1003809...1003893, 1004124...1004222, 1005421...1005513, DE 1005611...1005651, 1005745...1005814, 1005925...1006073, DE 1006175...1006270, 1006427...1006497, 1006580...1006641, DE 1006733...1006796, 1006916...1007000, 1007080...1007206, DE 1007289...1007374, 1007458...1007524, 1007601...1007731, DE 1007827...1007979, 1008060...1008179]; Length: 4857. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 101 FIRST EXON; p-value: NaN. FT GENSCAN 102 131 INTERNAL EXON; p-value: NaN. FT GENSCAN 132 132 AA on splice site: ac/a -> T. FT GENSCAN 133 165 INTERNAL EXON; p-value: NaN. FT GENSCAN 166 189 INTERNAL EXON; p-value: NaN. FT GENSCAN 190 210 INTERNAL EXON; p-value: NaN. FT GENSCAN 211 257 INTERNAL EXON; p-value: NaN. FT GENSCAN 258 289 INTERNAL EXON; p-value: NaN. FT GENSCAN 290 384 INTERNAL EXON; p-value: NaN. FT GENSCAN 385 485 INTERNAL EXON; p-value: NaN. FT GENSCAN 486 544 INTERNAL EXON; p-value: NaN. FT GENSCAN 545 633 INTERNAL EXON; p-value: NaN. FT GENSCAN 634 668 INTERNAL EXON; p-value: NaN. FT GENSCAN 669 669 AA on splice site: ag/a -> R. FT GENSCAN 670 755 INTERNAL EXON; p-value: NaN. FT GENSCAN 756 756 AA on splice site: ag/a -> R. FT GENSCAN 757 842 INTERNAL EXON; p-value: NaN. FT GENSCAN 843 922 INTERNAL EXON; p-value: NaN. FT GENSCAN 923 923 AA on splice site: ag/a -> R. FT GENSCAN 924 995 INTERNAL EXON; p-value: NaN. FT GENSCAN 996 1038 INTERNAL EXON; p-value: NaN. FT GENSCAN 1039 1039 AA on splice site: a/gt -> S. FT GENSCAN 1040 1070 INTERNAL EXON; p-value: NaN. FT GENSCAN 1071 1086 INTERNAL EXON; p-value: NaN. FT GENSCAN 1087 1114 INTERNAL EXON; p-value: NaN. FT GENSCAN 1115 1115 AA on splice site: g/ag -> E. FT GENSCAN 1116 1147 INTERNAL EXON; p-value: NaN. FT GENSCAN 1148 1148 AA on splice site: c/ag -> Q. FT GENSCAN 1149 1178 INTERNAL EXON; p-value: NaN. FT GENSCAN 1179 1179 AA on splice site: g/gt -> G. FT GENSCAN 1180 1192 INTERNAL EXON; p-value: NaN. FT GENSCAN 1193 1215 INTERNAL EXON; p-value: NaN. FT GENSCAN 1216 1216 AA on splice site: a/ct -> T. FT GENSCAN 1217 1265 INTERNAL EXON; p-value: NaN. FT GENSCAN 1266 1297 INTERNAL EXON; p-value: NaN. FT GENSCAN 1298 1320 INTERNAL EXON; p-value: NaN. FT GENSCAN 1321 1321 AA on splice site: ag/a -> R. FT GENSCAN 1322 1341 INTERNAL EXON; p-value: NaN. FT GENSCAN 1342 1342 AA on splice site: g/aa -> E. FT GENSCAN 1343 1362 INTERNAL EXON; p-value: NaN. FT GENSCAN 1363 1363 AA on splice site: ag/c -> S. FT GENSCAN 1364 1391 INTERNAL EXON; p-value: NaN. FT GENSCAN 1392 1433 INTERNAL EXON; p-value: NaN. FT GENSCAN 1434 1434 AA on splice site: g/gt -> G. FT GENSCAN 1435 1462 INTERNAL EXON; p-value: NaN. FT GENSCAN 1463 1484 INTERNAL EXON; p-value: NaN. FT GENSCAN 1485 1485 AA on splice site: g/ga -> G. FT GENSCAN 1486 1528 INTERNAL EXON; p-value: NaN. FT GENSCAN 1529 1579 INTERNAL EXON; p-value: NaN. FT GENSCAN 1580 1618 LAST EXON; p-value: NaN. SQ SEQUENCE 1618 AA; 185979 MW; 93C89233B4C3A85E CRC64; MGSHGKGKRD RSGRQKKRRD ESESGSESES YTSDSDGSDD LSPPRSSRRK KGSSSRRTRR RSSSDDSSDS DGGRKSKKRS SSKDYSEEKV TEYMSKKAQK KALRAAKKLK TQSVSGYSND SNPFGDSNLT ETFVWRKKIE KDVHRGVPLE EFSVKAEKRR HRERMTEVEK VKKRREERAV EKARHEEEMA LLARERARAE FHDWEKKEEE FHFDQSKVRS EIRLREGRLK PIDVLCKHLD GSDDLDIELS EPYMVFKGLT VKDMEELRDD IKMYLDLDRA TPTRVQYWEA LIVVCDWELA EARKRDALDR ARVRGEEPPA ELLAQERGLH AGVEADVRKL LDGKTHAELV ELQLDIESQL RSGSAKVVEY WEAVLKRLEI YKAKACLKEI HAEMLRRHLH RLEQLSEGED DVEVNPGLTR VVEENEEEIN DTNLSDAEEA FSPEPVAEEE EADEAAEAAG SFSPELMHGD DREEAIDPEE DKKLLQMKRM IVLEKQKKRL KEAMDSKPAP VEDNLELKAM KAMGAMEEGD AIFGSNAEVN LDSEVYWWHD KYRPRKPKYF NRVHTGYEWN KYNQTHYDHD NPPPKIVQGY KFNIFYPDLV DKIKAPIYTI EKDGTSAETC MIRFHAGPPY EDIAFRIVNK EWEYSHKKGF KCTFERGILH LYFNFKRHRS WFHKFQPRDK PRKKDMFSGS TYGGGVTETT VPDGGNDTET ATKLPPLGGD GEALSNSTKQ KVAAAKQYIE NHYKEQMKNL NERKERRTTL EKKLADADVC EEDQTNLMKF LEKKETEYMR LQRHKMGADD FELLTMIGKG AFGEVRVVRE INTGHVFAMK KLKKSEMLRR GQVEHVRAER NLLAEVDSNC IVKLYCSFQD NEYLYLIMEY LPGGDMMTLL MRKDTLSEDE AKFYIAESVL AIESIHNRNY IHRDIKPDNL LLDRYGHLRL SDFGLCKPLD CSVIDGEDFT VGNAGSGGGS ESVSTTPKRS QQEQLEHWQK NRRMLAYSTV GTPDYIAPEV LLKKGYGMEC DWYVYTLVYF QLVNKEQKSV QWEKIYQMEA AFIPEVNDDL DTQNFEKFDE EDNQTQAPSR TGPWRKMLSS KDINFVGYTY KNFEIVNDYQ VPGIESESDS SSSGSEQQTI NRSYSNPTPR GMEPNLRQIV GENMDLVIGG KFKLGRKIGS GSFGELYLGI NVQTGEEVAV KLESVKTKHP QLHYESKLYM LLQGGTGVPN LKWYGVEGDY NVMVIDLLGP SLEDLFNYCN RKLSLKTVLM LADQLINRVE FMHTRGFLHR DIKPDNFLMG LGRKANQVYI IDFGLGKKYR DLQTHRHIPY RENKNLTGTA RYASVNTHLG VEQSRRDDLE ALGYVLMYFL KGSLPWQGLK AGTKKQKYDR ISEKKVATPI EVLCKNQPSE FVSYFRYCRS LRFDDKPDYS YLKRLFRDLF IREGYQFDYV FDWTVLKYPQ IGSSSGSSSR TRNHTTANPG LTAGASLEKQ ERIAGKETRE NRFSGAVEAF SRRHPATSTT RDRSASRNSV DGPLSKHPPG DSERPRSSSR YGSSSRRAIP SSSRPSSAGG PSDSRSSSRL VTSTGGVGTK HSRRSIKEEL RASHSQKMKP DKQESHYYTY NPKEQTDR // ID NC003070_204 HYPOTHETICAL; PRT; 469 AA. AC NC003070_204; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1010950...1009541]; Length: 1410. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 469 SINGLE EXON; p-value: NaN. SQ SEQUENCE 469 AA; 51927 MW; FE5CC01DBB80CC21 CRC64; MVAHLQPPKI IETCHISPPK GTVPSTTLPL TFFDAPWLSL PLADSLFFFS YQNSTESFLQ DFVPNLKHSL SITLQHFFPY AGKLIIPPRP DPPYLHYNDG QDSLVFTVAE STETDFDQLK SDSPKDISVL HGVLPKLPPP HVSPEGIQMR PIMAMQVTIF PGAGICIGNS ATHVVADGVT FSHFMKYWMS LTKSSGKDPA TVLLPSLPIH SCRNMIKDPG EVGAGHLERF WSQNSAKHSS HVTPENMVRA TFTLSRKQID NLKSWVTEQS ENQSPVSTFV VTLAFIWVSL IKTLVQDSET KANEEDKDEV FHLMINVDCR NRLKYTQPIP QTYFGNCMAP GIVSVKKHDL LGEKCVLAAS DAITARIKDM LSSDLLKTAP RWGQGVRKWV MSHYPTSIAG APKLGLYDMD FGLGKPCKME IVHIETGGSI AFSESRDGSN GVEIGIALEK KKMDVFDSIL QQGIKKFAT // ID NC003070_205 HYPOTHETICAL; PRT; 119 AA. AC NC003070_205; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1012232...1012164, 1012012...1011941, DE 1011857...1011777, 1011688...1011653, 1011556...1011501, DE 1011432...1011387]; Length: 360. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 23 FIRST EXON; p-value: NaN. FT GENSCAN 24 47 INTERNAL EXON; p-value: NaN. FT GENSCAN 48 74 INTERNAL EXON; p-value: NaN. FT GENSCAN 75 86 INTERNAL EXON; p-value: NaN. FT GENSCAN 87 104 INTERNAL EXON; p-value: NaN. FT GENSCAN 105 105 AA on splice site: ag/t -> S. FT GENSCAN 106 119 LAST EXON; p-value: NaN. SQ SEQUENCE 119 AA; 12733 MW; 2DE2EA31C24303DD CRC64; MHAHTSVAAG MQGATKAMAA MSKNMDPAKQ AKVMREFQKQ SAQMDMTTEM MSDSIDDALD NDEAEDETED LTNQVLDEIG IDIASQLSSA PKGKIGGKKA EDVGSSGIDE LEKRLAALR // ID NC003070_206 HYPOTHETICAL; PRT; 425 AA. AC NC003070_206; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1014084...1014236, 1014333...1014542, DE 1015427...1015517, 1015683...1015824, 1016078...1016225, DE 1016339...1016448, 1016794...1016950, 1017057...1017182, DE 1017331...1017471]; Length: 1278. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 51 FIRST EXON; p-value: NaN. FT GENSCAN 52 121 INTERNAL EXON; p-value: NaN. FT GENSCAN 122 151 INTERNAL EXON; p-value: NaN. FT GENSCAN 152 152 AA on splice site: g/ct -> A. FT GENSCAN 153 198 INTERNAL EXON; p-value: NaN. FT GENSCAN 199 199 AA on splice site: ag/g -> R. FT GENSCAN 200 248 INTERNAL EXON; p-value: NaN. FT GENSCAN 249 284 INTERNAL EXON; p-value: NaN. FT GENSCAN 285 285 AA on splice site: tg/g -> W. FT GENSCAN 286 337 INTERNAL EXON; p-value: NaN. FT GENSCAN 338 379 INTERNAL EXON; p-value: NaN. FT GENSCAN 380 425 LAST EXON; p-value: NaN. SQ SEQUENCE 425 AA; 49110 MW; 32277733CF2F6704 CRC64; MDIDGVDDDL HILDPELLQL PGLSPSPLKP TSLIADDLFS QWLSLPETAT LVKSLIDDAK SGTPTNKSKN LPSVFLSSST PPLSPRSSSG SPRFSRQRTS PPSLHSPLRS LKEPKRQLIP QADFKPVLDE LLATHPGLEF LRTISEFQER YAETVIYRIF YYINRSGTGC LTLRELRRGN LIAAMQQLDE EDDINKIIRY FSYEHFYVIY CKFWELDGDH DCFIDKDNLI KYGNNALTYR IVDRIFSQIP RKFTSKVEGK MSYEDFVYFI LAEEDKSSEP SLEYWFKCVD LDGNGVITSN EMQFFFEEQL HRMECITQEA VLFSDILCQI IDMIGPEKEN CITLQDLKGS KLSANVFNIL FNLNKFMAFE TRDPFLIRQE REDPNLTEWD RFAQREYARL SMEEDVDEVS NGSADVWDEP LEPPF // ID NC003070_207 HYPOTHETICAL; PRT; 270 AA. AC NC003070_207; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1018236...1019048]; Length: 813. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 270 SINGLE EXON; p-value: NaN. SQ SEQUENCE 270 AA; 30587 MW; 36BC37F954EFF11E CRC64; MASFKLMSSS NSDLSRRNSS SASSSPSIRS SHHLRPNPHA DHSRISFAYG GGVNDYTFAS DSKPFEMAID VDRSIGDRNS VNNGKSVDDV WKEIVSGEQK TIMMKEEEPE DIMTLEDFLA KAEMDEGASD EIDVKIPTER LNNDGSYTFD FPMQRHSSFQ MVEGSMGGGV TRGKRGRVMM EAMDKAAAQR QKRMIKNRES AARSRERKQA YQVELETLAA KLEEENEQLL KEIEESTKER YKKLMEVLIP VDEKPRPPSR PLSRSHSLEW // ID NC003070_208 HYPOTHETICAL; PRT; 599 AA. AC NC003070_208; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1022911...1022830, 1022745...1022492, DE 1021975...1021799, 1021442...1021286, 1021219...1020981, DE 1020817...1020654, 1020563...1020512, 1020413...1020186, DE 1020091...1019905, 1019778...1019603, 1019393...1019310]; Length: 1800. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 27 FIRST EXON; p-value: NaN. FT GENSCAN 28 28 AA on splice site: g/at -> D. FT GENSCAN 29 112 INTERNAL EXON; p-value: NaN. FT GENSCAN 113 171 INTERNAL EXON; p-value: NaN. FT GENSCAN 172 223 INTERNAL EXON; p-value: NaN. FT GENSCAN 224 224 AA on splice site: g/gg -> G. FT GENSCAN 225 303 INTERNAL EXON; p-value: NaN. FT GENSCAN 304 357 INTERNAL EXON; p-value: NaN. FT GENSCAN 358 358 AA on splice site: gg/g -> G. FT GENSCAN 359 375 INTERNAL EXON; p-value: NaN. FT GENSCAN 376 451 INTERNAL EXON; p-value: NaN. FT GENSCAN 452 513 INTERNAL EXON; p-value: NaN. FT GENSCAN 514 514 AA on splice site: g/gt -> G. FT GENSCAN 515 572 INTERNAL EXON; p-value: NaN. FT GENSCAN 573 599 LAST EXON; p-value: NaN. SQ SEQUENCE 599 AA; 68211 MW; 6B3554A0A910D6DF CRC64; MKDFKVLGGY SPALIGNIKE DASCIFEDST RSRDIPRLPK SSRERSSTLG GSPTKERSRR RGSSQYNGNP KVSRRSSKES SAIPQDGGFN KKSRRKKSKD CVDGGSRRSS RRRERERERE RELFFYIVFE PVPDSFGASL LHSQTLTMSM ASLYRRSLSP PAIDFASFEG KQIFNEALQK GTMEGFFGLI SYFQTQSEPA FCGLASLSMV LNSLSIDPGR KWKGPWRWFD ESMLECCEPL EIVKDKGISF GKVVCLAHSS GAKVEAFRTN QSTIDDFRKY VVKCSTSDNC HMISTYHRQV LKQTGTGHFS PIGGYNAERD MALILDVARF KYPPHWVPLK LLWDAMDSID QSTGRRRGFM LISRPHREPG LLYTLSCKDE SWISIAKYLK EDVPRLVSSQ HVDTIERILY VVFKSLPANF NQFIKWMAEI RRTEDVNQNL SSEEKSRLKL KQELLKQVQE TKLFKHVDKF LSSVYEDNLP YVAAKVYCDG DEILSGYESD ESCCKETCVK CIKGLGEEKV TVVAYPSGND VFTALLLALP PQTWSGIKDQ SLLQEMKQLI SMVSHPTLLQ QEVLHLRRQL EMLKRCQENK EDEELSAPA // ID NC003070_209 HYPOTHETICAL; PRT; 704 AA. AC NC003070_209; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1023955...1024024, 1025232...1025395, DE 1025439...1025567, 1025864...1027615]; Length: 2115. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 23 FIRST EXON; p-value: NaN. FT GENSCAN 24 24 AA on splice site: c/tg -> L. FT GENSCAN 25 78 INTERNAL EXON; p-value: NaN. FT GENSCAN 79 121 INTERNAL EXON; p-value: NaN. FT GENSCAN 122 704 LAST EXON; p-value: NaN. SQ SEQUENCE 704 AA; 77918 MW; 0ABEB2DF56B8D55B CRC64; MPKQREKPKI RQQESPRGPR TMNLHNLLVL TCNWNDNKLI KKSGGRATGD QGNTIDSFSG ENSFENTHIQ TGDVTALRFS EMSLEKREKV LQRWNTQWYN PLARIGFMMI KAIFLFYYFT WTNENSENPV WDAINYSVEI GENEDMEQKE RPLDEGIIET AKEDEMTIKQ RMINKGLKVT EDRERDTYKI ECDAVVVGSG CGGGVAAAIL AKSGLRVVVI EKGNYFAPRD YSALEGPSMF ELFESNSLMM THDGRFRFMA GSTVGGGSVV NWAASLKTPD AIIEEWSVHR GISIYSSEKY KAAMGIVCKR LGVTEKIIRE GFQNQILRKG CEKLGLDVTI VPRNSTEKHY CGSCSYGCPT GEKRGTDSTW LVDAVNNNAV ILTQCKAEKL ILADNDANKR EESGRRKRCL GVAASLSHQT RKKLQINAKV TIVACGSLKT PGLLASSGLK NSNISRGLHI HPIMMAWGYF PEKNSELEGA AHEGEIVTSL HYVHPMDSTT PNITLETPAI GPGTFAALTP WVSGSDMKER MAKYARTAHI FAMVRDEGVG EVKGDIVKYR LTKADEENLT IGLKQALRIL VAAGAAEVGT YRSDGQRMKC DGIKQKDLEA FLDTVNAPPG VVSMSKHWTQ SFTAHQIGCC RMGATEKEGA IDGKGESWEA EDLYVCDASV LPTALGVNPM ITVQSTAYCI SNRIAELMKK RKKD // ID NC003070_210 HYPOTHETICAL; PRT; 152 AA. AC NC003070_210; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1028872...1028414]; Length: 459. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 152 SINGLE EXON; p-value: NaN. SQ SEQUENCE 152 AA; 16854 MW; 3F8F039E381ADBEB CRC64; MNTKTMRLPP RRVLTADKRK ERDAFISSVT DNPPEIAKFP SPPPKLVPPP VNPISKKSST AAAEPIGSNQ LMLAGYLSHE YLTQGTLFGE QWNQARAQAE SSKIKPSHTV EPAEECEPKR KRYREVANLL RSDGAQLPGI VNPAQLARFL KL // ID NC003070_211 HYPOTHETICAL; PRT; 534 AA. AC NC003070_211; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1036127...1035885, 1035517...1035382, DE 1035283...1035139, 1035054...1034885, 1034777...1034701, DE 1034122...1033815, 1033536...1033437, 1033280...1033186, DE 1032858...1032807, 1032652...1032581, 1032438...1032232]; Length: 1605. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 81 FIRST EXON; p-value: NaN. FT GENSCAN 82 126 INTERNAL EXON; p-value: NaN. FT GENSCAN 127 127 AA on splice site: g/gt -> G. FT GENSCAN 128 174 INTERNAL EXON; p-value: NaN. FT GENSCAN 175 175 AA on splice site: aa/g -> K. FT GENSCAN 176 231 INTERNAL EXON; p-value: NaN. FT GENSCAN 232 232 AA on splice site: g/ga -> G. FT GENSCAN 233 257 INTERNAL EXON; p-value: NaN. FT GENSCAN 258 359 INTERNAL EXON; p-value: NaN. FT GENSCAN 360 360 AA on splice site: tt/g -> L. FT GENSCAN 361 393 INTERNAL EXON; p-value: NaN. FT GENSCAN 394 424 INTERNAL EXON; p-value: NaN. FT GENSCAN 425 425 AA on splice site: ag/g -> R. FT GENSCAN 426 442 INTERNAL EXON; p-value: NaN. FT GENSCAN 443 466 INTERNAL EXON; p-value: NaN. FT GENSCAN 467 534 LAST EXON; p-value: NaN. SQ SEQUENCE 534 AA; 60097 MW; 7BB15C142871B5C3 CRC64; MGANSKSVTA SFTVIAVFFL ICGGRTAVED ETEFHGDYSK LSGIIIPGFA STQLRAWSIL DCPYTPLDFN PLDLVWLDTT KLLSAVNCWF KCMVLDPYNQ TDHPECKSRP DSGLSAITEL DPGYITGPLS TVWKEWLKWC VEFGIEANAI VAVPYDWRLS PTKLEERDLY FHKLKLTFET ALKLRGGPSI VFAHSMGNNV FRYFLEWLRL EIAPKHYLKW LDQHIHAYFA VGAPLLGSVE AIKSTLSGVT FGLPVSERLD SVYATVTLTK VSPRMFTLIF RSFDVYPSVT ETALVNMTSM ECGLPTLLSF TARELADGTL FKAIEDYDPD SKRMLHQLKK YVPFFVIRNI AHRSSLAGFL LYHDDPVFNP LTPWERPPIK NVFCIYGAHL KTEVGYYFAP SGKPYPDNWI ITDIIYETEG SLVSRSGTVV DGNAGPITGD ETVPYHSLSW CKNWLGPKVN ITMAPQILIG KIKQQPEHDG SDVHVELNVD HEHGSDIIAN MTKAPRVKYI TFYEDSESIP GKRTAVWELD KSGY // ID NC003070_212 HYPOTHETICAL; PRT; 1004 AA. AC NC003070_212; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1036641...1036691, 1036912...1037013, DE 1037177...1037440, 1037530...1037745, 1037923...1038174, DE 1038275...1038375, 1038518...1038704, 1039223...1039388, DE 1039487...1039605, 1039694...1039981, 1040575...1040679, DE 1040873...1040973, 1041048...1042110]; Length: 3015. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 17 FIRST EXON; p-value: NaN. FT GENSCAN 18 51 INTERNAL EXON; p-value: NaN. FT GENSCAN 52 139 INTERNAL EXON; p-value: NaN. FT GENSCAN 140 211 INTERNAL EXON; p-value: NaN. FT GENSCAN 212 295 INTERNAL EXON; p-value: NaN. FT GENSCAN 296 328 INTERNAL EXON; p-value: NaN. FT GENSCAN 329 329 AA on splice site: tg/g -> W. FT GENSCAN 330 391 INTERNAL EXON; p-value: NaN. FT GENSCAN 392 446 INTERNAL EXON; p-value: NaN. FT GENSCAN 447 447 AA on splice site: t/gg -> W. FT GENSCAN 448 486 INTERNAL EXON; p-value: NaN. FT GENSCAN 487 582 INTERNAL EXON; p-value: NaN. FT GENSCAN 583 617 INTERNAL EXON; p-value: NaN. FT GENSCAN 618 650 INTERNAL EXON; p-value: NaN. FT GENSCAN 651 651 AA on splice site: tg/t -> C. FT GENSCAN 652 1004 LAST EXON; p-value: NaN. SQ SEQUENCE 1004 AA; 112626 MW; 5B6E5D6C7A888690 CRC64; MGSSSPEARA RAQVPSMILI FLEIICTVHV YTNRRKLNRD VLSANLNIPK RIPNDCNYKN DALNNSNSPK HGESEDSEMT DKDVSKRSGG TDSSSRDGSP LPTSEESDPR PKHQDWTEKQ LSDHLLLYEF ESEYDAANHT PESYTEQAAK NVRDITASEQ PSNAARKRIC GDSFIQESSP NPKTQDPTLL RLMESLRSDD PTDYVKAQNH QLPKSHTEQD SKRKRDITAS DAMENHLKVP KRENNLMQKS ADIDCNGKCS ANSDDQLSEK ISKALEQTSS NITICGFCQS ARVSEATGEM LHYSRGRPVD GDDIFRSNVI HVHSACIEWA PQVYYEGDTV KNLKAELARG MKIKCTKCSL KGAALGCFVK SCRRSYHVPC AREISRCRWD YKLMESLAVR FNATISRYWN PSVTHVIAST DEKGACTRTL KVLMGILNGK WIINAAWMKA SLKASQPVDE EPFEIQIDTQ GCQDGPKTAR LRAETNKPKL FEGLKFYFFG DFYKGYKEDL QNLVKVAGGT ILNTEDELGA ESSNNVNDQR SSSIVVYNID PPHGCALGEE VTIIWQRAND AEALASQTGS RLTLRIFGTM GCFSGCFGGR KNRRRQRRRD SDEARDNKLS VETAEPHHLN DRVHIVEEIP KASVIPITEI CDEAEEKCSP STISRKRVTF DSKVKTYEHV VSEESVELSE EKNEEVESEK RSLKSSKTDD QIIEVASNSS GSYPENHRYK NCRESDDDIE EDEFDCSDSD LDEDEEYYSD VGFSEDSLHN PTKEVYTQDI GDKTEEIDSK LRRSNETVRD GNHYDGQGVL NPVENLTQWK SAKSKGRTKQ KQSQKENSNF IADQEEKRDS SSFGTDPQID DITLSVKPKC RIEPKKLRNQ ELAVDASLST WLSTSESGSE CNSASMYTLT PEKLKSTSCY SKPLRINHDD RPVLCALTLE DIKQFSATST PRKSPSKSPD ETPIIGTVGG YWGNRSKAID CGSASSFKGI PNTSSKYREV RINQ // ID NC003070_213 HYPOTHETICAL; PRT; 219 AA. AC NC003070_213; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1043818...1043391, 1043162...1042931]; DE Length: 660. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 142 FIRST EXON; p-value: NaN. FT GENSCAN 143 143 AA on splice site: gg/a -> G. FT GENSCAN 144 219 LAST EXON; p-value: NaN. SQ SEQUENCE 219 AA; 25018 MW; E4AC58F4608C6ADD CRC64; MDRTMFLSLT IASLLVGVVS AGDWNILNQL RGLGSSSSQN GIVSKGIKTD LKGYCESWRI NVEVHNIRKF DVVPQECVSH IKDYMTSSQY KDDVARTVDE VILHFGSMCC SKSKCDGMDA WIFDIDDTLL STIPYHKKNG FFGGEKLNST KFEDWIQKKK APAVPHMKKL YHDIRERGIK IFLISSRKEY LRSATVDNLI QAGYYGWSNL MLRYNYTGY // ID NC003070_214 HYPOTHETICAL; PRT; 890 AA. AC NC003070_214; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1051473...1051300, 1051103...1050845, DE 1049725...1049617, 1049208...1049013, 1048914...1048852, DE 1048757...1048719, 1048629...1048498, 1048421...1048151, DE 1048004...1046986, 1046876...1046623, 1046358...1046328, DE 1046194...1046098, 1045994...1045966]; Length: 2673. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 58 FIRST EXON; p-value: NaN. FT GENSCAN 59 144 INTERNAL EXON; p-value: NaN. FT GENSCAN 145 145 AA on splice site: g/tt -> V. FT GENSCAN 146 180 INTERNAL EXON; p-value: NaN. FT GENSCAN 181 181 AA on splice site: tt/a -> L. FT GENSCAN 182 246 INTERNAL EXON; p-value: NaN. FT GENSCAN 247 267 INTERNAL EXON; p-value: NaN. FT GENSCAN 268 280 INTERNAL EXON; p-value: NaN. FT GENSCAN 281 324 INTERNAL EXON; p-value: NaN. FT GENSCAN 325 414 INTERNAL EXON; p-value: NaN. FT GENSCAN 415 415 AA on splice site: g/at -> D. FT GENSCAN 416 754 INTERNAL EXON; p-value: NaN. FT GENSCAN 755 838 INTERNAL EXON; p-value: NaN. FT GENSCAN 839 839 AA on splice site: ag/a -> R. FT GENSCAN 840 849 INTERNAL EXON; p-value: NaN. FT GENSCAN 850 881 INTERNAL EXON; p-value: NaN. FT GENSCAN 882 882 AA on splice site: a/ag -> K. FT GENSCAN 883 890 LAST EXON; p-value: NaN. SQ SEQUENCE 890 AA; 100050 MW; C6320E23358E94D0 CRC64; MLVASRTLYQ SVNAIFESSF NVRELSAFDF GVFRLHTKRG GYRVRDDEPK RKATRIVLKI GAGKGDSSIL AKISNYDIVS QGRRAACDAV YVSKKLLKST GKAAWIAGTT FLILAVPLIL ELEQDHRLGE IDFEQASLLG TPPVVFHYIL FSEESRKSGS LTTESHPLKT LNQFQSWFFC LNLLKMAPNL RIKKACDAMK LLGISETKTR AFLRKLLKTY ENNWDFIEED AYKVLLDAIF DEADAQSTEK NKKEEEKKKK EEEKKSRSVA TSRGRRKAPE PLVQDEEDDM DEDEFPLKRR LRSRRGRASS SSSSSSSYNN EDLKTQPEEE DEDDGVTELP PLKRYVRRNG ERGLAMTVYN NASPSSSSRL SMEPEEVPPM VLLPAHPMET KVSEASALVI LNDEPNIDHK PVISDTGNCS APMLEMGKSN IHVQEWDWET KDILNDTTAM DVSPSSAIGE SSEHKVAAAS VELASSTSGE AKICLSFAPA TGETTNLHLP SMEDLRRAME EKCLKSYKIV HPEFSVLGFM KDMCSCYIDL AKNSTSQLLE TETVCDMSKA GDESGAVGIS MPLVVVPECE ISGDGWKAIS NMKDITAGEE NVEIPWVNEI NEKVPSRFRY MPHSFVFQDA PVIFSLSSFS DEQSCSTSCI EDCLASEMSC NCAIGVDNGF AYTLDGLLKE EFLEARISEA RDQRKQVLRF CEECPLERAK KVEILEPCKG HLKRGAIKEC WFKCGCTKRC GNRVVQRGMH NKLQVFFTPN GKGWGLRTLE KLPKGAFICE YIGEILTIPE LYQRSFEDKP TLPVILDAHW GSEERLEGDK ALCLDGMFYG NISRFLNHRD IEAMEELAWD YGIDFNDNDS LMKPFDCLCG SRFCRNKKRS TKTMQILNKA // ID NC003070_215 HYPOTHETICAL; PRT; 1339 AA. AC NC003070_215; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1051820...1052036, 1052159...1052268, DE 1052545...1052774, 1053326...1053493, 1053806...1054199, DE 1054662...1054805, 1054935...1055087, 1055173...1055289, DE 1055477...1055707, 1055797...1056102, 1056186...1056287, DE 1056433...1056539, 1057202...1057329, 1057634...1059246]; Length: 4020. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 72 FIRST EXON; p-value: NaN. FT GENSCAN 73 73 AA on splice site: g/gc -> G. FT GENSCAN 74 109 INTERNAL EXON; p-value: NaN. FT GENSCAN 110 185 INTERNAL EXON; p-value: NaN. FT GENSCAN 186 186 AA on splice site: ag/g -> R. FT GENSCAN 187 241 INTERNAL EXON; p-value: NaN. FT GENSCAN 242 242 AA on splice site: ag/t -> S. FT GENSCAN 243 373 INTERNAL EXON; p-value: NaN. FT GENSCAN 374 421 INTERNAL EXON; p-value: NaN. FT GENSCAN 422 472 INTERNAL EXON; p-value: NaN. FT GENSCAN 473 511 INTERNAL EXON; p-value: NaN. FT GENSCAN 512 588 INTERNAL EXON; p-value: NaN. FT GENSCAN 589 690 INTERNAL EXON; p-value: NaN. FT GENSCAN 691 724 INTERNAL EXON; p-value: NaN. FT GENSCAN 725 759 INTERNAL EXON; p-value: NaN. FT GENSCAN 760 760 AA on splice site: ac/g -> T. FT GENSCAN 761 802 INTERNAL EXON; p-value: NaN. FT GENSCAN 803 803 AA on splice site: g/gt -> G. FT GENSCAN 804 1339 LAST EXON; p-value: NaN. SQ SEQUENCE 1339 AA; 149360 MW; 4A56DC2F4D3037F5 CRC64; MVSEGYTSAP YGDYNASAAT VESTGQETAP IVDASHSVNN DSLVNGTAPV ENGSATDNVA VTAPAAEHGD NTGSTLSTEE ERLWNIVRAN SLEFNAWTAL IDETERIAQD NIAKIRKVYD AFLAEFPLCY GYWKKFADHE ARVGAMDKVV EVYERAVLGV TYSVDIWLHY CTFAINTYGD PETIRRLFER ALVYVGTDFL SSPLWDKYIE YEYMQQDWSR VALIYTRILE NPIQNLDRYF SSFKELAETR PLSELRSAEE SAAAAVAVAG DASESAASES GEKADEGRSQ VDGSTEQSPK LESASSTEPE ELKKYVGIRE AMYIKSKEFE SKIIGYEMAI RRPYFHVRPL NVAELENWHN YLDFIERDGD FNKVVKLYER CVVTCANYPE YWIRYVTNME ASGSADLAEN ALARATQVFV KKQPEIHLFA ARLKEQNGDI AGARAAYQLV HSEISPGLLE AVIKHANMEY RLGNLDDAFS LYEQVIAVEK GKEHSTILPL LYAQYSRFSY LVSRDAEKAR RIIVEALDHV QPSKPLMEAL IHFEAIQPPP REIDYLEPLV EKVIKPDADA QNIASSTERE ELSLIYIEFL GIFGDVKSIK KAEDQHVKLF YPHRSTSELK KRSADDFLAS DRTKMAKTYN GTPPAQPVSN AYPNAQAQWS GGYAAQPQTW PPAQAAPAQP QQWNPAYGQQ AAYGAYGGYP AGYTAPQAPT PVPQAAAYGA YPAQTYPTQS YAPPVAAAAP AAAPVQQPAA AVAPQAYYNT VSRDQIRMLG YKCLHWNNLI DLPPLKDPET FSLPSSIPHW PPGQGFGSGT INLGKLQVIK ITDFEFIWRY RSTEKKKNIS FYKPKGLLPK DFHCLGHYCQ SDSHPLRGYV LAARDLVDSL EQVEKPALVE PVDFTLVWSS NDSAENECSS KSECGYFWLP QPPEGYRSIG FVVTKTSVKP ELNEVRCVRA DLTDICEPHN VIVTAVSESL GVPLFIWRTR PSDRGMWGKG VSAGTFFCRT RLVAAREDLG IGIACLKNLD LSLHAMPNVD QIQALIQHYG PTLVFHPGET YLPSSVSWFF KNGAVLCEKG NPIEEPIDEN GSNLPQGGSN DKQFWIDLPC DDQQRDFVKR GNLESSKLYI HIKPALGGTF TDLVFWIFCP FNGPATLKLG LVDISLISIG QHVCDWEHFT LRISNFSGEL YSIYLSQHSG GEWIEAYDLE IIPGSNKAVV YSSKHGHASF PRAGTYLQGS TMLGIGIRND TARSELLVDS SSRYEIIAAE YLSGNSVLAE PPWLQYMREW GPKVVYDSRE EIERLVNRFP RTVRVSLATV LRKLPVELSG EEGPTGPKEK NNWYGDERC // ID NC003070_216 HYPOTHETICAL; PRT; 267 AA. AC NC003070_216; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1059850...1060079, 1060323...1060633, DE 1060731...1060863, 1060956...1061017, 1061197...1061264]; Length: 804. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 76 FIRST EXON; p-value: NaN. FT GENSCAN 77 77 AA on splice site: ag/a -> R. FT GENSCAN 78 180 INTERNAL EXON; p-value: NaN. FT GENSCAN 181 181 AA on splice site: a/ga -> R. FT GENSCAN 182 224 INTERNAL EXON; p-value: NaN. FT GENSCAN 225 225 AA on splice site: ca/g -> Q. FT GENSCAN 226 245 INTERNAL EXON; p-value: NaN. FT GENSCAN 246 246 AA on splice site: g/gg -> G. FT GENSCAN 247 267 LAST EXON; p-value: NaN. SQ SEQUENCE 267 AA; 28888 MW; A870D84D44718DD8 CRC64; MIGLPAEEDE NAAHSSEDSS CPDESVSETE LDLALGLSIG RRKVRSSLSS SSSSLTRESG TKRSADSSPA AASNATRQVA VGWPPLRTYR INSLVNQAKS LATEGGLSSG IQKETTKSVV VAAKNDDACF IKSSRTSMLV KVTMDGVIIG RKVDLNALDS YAALEKTLDL MFFQIPSPVT RSNTQGYKTI KETCTSKLLD GSSEYIITYQ DKDGDWMLVG DVPWQMFLGS VTRLRIMKTS IGAGVGFFTL LVPDPIYHYN NKGMHLD // ID NC003070_217 HYPOTHETICAL; PRT; 775 AA. AC NC003070_217; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1063783...1061456]; Length: 2328. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 775 SINGLE EXON; p-value: NaN. SQ SEQUENCE 775 AA; 83777 MW; 7740B20397C7C211 CRC64; MEPKPFFLCI IFLLFCSSSS EILQKQTYIV QLHPNSETAK TFASKFDWHL SFLQEAVLGV EEEEEEPSSR LLYSYGSAIE GFAAQLTESE AEILRYSPEV VAVRPDHVLQ VQTTYSYKFL GLDGFGNSGV WSKSRFGQGT IIGVLDTGVW PESPSFDDTG MPSIPRKWKG ICQEGESFSS SSCNRKLIGA RFFIRGHRVA NSPEESPNMP REYISARDST GHGTHTASTV GGSSVSMANV LGNGAGVARG MAPGAHIAVY KVCWFNGCYS SDILAAIDVA IQDKVDVLSL SLGGFPIPLY DDTIAIGTFR AMERGISVIC AAGNNGPIES SVANTAPWVS TIGAGTLDRR FPAVVRLANG KLLYGESLYP GKGIKNAGRE VEVIYVTGGD KGSEFCLRGS LPREEIRGKM VICDRGVNGR SEKGEAVKEA GGVAMILANT EINQEEDSID VHLLPATLIG YTESVLLKAY VNATVKPKAR IIFGGTVIGR SRAPEVAQFS ARGPSLANPS ILKPDMIAPG VNIIAAWPQN LGPTGLPYDS RRVNFTVMSG TSMSCPHVSG ITALIRSAYP NWSPAAIKSA LMTTADLYDR QGKAIKDGNK PAGVFAIGAG HVNPQKAINP GLVYNIQPVD YITYLCTLGF TRSDILAITH KNVSCNGILR KNPGFSLNYP SIAVIFKRGK TTEMITRRVT NVGSPNSIYS VNVKAPEGIK VIVNPKRLVF KHVDQTLSYR VWFVLKKKNR GGKVASFAQG QLTWVNSHNL MQRVRSPISV TLKTN // ID NC003070_218 HYPOTHETICAL; PRT; 1355 AA. AC NC003070_218; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1069918...1068356, 1068246...1067926, DE 1067848...1067762, 1067549...1066851, 1066770...1066606, DE 1066524...1066230, 1066148...1065934, 1065854...1065549, DE 1065442...1065379, 1065293...1065054, 1064959...1064847]; Length: 4068. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 521 FIRST EXON; p-value: NaN. FT GENSCAN 522 628 INTERNAL EXON; p-value: NaN. FT GENSCAN 629 657 INTERNAL EXON; p-value: NaN. FT GENSCAN 658 890 INTERNAL EXON; p-value: NaN. FT GENSCAN 891 945 INTERNAL EXON; p-value: NaN. FT GENSCAN 946 1043 INTERNAL EXON; p-value: NaN. FT GENSCAN 1044 1044 AA on splice site: a/gc -> S. FT GENSCAN 1045 1115 INTERNAL EXON; p-value: NaN. FT GENSCAN 1116 1217 INTERNAL EXON; p-value: NaN. FT GENSCAN 1218 1238 INTERNAL EXON; p-value: NaN. FT GENSCAN 1239 1239 AA on splice site: g/ta -> V. FT GENSCAN 1240 1318 INTERNAL EXON; p-value: NaN. FT GENSCAN 1319 1319 AA on splice site: g/gt -> G. FT GENSCAN 1320 1355 LAST EXON; p-value: NaN. SQ SEQUENCE 1355 AA; 150286 MW; 42117586445B379F CRC64; MYVDGRRLAI EGWSRCSSHV VANLAVTPAL GFLCFLAWRG VSGIQVTRSS SDLQEPLLVE EEAACLKVTP YSTAGLVSLI TLSWLDPLLS AGSKRPLELK DIPLLAPRDR AKSSYKVLKS NWKRCKSENP SKPPSLARAI MKSFWKEAAC NAVFAGLNTL VSYVGPYLIS YFVDYLGGKE IFPHEGYVLA GIFFTSKLIE TVTTRQWYMG VDILGMHVRS ALTAMVYRKG LKLSSIAKQN HTSGEIVNYM AVDVQRIGDY SWYLHDIWML PMQIVLALAI LYKSVGIAAV ATLVATIISI LVTIPLAKVQ EDYQDKLMTA KDERMRKTSE CLRNMRVLKL QAWEDRYRVR LEEMREEEYG WLRKALYSQA FVTFIFWSSP IFVAAVTFAT SIFLGTQLTA GGVLSALATF RILQEPLRNF PDLVSMMAQT KVSLDRISGF LQEEELQEDA TVVIPRGLSN IAIEIKDGVF CWDPFSSRPT LSGIQMKVEK GMRVAVCGTV GSGKSSFISC ILGEIPKISG EVRICGTTGY VSQSAWIQSG NIEENILFGS PMEKTKYKNV IQACSLKKDI ELFSHGDQTI IGERGINLSG GQKQRVQLAR ALYQDADIYL LDDPFSALDA HTGSDLFRDY ILSALAEKTV VFVTHQVEFL PAADLILVLK EGRIIQSGKY DDLLQAGTDF KALVSAHHEA IEAMDIPSPS SEDSDENPIR DSLVLHNPKS DVFENDIETL AKEVQEGGSA SDLKAIKEKK KKAKRSRKKQ LVQEEERVKG KVSMKVYLSY MGAAYKGALI PLIILAQAAF QFLQIASNWW MAWANPQTEG DESKVDPTLL LIVYTALAFG SSVFIFVRAA LVATFGLAAA QKLFLNMLRS VFRAPMSFFD STPAGRILNR VSIDQSVVDL DIPFRLGGFA STTIQLCGIV AVMTNVTWQV FLLVVPVAVA CFWMQKYYMA SSRELVRIVS IQKSPIIHLF GESIAGAATI RGFGQEKRFI KRNLYLLDCF VRPFFCSIAA IEWLCLRMEL LSTLVFAFCM VLLVSFPHGT IDPSMAGLAV TYGLNLNGRL SRWILSFCKL ENKIISIERI YQYSQIVGEA PAIIEDFRPP SSWPATGTIE LVDVKVRYAE NLPTVLHGVS CVFPGGKKIG IVGRTGSGKS TLIQALFRLI EPTAGKITID NIDISQIGLH DLRSRLGIIP QDPTLFEGTI RANLDPLEEH SDDKIWEALD KSQLGDVVRG KDLKLDSPVL ENGDNWSVGQ RQLVSLGRAL LKQAKILVLD EATASVDTAT DNLIQKIIRT EFEDCTVCTI AHRIPTVIDS DLVLVLSDGR VAEFDTPARL LEDKSSMFLK LVTEYSSRST GIPEL // ID NC003070_219 HYPOTHETICAL; PRT; 401 AA. AC NC003070_219; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1071933...1071971, 1073365...1073568, DE 1073648...1073860, 1073939...1074130, 1074224...1074368, DE 1074488...1074609, 1074897...1075020, 1075107...1075199, DE 1075300...1075373]; Length: 1206. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 13 FIRST EXON; p-value: NaN. FT GENSCAN 14 81 INTERNAL EXON; p-value: NaN. FT GENSCAN 82 152 INTERNAL EXON; p-value: NaN. FT GENSCAN 153 216 INTERNAL EXON; p-value: NaN. FT GENSCAN 217 264 INTERNAL EXON; p-value: NaN. FT GENSCAN 265 265 AA on splice site: a/ac -> N. FT GENSCAN 266 305 INTERNAL EXON; p-value: NaN. FT GENSCAN 306 346 INTERNAL EXON; p-value: NaN. FT GENSCAN 347 347 AA on splice site: c/tg -> L. FT GENSCAN 348 377 INTERNAL EXON; p-value: NaN. FT GENSCAN 378 378 AA on splice site: g/tc -> V. FT GENSCAN 379 401 LAST EXON; p-value: NaN. SQ SEQUENCE 401 AA; 45230 MW; 7F7AEF18CC5A7B02 CRC64; MRMTTYYTCH SCKNSKLLLL LLLLLLSSSR LIEYFTICFL SFCCESMALW MDAGATPTTE KEKADLEAIS ALKESTAIEF KEEGNECVRK GKKHYSEAID CYTKAISQGV LSDSETSILF SNRSHVNLLL GNYRRALTDA EESMRLSPHN VKAVYRAAKA SMSLDLLNEA KSYCEKGIEN DPSNEDMKKL LKLVNSKKQE KEQHEAQASQ AVVEAKACLS AIENRGVKIG KAMYRELTGL KKPMLDKNNI LHWPVLLLYA EAMTNCSMRF MSLLSALNMF SEDSPPLPWD KNNEYSRDVI ELYYEASSGT PLPRSRVLQY LLEGTKGSQA ETTGEEDTSA TKTPSYLKGS SGMVKVNERR TLHDVLKEPK FVIPEIPVFY IVSKRSKFYK DFTAGKWTPP N // ID NC003070_220 HYPOTHETICAL; PRT; 1729 AA. AC NC003070_220; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1084245...1081426, 1080497...1080390, DE 1080384...1080255, 1079712...1079652, 1079462...1079409, DE 1079329...1079245, 1078945...1078837, 1078748...1078681, DE 1078551...1078426, 1078352...1077340, 1077235...1077035, DE 1076931...1076871, 1076678...1076565, 1076423...1076310, DE 1076224...1076099]; Length: 5190. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 940 FIRST EXON; p-value: NaN. FT GENSCAN 941 976 INTERNAL EXON; p-value: NaN. FT GENSCAN 977 1019 INTERNAL EXON; p-value: NaN. FT GENSCAN 1020 1020 AA on splice site: a/aa -> K. FT GENSCAN 1021 1039 INTERNAL EXON; p-value: NaN. FT GENSCAN 1040 1040 AA on splice site: ca/g -> Q. FT GENSCAN 1041 1057 INTERNAL EXON; p-value: NaN. FT GENSCAN 1058 1058 AA on splice site: ca/t -> H. FT GENSCAN 1059 1086 INTERNAL EXON; p-value: NaN. FT GENSCAN 1087 1122 INTERNAL EXON; p-value: NaN. FT GENSCAN 1123 1123 AA on splice site: t/at -> Y. FT GENSCAN 1124 1145 INTERNAL EXON; p-value: NaN. FT GENSCAN 1146 1187 INTERNAL EXON; p-value: NaN. FT GENSCAN 1188 1524 INTERNAL EXON; p-value: NaN. FT GENSCAN 1525 1525 AA on splice site: ag/t -> S. FT GENSCAN 1526 1591 INTERNAL EXON; p-value: NaN. FT GENSCAN 1592 1592 AA on splice site: tc/a -> S. FT GENSCAN 1593 1612 INTERNAL EXON; p-value: NaN. FT GENSCAN 1613 1650 INTERNAL EXON; p-value: NaN. FT GENSCAN 1651 1688 INTERNAL EXON; p-value: NaN. FT GENSCAN 1689 1729 LAST EXON; p-value: NaN. SQ SEQUENCE 1729 AA; 191369 MW; DBEF978251B2DD7B CRC64; MTEAKTGTGN ERLVVEIVGA HNLMPKDGED SSSPFVEVQF ENQRLRTKVK PKDLNPIWNE KLVFHVIDVN DLRHKALEIN VYNEKRSSNS RNFLGKVRVL GSSVGREGES VVQLYTLEKR SLFSSVRGEI SVKHYMTTTA ENGENVRRVN RSGGSKKSKK VQNVSSSMAI QQQQQQQQQQ ISLHNHNRGN QQQSQQNGQG QRMLPFYPHQ SEIKPLVITA LPSPMPGPGP RPIVYSNGSS EFSLKETKPC LGGTSNGLGG LSSHKDKTSS TYDLVEQMQY LYVNIVKAKD LSVLGEVVSE VKLGNYRGVT KKVSSNSSNP EWNQVFVFSK ERIQSSVVEL FVKEGNKDEY TGRVLFDLSE IPTRVPPDSP LAPQWYKIEN RNGGRGNGEL MVSVWFGTQA DEAFAEAWHS KAGNVHIEEL SSIKSKVYLS PKLWYLRISV IEAQDVAIMD KGSSLMRFPE LSAKLQVGSQ ILRTAIASAI PTKSFSNPYW NEDLMFVVAE PFEDCVTVVV EDRLNGGAIG GQNDVAVGRV QIPISAVERR TGDTLVGSRW FSLDNGNNNN RFGSRIHLRL SLDGGYHVLD EATMYNSDVR PTAKELWKPQ VGLLEIGILS ATGLMPMKVR DGKCGGIADS YCVAKYGPKW VRTRTVVDSL CPKWNEQYTW EVYDPCTVVT VGVFDNARVN ENNNSRDVRI GKVRIRLSTL ETGRVYTHSY PLIVLHPSGV KKTGELHLAV RLSCGNAVNM LHMYALPLLP KMHYTQPLGV HMLERLRYQT LNAVAARLSR AEPPLGREVV EYMLDHDFHV WSMRRSKANF FRLVNVISGL VAVAKLVEVM RSWSKPVYST VFVLAFLFMV LFPELLLPCL LLYTAAVGVW RFRRRSRYPP HMDARISHAE TVFPDELDEE FDTFPTSRGF DVVRMRYDRV RSIAGRVQTV VGDMASQGER LFTVNNPDPS RGRSRRRGNS GRLDRRFGLV FNEDRLGFVR IYGFSVKWGI DMNFGFRDDG IYLVKRSIFS SVDDTFSKTK NGGVKADVML IRRVEPLANQ STIAAAFSSD GRTLASTHGD HTVKIIDCET GKCLKILTGH RRTPWVVRFH PRHSEIVASG SLDHEVRLWN AKTGECIRTH DFYRPIASIA FHAGGELLAV ASGHKLHIWH YNKGGDDSAP AIVLKTRRSL RAVHFHPHGV PLLLTAEVTD IDSSDSAMTR STSPGYLRYP PPAIFFTNTQ SGSRTSLAAE LPLVPLPYLL LPSYSADDPR ILYSSGTTGP RNAQTRFQSN QSSVEHGSRT ISPSPLPMAT SADLSGSYHV PDNSASNTFA TQAGARNSTT AVDAMDVDEA QPVGRNRVPS QVSSQPDLLE FGQLQQLFHF RDRGSWELPF LQGWLMAQSQ AGANSVALPT GSSGHVNSTP YMGSSSASHS STASLEAGVA SLEIPGGVNL YGVSARGDSR DRILQSRFAG SGLAEGRSSR NTQHEGADAQ PVVNRIPSEL ASSIAAAELP CTVKLRVWSH DIKDPCSILK SDKCRLTIHH AVLCSEMGAH FSPCGRYLAA CVACVIPHAE TDPSLQTLVQ QDSGLATSPT RHPVTAHQVM YELRVYSLEK ESFGSVLVSR AIRAAHCLTS IQFSPTSEHI LLAYGRRHGS LLKSIVSDGE TTSHFFTVLE IYRVSDMELV RVLPSSEDEV NVACFHPSPG GGLVYGTKEG KLRIFRYNTA AASNLTAPNS SPDENLAEVE LLTRLIFLK // ID NC003070_221 HYPOTHETICAL; PRT; 1851 AA. AC NC003070_221; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1086494...1086496, 1087113...1087241, DE 1087369...1087512, 1087656...1087801, 1087889...1088048, DE 1088250...1088308, 1088410...1088569, 1088663...1088812, DE 1088894...1089030, 1089120...1089266, 1089392...1089504, DE 1090309...1090346, 1090405...1090442, 1090566...1090692, DE 1090814...1090981, 1091303...1091412, 1091501...1091561, DE 1091655...1091832, 1091925...1092130, 1092301...1092420, DE 1092571...1092669, 1092775...1092987, 1093108...1093247, DE 1093335...1093449, 1093540...1093584, 1093824...1093994, DE 1094311...1094463, 1094550...1094741, 1094900...1095028, DE 1095106...1095176, 1095346...1095442, 1095517...1095573, DE 1095803...1095966, 1096051...1096144, 1097398...1097530, DE 1097618...1097679, 1097799...1097911, 1097998...1098100, DE 1098378...1098605, 1098717...1098935, 1099036...1099296, DE 1099399...1099701]; Length: 5556. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 1 FIRST EXON; p-value: NaN. FT GENSCAN 2 44 INTERNAL EXON; p-value: NaN. FT GENSCAN 45 92 INTERNAL EXON; p-value: NaN. FT GENSCAN 93 140 INTERNAL EXON; p-value: NaN. FT GENSCAN 141 141 AA on splice site: ag/a -> R. FT GENSCAN 142 194 INTERNAL EXON; p-value: NaN. FT GENSCAN 195 213 INTERNAL EXON; p-value: NaN. FT GENSCAN 214 214 AA on splice site: ag/t -> S. FT GENSCAN 215 267 INTERNAL EXON; p-value: NaN. FT GENSCAN 268 317 INTERNAL EXON; p-value: NaN. FT GENSCAN 318 362 INTERNAL EXON; p-value: NaN. FT GENSCAN 363 363 AA on splice site: at/g -> M. FT GENSCAN 364 411 INTERNAL EXON; p-value: NaN. FT GENSCAN 412 412 AA on splice site: tg/g -> W. FT GENSCAN 413 449 INTERNAL EXON; p-value: NaN. FT GENSCAN 450 450 AA on splice site: g/tt -> V. FT GENSCAN 451 462 INTERNAL EXON; p-value: NaN. FT GENSCAN 463 474 INTERNAL EXON; p-value: NaN. FT GENSCAN 475 475 AA on splice site: tg/c -> C. FT GENSCAN 476 517 INTERNAL EXON; p-value: NaN. FT GENSCAN 518 573 INTERNAL EXON; p-value: NaN. FT GENSCAN 574 609 INTERNAL EXON; p-value: NaN. FT GENSCAN 610 610 AA on splice site: aa/t -> N. FT GENSCAN 611 630 INTERNAL EXON; p-value: NaN. FT GENSCAN 631 689 INTERNAL EXON; p-value: NaN. FT GENSCAN 690 690 AA on splice site: g/gt -> G. FT GENSCAN 691 758 INTERNAL EXON; p-value: NaN. FT GENSCAN 759 798 INTERNAL EXON; p-value: NaN. FT GENSCAN 799 831 INTERNAL EXON; p-value: NaN. FT GENSCAN 832 902 INTERNAL EXON; p-value: NaN. FT GENSCAN 903 948 INTERNAL EXON; p-value: NaN. FT GENSCAN 949 949 AA on splice site: ag/g -> R. FT GENSCAN 950 987 INTERNAL EXON; p-value: NaN. FT GENSCAN 988 1002 INTERNAL EXON; p-value: NaN. FT GENSCAN 1003 1059 INTERNAL EXON; p-value: NaN. FT GENSCAN 1060 1110 INTERNAL EXON; p-value: NaN. FT GENSCAN 1111 1174 INTERNAL EXON; p-value: NaN. FT GENSCAN 1175 1217 INTERNAL EXON; p-value: NaN. FT GENSCAN 1218 1240 INTERNAL EXON; p-value: NaN. FT GENSCAN 1241 1241 AA on splice site: ag/c -> S. FT GENSCAN 1242 1273 INTERNAL EXON; p-value: NaN. FT GENSCAN 1274 1292 INTERNAL EXON; p-value: NaN. FT GENSCAN 1293 1346 INTERNAL EXON; p-value: NaN. FT GENSCAN 1347 1347 AA on splice site: ag/a -> R. FT GENSCAN 1348 1378 INTERNAL EXON; p-value: NaN. FT GENSCAN 1379 1422 INTERNAL EXON; p-value: NaN. FT GENSCAN 1423 1423 AA on splice site: g/ga -> G. FT GENSCAN 1424 1443 INTERNAL EXON; p-value: NaN. FT GENSCAN 1444 1480 INTERNAL EXON; p-value: NaN. FT GENSCAN 1481 1481 AA on splice site: aa/g -> K. FT GENSCAN 1482 1515 INTERNAL EXON; p-value: NaN. FT GENSCAN 1516 1591 INTERNAL EXON; p-value: NaN. FT GENSCAN 1592 1664 INTERNAL EXON; p-value: NaN. FT GENSCAN 1665 1751 INTERNAL EXON; p-value: NaN. FT GENSCAN 1752 1851 LAST EXON; p-value: NaN. SQ SEQUENCE 1851 AA; 206836 MW; FFC24CB281202BFA CRC64; MVATFNPAVG SHVWVEDPDE AWLDGEVVEI NGDQIKVLCA SGKQVVVKDS NIYPKDVEAP ASGVEDMTRL AYLHEPGVLQ NLQSRYDINE IYTYTGSILI AVNPFRRLPH LYSSHMMTQY KGASLGELSP HPFAVADAAY RQMVNEGVSQ SILVSGESGA GKTESTKLLM RYLAFMGGRG AATEGRTVEQ KVLESNPVLE AFGNAKTVKN NNSSRFGKFV EIQFDQSGRI SGAAIRTYLL ERSRVCQVSD PERNYHCFYM LCAAPEEDAK KFKLGDPKIY HYLNQSKCIQ LDAMNDAEEY HATKKAMDVV GISSEEQDAI FRVVASILHL GNIEFAKGTE IDSSIPRDEK SWFHLKTAAE LLMCNEKSLE DSLCKRIMAT RDETITKTLD PEAALLSRDA LAKVMYSRLF DWLVEKINTS IGQDPDSKYL IGVLDIYGFE SFKTNRCLTV LIAIDVVNVI NIKPGGIIAL LDEACMFPRS THETFAQKLY QTYKNHKRFT KPKLARSDFT ICHYAGDVTY QTELFLDKNK DYVIAEHQAL LNASTCSFVA NLFPPVSDDS KQSKFSSIGT RFKGVMEAIR ISCAGYPTRK HFDEFLNRFG IIAPQVLDKN SNEPAACKKL LDKAGLEGYQ IGKSKVFLRA GQMADLDTRR TEILGRSASI IQRKVRSYLA QKTFIQLRIS ATQIQAVCRG YLARSIYEGM RREAAALKIQ RDLRKFLARK AYTELFSATI LIQAGMRGMV SRKELCLRRQ TKAATIIQTR CRVYLARLHY RKLKKAAITT QCAWRGKVAR KELKNLKMAA RETGALQEAK NKLEKQVEEL TWRLQLEKRM RTDLEEAKKQ ENAKYESSLE EIQNKFKETE ALLIKEREAA KTVSEVLPII KEVPVVDQEL MEKLTNENEK LKGMVSSLEI KIDETAKELH ETARISQDRL KQALAAESKV AKLKTAMQRL EEKISDMETE KQIMLQQTIL NTPVKSVAGH PPTATIKNLE NGHRTNLENQ FNENVDTLID CVKENIGFSN GKPIAAFTIY KCLLHWKCFE SEKTSAFDRL IEMIGSAIEN EDDNGHLAYW LTNTSALLFL LQKSLKPAGA GATASKKPPI TTSLFGRMAL SFRSSPNLAA AAEAAALAVI RPVEAKYPAL LFKQQLAAYV EKIFGMIRDN LKKELSALIS MCIQAPRISK GGIQRSARSL GKDSPAIHWQ SIIDGLNSLL AILKDNYVPL VLIQKIHTQT FSFVNVQLFN SLLLRKECCT FSNGEFVKSG LAELELWCGQ VNEYAGPSWD ELKHIRQAVG FLILSVQQLY RICTLYWDDC YNTRSVSQEV ISSMRALMTE ESNDADSNSF LLDDNSRRDF EFDARKRLCE CQTGQRATGK PRIRILALEQ KLQSLIMSRN KGLAEQDLKK LDVTVLHPLS PEVISRQATI NIGTIGHVAH GKSTVVKAIS GVQTVRFKNE LERNITIKLG YANAKIYKCE DEKCPRPMCY KAYGSGKEDT PNCDVPGFEN SKMKLLRHVS FVDCPGHDIL MATMLNGAAI MDGALLLIAA NETCPQPQTS EHLAAVEIMQ LKHIIILQNK IDLIQENVAI NQHEAIQKFI MNTVADAAPI VPVSAQLKYN IDVVCEYIVK KIPIPERNFV SPPNMIVIRS FDVNKPGYEV DEIKGGVAGG SILRGVLRVN QLIEIRPGIV TKDERGNSKC TPIYSRIISL YAEQNELQFA VPGGLIGVGT TMDPTLTRAD RLVGQVLGEI GSLPDVFVEL EVNFFLLRRL LGVRTKGSEK QGKVSKLTKG EILMLNIGSM STGAKVVGVK VDLAKLQLTA PVCTSKGEKV ALSRRVEKHW RLIGWGQIQA GTTIEVPPSP F // ID NC003070_222 HYPOTHETICAL; PRT; 421 AA. AC NC003070_222; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1104622...1105287, 1105388...1105987]; DE Length: 1266. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 222 FIRST EXON; p-value: NaN. FT GENSCAN 223 421 LAST EXON; p-value: NaN. SQ SEQUENCE 421 AA; 47403 MW; F616FF3E1987CF79 CRC64; MENMFRLMAS EEYFSERRCV WVNGPVIVGA GPSGLATAAC LHDQGVPFVV VERSDCIASL WQKRTYDRLK LHLPKKFCQL PKMPFPDHYP EYPTKRQFID YLESYANRFD IKPEFNKSVE SARFDETSGL WRVRTTSDGE EMEYICRWLV VATGENAERV VPEINGLMTE FDGEVIHACE YKSGEKFRGK RVLVVGCGNS GMEVSLDLAN HNAITSMVVR SSVHVLPREI MGKSTFGISV MMMKWLPLWL VDKLLLILSW LVLGSLSNYG LKRPDIGPME LKSMTGKTPV LDIGALEKIK SGDVEIVPAI KQFSRHHVEL VDGQKLDIDA VVLATGYRSN VPSWLQESEF FSKNGFPKSP FPNAWKGKSG LYAAGFTRKG LAGASVDAVN IAQDIGNVWR EETKRQKMRR NVGHRRCISV A // ID NC003070_223 HYPOTHETICAL; PRT; 358 AA. AC NC003070_223; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1108556...1108399, 1108077...1107984, DE 1107869...1107555, 1107446...1107267, 1107188...1107069, DE 1106989...1106856, 1106691...1106616]; Length: 1077. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 52 FIRST EXON; p-value: NaN. FT GENSCAN 53 53 AA on splice site: ag/t -> S. FT GENSCAN 54 84 INTERNAL EXON; p-value: NaN. FT GENSCAN 85 189 INTERNAL EXON; p-value: NaN. FT GENSCAN 190 249 INTERNAL EXON; p-value: NaN. FT GENSCAN 250 289 INTERNAL EXON; p-value: NaN. FT GENSCAN 290 333 INTERNAL EXON; p-value: NaN. FT GENSCAN 334 334 AA on splice site: ag/g -> R. FT GENSCAN 335 358 LAST EXON; p-value: NaN. SQ SEQUENCE 358 AA; 40557 MW; D63A5F736DB91FAD CRC64; MAEKAGKATN GGEAEKSLKE KGNEFFKAGN FLKAAALYTQ AIKLDPSNAT LYSNRAAAFL SLVKLSKALA DAETTIKLNP QWEKGYFRKG CVLEAMEKYE DVYVSLILVP SKFFLSQSYS MVVKTVMIVS QALAAFEMAL QYNPQSTEVS RKIKRLGQLQ KEKQRAQELE NLRSNVDMAK HLESFKSEMS ENYGTEECWK EMFSFIVETM ETAVKSWHET SKVDTRVYFL LDKEKTQTDK YAPAVNIDKA FQSPDTHSNC FTYLRQYAEE SFSKAACLVT PKSSISYPQV WKGVGSRKWK LGPNDGIFVQ FESPSLRKVW FISSSKEKGQ TLCREPQALD IGAHEILPRI FKEKSKSS // ID NC003070_224 HYPOTHETICAL; PRT; 1816 AA. AC NC003070_224; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1109675...1109864, 1109983...1110222, DE 1110242...1110362, 1110411...1110585, 1110677...1110818, DE 1110982...1111227, 1111304...1111391, 1111522...1111624, DE 1111790...1111918, 1112290...1112408, 1112499...1112589, DE 1112860...1112949, 1113071...1113190, 1113300...1113391, DE 1113506...1113563, 1113785...1113832, 1114635...1115191, DE 1115556...1115820, 1115937...1116099, 1116225...1116960, DE 1117052...1117481, 1117574...1117755, 1117988...1118144, DE 1118222...1118446, 1118530...1118880, 1118967...1119068, DE 1119152...1119382]; Length: 5451. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 63 FIRST EXON; p-value: NaN. FT GENSCAN 64 64 AA on splice site: g/ca -> A. FT GENSCAN 65 143 INTERNAL EXON; p-value: NaN. FT GENSCAN 144 144 AA on splice site: g/tt -> V. FT GENSCAN 145 183 INTERNAL EXON; p-value: NaN. FT GENSCAN 184 184 AA on splice site: ag/t -> S. FT GENSCAN 185 242 INTERNAL EXON; p-value: NaN. FT GENSCAN 243 289 INTERNAL EXON; p-value: NaN. FT GENSCAN 290 290 AA on splice site: g/ca -> A. FT GENSCAN 291 371 INTERNAL EXON; p-value: NaN. FT GENSCAN 372 372 AA on splice site: t/tt -> F. FT GENSCAN 373 400 INTERNAL EXON; p-value: NaN. FT GENSCAN 401 401 AA on splice site: at/g -> M. FT GENSCAN 402 435 INTERNAL EXON; p-value: NaN. FT GENSCAN 436 478 INTERNAL EXON; p-value: NaN. FT GENSCAN 479 517 INTERNAL EXON; p-value: NaN. FT GENSCAN 518 518 AA on splice site: aa/g -> K. FT GENSCAN 519 548 INTERNAL EXON; p-value: NaN. FT GENSCAN 549 578 INTERNAL EXON; p-value: NaN. FT GENSCAN 579 618 INTERNAL EXON; p-value: NaN. FT GENSCAN 619 648 INTERNAL EXON; p-value: NaN. FT GENSCAN 649 649 AA on splice site: tg/t -> C. FT GENSCAN 650 668 INTERNAL EXON; p-value: NaN. FT GENSCAN 669 684 INTERNAL EXON; p-value: NaN. FT GENSCAN 685 869 INTERNAL EXON; p-value: NaN. FT GENSCAN 870 870 AA on splice site: ag/a -> R. FT GENSCAN 871 958 INTERNAL EXON; p-value: NaN. FT GENSCAN 959 1012 INTERNAL EXON; p-value: NaN. FT GENSCAN 1013 1013 AA on splice site: g/gc -> G. FT GENSCAN 1014 1257 INTERNAL EXON; p-value: NaN. FT GENSCAN 1258 1258 AA on splice site: ag/g -> R. FT GENSCAN 1259 1401 INTERNAL EXON; p-value: NaN. FT GENSCAN 1402 1461 INTERNAL EXON; p-value: NaN. FT GENSCAN 1462 1462 AA on splice site: ag/g -> R. FT GENSCAN 1463 1514 INTERNAL EXON; p-value: NaN. FT GENSCAN 1515 1589 INTERNAL EXON; p-value: NaN. FT GENSCAN 1590 1706 INTERNAL EXON; p-value: NaN. FT GENSCAN 1707 1740 INTERNAL EXON; p-value: NaN. FT GENSCAN 1741 1816 LAST EXON; p-value: NaN. SQ SEQUENCE 1816 AA; 204374 MW; FFAD09391F62F129 CRC64; MGGVPSTPRK TGGDDVSVAE YLIATFVGEK SFPLASDFWN KLLELPLSSR WPSDRVQQAC ELFAQSNGYT RHLAKLLIHL SWCLQELLQA SDDQSSLYKK AVNATYISSV FLKYLIENGK SDSLQELHLS LDESEPVPHG FVMVISSVKV FLSWLIKSVL MADQDIQNFV MHSVLSFIGS NEVSIRCFIL LIFIIYLASP NSYVLHQELL NFMLVTMSTQ LLSGPSHGPT DANPFIDAAM TQEKSLVSLV VRRLLLNYIS RHRTPPNAKS YMYSDGDSQG ILERVGSAAA SLVLLPLNYL VSNSGGSKNP LAECSLHVLL ILINYHKSIM SDESMTDKSD DSATSESVSK VHVFSSDNTF SKALANARDV EFDRSDVEGN AHPAGPHVRI PFASLFDTLG MFLADEGAVL LLYSLLQGNS DFKEYVLVRT DLDTLLMPIL ETLYNASKRT SSNQIYMMLI VLLILSQDSS FNSSIHKMDV YLQTTCLATL ANMAPHAHHL SAYASQRLVS LFYMLSRKYN KLSDLTGDKL QSIKINLSGE DVGVSEDLIV YAIMHRQEVF QPFKNHPRFH ELVENIYTVL DFFNSRMDSQ RSDREWSVQK VLQFIINNCR SWRGEGMKMF TQLHFSYEQE SHPEEFFIPY VWQLAFSRCG FGFNPDAINL FPVPHPVEKE IEDERGEESE GKAKIQWRSL IRCGTSTEAI GCIKMDSKIK KPANLIEDAD IDGGSESDST ISSVLSLEDD SVVDVSGQNL EFSLLDNVDD SVKGLYFFRN VFNLIPKSIG GLGRLRKLKF FSNEIDLFPP ELGNLVNLEY LQVKISSPGF GDGLSWDKLK GLKELELTKV PKRSSALTLL SEISGLKCLT RLSVCHFSIR YLPPEIGCLK SLEYLDLSFN KIKSLPNEIG YLSSLTFLKV AHNRLMELSP VLALLQNLES LDVSNNRLTT LHPLDLNLMP RLQILNLRYN KLPSYCWIPT WIQCNFEGNY EEMGVDTCSS SMVEMDVFET PYENNVITVP HKGSHRNPLN MSTGISSISR CFSARKSSKR WKRRQYYFQQ RARQERLNNS RKWKGEVPPE GLSLKMEVEE TGKQGMKVPQ NTDRGSVDNS CSDENDKLFE EASVITSEEE ESSLKADVVS DNSQCVETQL TSERDNYESC EIKTSSPSSG DAPGTVDYNS SSERKKPNNK SKRCSEKYLD NPKGSKCHKL STDITNLSRK YSSNSFCSTE DSLPDGFFDA GRDRPFMTLS KYEKVLPLDS REVILLDRAK DEVLDAITLS ARALVARLKK LNCLTPDVDQ VSIDNLQVAS FLALFVSDHF GGSDRTAIIE RTRKAVSGTN YQKPFICTCL TGNQDDLAAL NKQVSTTAED AILSDVCEKS LRSIKSKRNS IVVPLGKLQF GICRHRALLM KYLCDRMEPP VPCELVRGYL DFMPHAWNIV PVKQGSSWVR MVVDACRPHD IREDTDQEYF CRYIPLNRLN ESIRIKEKLE PGCSVSSLST GKGVERANSS LIRCKLGSTE AVVKMRTLEV SGASLDDIRT FEYTCLGEVR ILGALKHDCI VELYGHEISS KWITSENGNE HRVLQSSILM EHIKGGSLKG HIEKLSEAGK HHVPMDLALS IARDISGALM ELHSKDIIHR DIKSENVLID LDNQSANGEP IVKLCDFDRA VPLRSHLHGC CIAHVGIPPP NICVGTPRWM SPEVFRAMHE QNFYGLEVDI WSFGCLIFEL LTLQNPYFDL SELQIHESLQ NGKRPKLPKK LETLISETEE EESTNKLSEV FDLTESDLDT MRFLIDVFHQ CTEESPSDRL NAGDLHEMIL SRKKRE // ID NC003070_225 HYPOTHETICAL; PRT; 528 AA. AC NC003070_225; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1122482...1121715, 1120670...1119852]; DE Length: 1587. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 256 FIRST EXON; p-value: NaN. FT GENSCAN 257 528 LAST EXON; p-value: NaN. SQ SEQUENCE 528 AA; 59529 MW; FC47749CBBB26B3E CRC64; MNENHIQSDH MNNTIHVTNK KLPNFLLSVR LKYVKLGYHY LISNAVYILI LPVGLLAATS SSFSLTDLTL LYNHLLKFHF LSSTLFAALL IFLTTLYFTT RPRRIFLLDF ACYKPDSSLI CTRETFMDRS QRVGIFTEDN LAFQQKILER SGLGQKTYFP EALLRVPPNP CMSEARKEAE TVMFGAIDAV LEKTGVNPKD IGILVVNCSL FNPTPSLSAM IVNKYKLRGN VLSYNLGGMG CSAGLISIDL AKQLLQVQPN SYALVVSTEN ITLNWYLGND RSMLLSNCIF RMGGAAVLLS NRSSDRCRSK YQLIHTVRTH KGSDDNAFNC VYQREDNDDN KQIGVSLSKN LMAIAGEALK TNITTLGPLV LPMSEQLLFF ATLVARKVFN VKKIKPYIPD FKLAFEHFCI HAGGRAVLDE IEKNLDLSEW HMEPSRMTLN RFGNTSSSSL WYELAYSEAK GRIKRGDRTW QIAFGSGFKC NSAVWRALRT IDPSKEKKKK TNPWIDEIHE FPVPVPRTSP VTSSSESR // ID NC003070_226 HYPOTHETICAL; PRT; 325 AA. AC NC003070_226; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1125575...1125727, 1125994...1126123, DE 1126232...1126362, 1126534...1126672, 1126847...1126961, DE 1127192...1127255, 1127433...1127672, 1127822...1127827]; Length: 978. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 51 FIRST EXON; p-value: NaN. FT GENSCAN 52 94 INTERNAL EXON; p-value: NaN. FT GENSCAN 95 95 AA on splice site: g/ag -> E. FT GENSCAN 96 138 INTERNAL EXON; p-value: NaN. FT GENSCAN 139 184 INTERNAL EXON; p-value: NaN. FT GENSCAN 185 185 AA on splice site: g/ag -> E. FT GENSCAN 186 222 INTERNAL EXON; p-value: NaN. FT GENSCAN 223 223 AA on splice site: aa/a -> K. FT GENSCAN 224 244 INTERNAL EXON; p-value: NaN. FT GENSCAN 245 324 INTERNAL EXON; p-value: NaN. FT GENSCAN 325 325 LAST EXON; p-value: NaN. SQ SEQUENCE 325 AA; 37150 MW; B35DC2296031FC89 CRC64; MAHGGYARRR VAERKTTAGT SRRSKGLRVE KKPKNSSLKN QIRSIGRMIR KDLPPEVREA LEKKLDDLKK QQDIHFRLAF ERKIFLRNRK VRFFERRKIE RSIRRLEKLH RSTSGGYVQD AEIGGQLNRL KEDLEYVRFF PKNEKYVSLF SGSDDLQVSE RRSKLRKQIK ANIIFAAASG KELEETGSED DALLDLSDDD FFVNGSSSDE ADADDEWTDK STKEPVSSAS GRATSSMSSD ERNQKPYSTR VLMPPPRSRF ASTSRQYSSV KRNEIPSSSN TSHRRSQSSH AATSSHTSQS SNLSSNSDAH KPKRKRRPKK KKLQA // ID NC003070_227 HYPOTHETICAL; PRT; 223 AA. AC NC003070_227; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1130395...1130345, 1129369...1129134, DE 1129051...1128743, 1128638...1128563]; Length: 672. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 17 FIRST EXON; p-value: NaN. FT GENSCAN 18 95 INTERNAL EXON; p-value: NaN. FT GENSCAN 96 96 AA on splice site: aa/g -> K. FT GENSCAN 97 198 INTERNAL EXON; p-value: NaN. FT GENSCAN 199 199 AA on splice site: ga/g -> E. FT GENSCAN 200 223 LAST EXON; p-value: NaN. SQ SEQUENCE 223 AA; 25386 MW; 547F73AD946C1C4A CRC64; MEPKLRSGSI RVGYVKDDNN QKLFFLSSPA ILEEMDEFVN LKETELRLGL PGTDNVCEAK ERVSCCNNNN KRVLSTDTEK EIESSSRKTE TSPPRKAQIV GWPPVRSYRK NNIQSKKNES EHEGQGIYVK VSMDGAPYLR KIDLSCYKGY SELLKALEVM FKFSVGEYFE RDGYKGSDFV PTYEDKDGDW MLIGDVPWEM FICTCKRLRI MKGSEAKGLG CGV // ID NC003070_228 HYPOTHETICAL; PRT; 218 AA. AC NC003070_228; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1136384...1136619, 1136933...1137144, DE 1137459...1137597, 1137739...1137800, 1138699...1138706]; Length: 657. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 78 FIRST EXON; p-value: NaN. FT GENSCAN 79 79 AA on splice site: aa/g -> K. FT GENSCAN 80 149 INTERNAL EXON; p-value: NaN. FT GENSCAN 150 150 AA on splice site: g/gc -> G. FT GENSCAN 151 195 INTERNAL EXON; p-value: NaN. FT GENSCAN 196 196 AA on splice site: cc/a -> P. FT GENSCAN 197 216 INTERNAL EXON; p-value: NaN. FT GENSCAN 217 217 AA on splice site: g/tc -> V. FT GENSCAN 218 218 LAST EXON; p-value: NaN. SQ SEQUENCE 218 AA; 24014 MW; BB735D2F9B076678 CRC64; MGSVELNLRE TELCLGLPGG DTVAPVTGNK RGFSETVDLK LNLNNEPANK EGSTTHDVVT FDSKEKSACP KDPAKPPAKA QVVGWPPVRS YRKNVMVSCQ KSSGGPEAAA FVKVSMDGAP YLRKIDLRMY KSYDELSNAL SNMFSSFTMG KHGGEEGMID FMNERKLMDL VNSWDYVPSY EDKDGDWMLV GDVPWPMFVD TCKRLRLMKG SDAIGLVS // ID NC003070_229 HYPOTHETICAL; PRT; 323 AA. AC NC003070_229; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1142959...1142957, 1142868...1142720, DE 1142632...1142575, 1142099...1141904, 1141313...1140748]; Length: 972. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 1 FIRST EXON; p-value: NaN. FT GENSCAN 2 50 INTERNAL EXON; p-value: NaN. FT GENSCAN 51 51 AA on splice site: ag/g -> R. FT GENSCAN 52 70 INTERNAL EXON; p-value: NaN. FT GENSCAN 71 135 INTERNAL EXON; p-value: NaN. FT GENSCAN 136 136 AA on splice site: g/ga -> G. FT GENSCAN 137 323 LAST EXON; p-value: NaN. SQ SEQUENCE 323 AA; 35261 MW; 6ACC4D358C6EB5F4 CRC64; MADVEPEVAA AGVPKKRTFK KFAFKGVDLD ALLDMSTDDL VKLFSSRIRR RFSRGLTRKP MALIKKLRKA KREAPQGEKP EPVRTHLRNM IIVPEMIGSI IGVYNGKTFN QVEIKPEMIG HYLAEFSISY KPVKHGEIQS AMANQVITGI KETAQSITGA ARPWGDFLDL SAFSFPSSIA DATTRVTQNL THFRINYSII LSILLGLTLI TRPIAILAFI AVGLAWFFLY FAREEPLTIF GFTIDDGIVA VLLIGLSIGS LVTTGVWLRA LTTVGFGVLV LILHAALRGT DDLVSDDLES PYGPMLSTSG GGNDGARGDY SGI // ID NC003070_230 HYPOTHETICAL; PRT; 525 AA. AC NC003070_230; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1148351...1148172, 1147863...1147779, DE 1146319...1146140, 1145719...1145496, 1145358...1145294, DE 1144783...1144654, 1144555...1144233, 1144123...1143979, DE 1143887...1143642]; Length: 1578. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 60 FIRST EXON; p-value: NaN. FT GENSCAN 61 88 INTERNAL EXON; p-value: NaN. FT GENSCAN 89 89 AA on splice site: g/ct -> A. FT GENSCAN 90 148 INTERNAL EXON; p-value: NaN. FT GENSCAN 149 149 AA on splice site: g/ca -> A. FT GENSCAN 150 223 INTERNAL EXON; p-value: NaN. FT GENSCAN 224 244 INTERNAL EXON; p-value: NaN. FT GENSCAN 245 245 AA on splice site: ag/g -> R. FT GENSCAN 246 288 INTERNAL EXON; p-value: NaN. FT GENSCAN 289 395 INTERNAL EXON; p-value: NaN. FT GENSCAN 396 396 AA on splice site: cg/g -> R. FT GENSCAN 397 444 INTERNAL EXON; p-value: NaN. FT GENSCAN 445 525 LAST EXON; p-value: NaN. SQ SEQUENCE 525 AA; 59426 MW; D07B545EE840D7A1 CRC64; MDLESVKKYL EGDEDEKAKE PMVAKLPHRF LERFVTNGLK VDLIEPGRIV CSMKIPPHLL EEIEIESKAL RVGKAVAVVS VELRKKTTAS LGGLAAAVAA AYAGELLLRR RKLDQGASMG YKDVKIAPLI ERKDSGRRSN LERFSHYVAR QLGFEDPNEY PQLCKLANGY LLKTKGYDEN VDEYLENEAE RDSLYVHLLE EFDRCILTYF SFNWTQSSNL ISQALSDESD QKVPKLKDFV MAATRSFWSE AQADAVVIEA DAFKETDVIY RALSSRGHHD DMLQTAELVH QSSTDAASSL LVTALNDGRD VIMDGTLSWE PFVEQMIEMA RNVHKQKYRM GEGYKVSEEG TITEKYWEEE EEETKENGKQ QNLKPYRIEL VGVVCDAYLA VARGIRRALM VKRAVRVKPQ LNSHKRFANA FPKYCELVDN ARLYCTNAVG GPPRLIAWKD GNSKLLVDPE DIDCLKRVSS LNPDAESIYE LYPDPSQLSK PGSVWNDVVL VPSRPKVQKE LSDAIRRIEK AQPKN // ID NC003070_231 HYPOTHETICAL; PRT; 1680 AA. AC NC003070_231; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1158520...1158482, 1157593...1157545, DE 1157048...1155349, 1155276...1155171, 1153887...1153717, DE 1153344...1153233, 1153123...1153035, 1152808...1152676, DE 1152474...1152316, 1151850...1151502, 1151323...1151211, DE 1151117...1150967, 1150881...1150391, 1150318...1150117, DE 1149995...1148817]; Length: 5043. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 13 FIRST EXON; p-value: NaN. FT GENSCAN 14 29 INTERNAL EXON; p-value: NaN. FT GENSCAN 30 30 AA on splice site: g/aa -> E. FT GENSCAN 31 596 INTERNAL EXON; p-value: NaN. FT GENSCAN 597 631 INTERNAL EXON; p-value: NaN. FT GENSCAN 632 632 AA on splice site: g/ag -> E. FT GENSCAN 633 688 INTERNAL EXON; p-value: NaN. FT GENSCAN 689 689 AA on splice site: g/gc -> G. FT GENSCAN 690 725 INTERNAL EXON; p-value: NaN. FT GENSCAN 726 726 AA on splice site: tg/g -> W. FT GENSCAN 727 755 INTERNAL EXON; p-value: NaN. FT GENSCAN 756 756 AA on splice site: g/at -> D. FT GENSCAN 757 799 INTERNAL EXON; p-value: NaN. FT GENSCAN 800 800 AA on splice site: ag/a -> R. FT GENSCAN 801 852 INTERNAL EXON; p-value: NaN. FT GENSCAN 853 853 AA on splice site: ag/c -> S. FT GENSCAN 854 969 INTERNAL EXON; p-value: NaN. FT GENSCAN 970 1006 INTERNAL EXON; p-value: NaN. FT GENSCAN 1007 1007 AA on splice site: ag/c -> S. FT GENSCAN 1008 1057 INTERNAL EXON; p-value: NaN. FT GENSCAN 1058 1220 INTERNAL EXON; p-value: NaN. FT GENSCAN 1221 1221 AA on splice site: aa/a -> K. FT GENSCAN 1222 1288 INTERNAL EXON; p-value: NaN. FT GENSCAN 1289 1680 LAST EXON; p-value: NaN. SQ SEQUENCE 1680 AA; 187688 MW; 3B04AC471AB1E9F4 CRC64; MECLTKSCLI SSKLLRFLEK RFLADHYVAE DDGSLSLCNC DDEDSLFSYE TILNSQKVGD FLIAIAYFSI PIELVYFVSR TNVPSPYNWV VCEFIAFIVL CGMTHLLAGF TYGPHWPWVM TAVTVFKMLT GIVSFLTALS LVTLLPLLLK AKVREFMLSK KTRELDREVG IIMKQTETSL HVRMLTTKIR TSLDRHTILY TTLVELSKTL GLKNCAVWIP NEIKTEMNLT HELRPRIDDE NENEHFGGYA GFSIPISESD VVRIKRSEEV NMLSPGSVLA SVTSRGKSGP TVGIRVPMLR VCNFKGGTPE AIHMCYAILV CVLPLRQPQA WTYQELEIVK VVADQVAVAI SHAVILEESQ LMREKLAEQN RALQVARENA LRANQAKAAF EQMMSDAMRC PVRSILGLLP LILQDGKLPE NQTVIVDAMR RTSELLVQLV NNAGDINNGT IRAAETHYFS LHSVVKESAC VARCLCMANG FGFSAEVYRA LPDYVVGDDR KVFQAILHML GVLMNRKIKG NVTFWVFPES GNSDVSERKD IQEAVWRHCY SKEYMEVRFG FEVTAEGEES SSSSSGSNLE EEEENPSLNA CQNIVKYMQG NIRVVEDGLG LVKSVSVVFR FQLRRSMMSR GEAVDEDSGV GRSLEESSNG QHSQAGEALS EWRSSGQVEN GTPSTSPSYW DIDDDDDYGL KPSELYGQYT WKIPKFSEIT KREHRSNVFE AGGYKWYILI YPQGCDVCNH LSLFLCVANY DKLLPDTLHR FWKKEHDWGW KKFMELPKLK DGFIDESGCL TIEAKVQVIR ERVDRPFRCL DCGYRRELVR VYFQNVEQIC RRFVEEKRSK LGRLIEDKAR WTSFGVFWLG MDQNSRRRMC REKVDVILKG VVKHFFVEKE VSSTLVMDSL YSGLKALEGQ TKNMKARSRL LDAKQLPAPI VSVDKDMFVL VDDVLLLLER AALEPLPPKD EKGRQNRTKD GNDGEEVNKE ADERDERRLT ELGRRTVEIF ILSHIFSTKI EVAHQEAIAL KRQEELIREE EEAWLAETEQ RAKRGAAERE KKSKKKQAKQ KRNKNKGKDK RKEEKVSFAT HAKDLEENQN QNQNDEEEKD SVTEKAQSSA EKPDTLGDVS DISDSVDGSA DILQPDLEDR DSSSVLWDTD ALEIHPPSSE GSSRGRGISI STPNGITEGK SHSTMDDSSS TCSNDSIRSG VTNGSYQGNS LNFRNQKSPN KGKNQQVKAM TDAHSLASET DDQPSTLGTD PKGQNYSSEA SNVGESDWVV VSHIQEPEGS RNRIPVGRER KTVQSIVNSV DMDRPKEKST AVLSSPRNVA KNPSPLTQTK PEKKSISTAD GIPNRKVLAT GPPSSSQVVL PSDIQSQTVG LRADMQKLSA PKQPPATTIS RPSSAPIIPA MRPSPITVSS SVQTTTSLPR SVSSAGRLGP DPSLHNQQTY TPQSYKNAIV GNSLGSSSSS FNHHPSSHGV VPTTLPSSSY SQAPTSSYQS SFPYSQDGLL WTGRSPSSVN MGMYNNTYSP AVTSNRSLNH MDVQIAQQQA QSMMTDEFPH LDIINDLLED EQCSNMVYNG SIFNPQPQVF NGQYSSYHGE LLSGGRTRSF GEEGLHYMAR GPYGTDGMMP RQWQMTNMDL SLPAMRSNGM EDGTSSAANY HHSYFGLDAS NPSFTSGING YTEFRPSNGH // ID NC003070_232 HYPOTHETICAL; PRT; 196 AA. AC NC003070_232; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1164736...1164669, 1164085...1163971, DE 1163879...1163790, 1163703...1163605, 1161801...1161583]; Length: 591. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 22 FIRST EXON; p-value: NaN. FT GENSCAN 23 23 AA on splice site: at/g -> M. FT GENSCAN 24 61 INTERNAL EXON; p-value: NaN. FT GENSCAN 62 91 INTERNAL EXON; p-value: NaN. FT GENSCAN 92 124 INTERNAL EXON; p-value: NaN. FT GENSCAN 125 196 LAST EXON; p-value: NaN. SQ SEQUENCE 196 AA; 21887 MW; C2BF260F87A4A030 CRC64; MGFFSFLGRV LFASLFILSA WQMFNDFGTD GGPAAKELAP KLDLTKAHLS SILGVSLPNL EVKQVVWAIV ALKGLGGLLF VIGNIFGAYL LAVYLVVVSP ILYDFYNYGP EDRQFSLLLT EFLQINPCEA NWKVAIPLLS PTESPPQKPP AVMKREEQRW GKEAEKPPVF KKWQHPAAPF YYQPAPSSNQ PFAWPN // ID NC003070_233 HYPOTHETICAL; PRT; 199 AA. AC NC003070_233; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1165295...1165639, 1166283...1166537]; DE Length: 600. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 115 FIRST EXON; p-value: NaN. FT GENSCAN 116 199 LAST EXON; p-value: NaN. SQ SEQUENCE 199 AA; 22516 MW; A15D7976DBF47687 CRC64; METKEFDSYS ERKAFDETKT GVKGLIDAHI TEIPRIFCLP QGSLSDKKPF VSTTDFAIPI IDFEGLHVSR EDIVGKIKDA ASNWGFFQVI NHGVPLNVLQ EIQDGVRRFH EEAPELITND KVISVEHRVL ANRAATPRIS VASFFSTSMR PNSTVYGPIK ELLSEENPSK YRVIDLKEYT EGYFKKGLDG TSYLSHYKI // ID NC003070_234 HYPOTHETICAL; PRT; 52 AA. AC NC003070_234; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1167061...1167191, 1167229...1167256]; DE Length: 159. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 43 FIRST EXON; p-value: NaN. FT GENSCAN 44 44 AA on splice site: ag/t -> S. FT GENSCAN 45 52 LAST EXON; p-value: NaN. SQ SEQUENCE 52 AA; 5747 MW; 624FD70E98B05E7A CRC64; MLSSSSVRFD TDLKGESAAT KIYLGLYAFM HLGLYVFKPF GPHSSLLGFV FV // ID NC003070_235 HYPOTHETICAL; PRT; 381 AA. AC NC003070_235; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1168651...1167506]; Length: 1146. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 381 SINGLE EXON; p-value: NaN. SQ SEQUENCE 381 AA; 42617 MW; 6FAE42915BF3E796 CRC64; MDLTDRRNPF NNLVFPPPPP PPSTTFTSPI FPRTSSSGTN FPILAIAVIG ILATAFLLVS YYIFVIKCCL NWHQIDIFRR RRRSSDQNPL MIYSPHEVNR GLDESAIRAI PVFKFKKRDV VAGEEDQSKN SQECSVCLNE FQEDEKLRII PNCCHVFHID CIDIWLQGNA NCPLCRTSVS CEASFTLDLI SAPSSPRENS PHSRNRNLEP GLVLGGDDDF VVIELGASNG NNRESVRNID FLTEQERVTS NEVSTGNSPK SVSPLPIKFG NRGMYKKERK FHKVTSMGDE CIDTRGKDGH FGEIQPIRRS ISMDSSVDRQ LYLAVQEEIS RRNRQIPVAG DGEDSSSSGG GNSRVMKRCF FSFGSSRTSK SSSILPVYLE P // ID NC003070_236 HYPOTHETICAL; PRT; 74 AA. AC NC003070_236; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1169450...1169521, 1170713...1170865]; DE Length: 225. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 24 FIRST EXON; p-value: NaN. FT GENSCAN 25 74 LAST EXON; p-value: NaN. SQ SEQUENCE 74 AA; 8584 MW; FF76D9EB3407FA13 CRC64; MIDIEDLVMI RNGPRKIPEK DTRLNEVLRI IIMRHVDDSG QHSNGEMDQH SAARDAWSVT CVRQTIIDTR RSEI // ID NC003070_237 HYPOTHETICAL; PRT; 133 AA. AC NC003070_237; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1175176...1175577]; Length: 402. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 133 SINGLE EXON; p-value: NaN. SQ SEQUENCE 133 AA; 14744 MW; 31D9B9279811BC59 CRC64; MDQGGRSSGS GGGGAEQGKY RGVRRRPWGK YAAEIRDSRK HGERVWLGTF DTAEDAARAY DRAAYSMRGK AAILNFPHEY NMGTGSSSTA ANSSSSSQQV FEFEYLDDSV LDELLEYGEN YNKTHNINMG KRQ // ID NC003070_238 HYPOTHETICAL; PRT; 301 AA. AC NC003070_238; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1178383...1177920, 1177623...1177525, DE 1177429...1177182, 1176802...1176708]; Length: 906. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 154 FIRST EXON; p-value: NaN. FT GENSCAN 155 155 AA on splice site: ag/g -> R. FT GENSCAN 156 187 INTERNAL EXON; p-value: NaN. FT GENSCAN 188 188 AA on splice site: ag/g -> R. FT GENSCAN 189 270 INTERNAL EXON; p-value: NaN. FT GENSCAN 271 271 AA on splice site: g/gg -> G. FT GENSCAN 272 301 LAST EXON; p-value: NaN. SQ SEQUENCE 301 AA; 33860 MW; A1322F0B364DD7C6 CRC64; MVVKNSIKFN SQSERKSLEE TKVPPIFGLP PDALDDKKPT SDFAVPIIDF AGVHKSREAV VEKIKAAAEN WGIFQVINHG VPLSVLEEIQ NGVVRFHEED PEVKKSYFSL DLTKTFIYHN NFELYSSSAG NWRDSFVCYM DPDPSNPEDL PVACRTTSVV FKFFIKTVGL MSRLFPGHLS STSEISCRFS YSGCAAQLVQ LMTNDKFISV DHRVLTNRVG PRISIACFFS SSMNPNSTVY GPIKELLSEE NPPKYRDFTI PEYSKGYIEK GFRNDSRTSK NALVETITYL DETQGQHGSP I // ID NC003070_239 HYPOTHETICAL; PRT; 868 AA. AC NC003070_239; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1184895...1184753, 1183427...1183301, DE 1183197...1182690, 1182586...1182453, 1182322...1182080, DE 1181768...1181668, 1181480...1180716, 1180630...1180478, DE 1180109...1179677]; Length: 2607. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 47 FIRST EXON; p-value: NaN. FT GENSCAN 48 48 AA on splice site: tc/g -> S. FT GENSCAN 49 90 INTERNAL EXON; p-value: NaN. FT GENSCAN 91 259 INTERNAL EXON; p-value: NaN. FT GENSCAN 260 260 AA on splice site: g/ct -> A. FT GENSCAN 261 304 INTERNAL EXON; p-value: NaN. FT GENSCAN 305 385 INTERNAL EXON; p-value: NaN. FT GENSCAN 386 418 INTERNAL EXON; p-value: NaN. FT GENSCAN 419 419 AA on splice site: tg/c -> C. FT GENSCAN 420 673 INTERNAL EXON; p-value: NaN. FT GENSCAN 674 674 AA on splice site: tc/g -> S. FT GENSCAN 675 724 INTERNAL EXON; p-value: NaN. FT GENSCAN 725 725 AA on splice site: ag/c -> S. FT GENSCAN 726 868 LAST EXON; p-value: NaN. SQ SEQUENCE 868 AA; 97728 MW; 45C3387BD075B157 CRC64; MFDNGIGFVQ RGVDGAWMRV DVIGIVVEKR VEARSDSDNV FFTISFGSVC DEKEKKWKCT DIEIQRHVVK SISAFLDCFS RATANNRLIK DSISDIAGAL VFILGSKNRA VVGLAANVVI RLIRIVPPSI LHSYSLDLVE SLSPLLCCQQ FDVSLPCAVA LNAILVNVRE TKEKEVWKIL EDEKTVVSVV GNLQIFSEGS MSVEWFQEMA LLLSTIMLKW PQSRYSVWNN PALMGVLESV SQKPDMGLTV ATLKLYSSLA LCGHGANELL DNGKPMLDMM ISCMEESSSQ NARIEGLKLA QRLAATVRTM GKWFLSSGKL ELDQMSLLVE ACKLALITRW EGQHHIYFWK YRISEALLSL VVENFHSQSL DGYVSLEEEV LVAEKRVSIK SSANDDMFSL QIHIIEGPSY PVIYPRRRCW YQRNWENIGA SSFAPSSQSI TEKRICCWVC TEDWDNKDAF LLYALLALAE LVNHSFFGQN HAEELSMKSG NLKDRLCTTL KEIRDGTYGS GPRWYAAHIL SYFGYYGFEH KLGKRLMCAY EDEEYSDMRL LFASGNSASV NKVIIAVRCP MLLPPKEGAH SSSTISTEKS QRTVQEIRMS ANVDILALVK LLEFAYSGYV EVESTTLKKL KPLAKHCKAK VLLQMLCRRR PKWGSSIPEI DIPLALTPKL IHFSDVILVP KETNVACFNC RMCSLTSPHA HSHRVILSSG CEYLRALFRS GMQESHLDRL NVPVSWLGLT KLVSWFYSDE LPKPPSGCKW NNMDTEAKLD ELQAYVEIYS LSEWWIMEEL QNDCAHVILS CLESARELSI KTIELAASFS MWKLVEAAAN HAAPIYHQLR DSGELDELDD ELVNLIRTAA VQFSQQGG // ID NC003070_240 HYPOTHETICAL; PRT; 610 AA. AC NC003070_240; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1187894...1187588, 1187483...1187259, DE 1187163...1186451, 1186305...1185718]; Length: 1833. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 102 FIRST EXON; p-value: NaN. FT GENSCAN 103 103 AA on splice site: g/at -> D. FT GENSCAN 104 177 INTERNAL EXON; p-value: NaN. FT GENSCAN 178 178 AA on splice site: g/cg -> A. FT GENSCAN 179 415 INTERNAL EXON; p-value: NaN. FT GENSCAN 416 610 LAST EXON; p-value: NaN. SQ SEQUENCE 610 AA; 69199 MW; 286DF8DFC3150B8E CRC64; MDKKTIVWFR RDLRIEDNPA LAAAAHEGSV FPVFIWCPEE EGQFYPGRAS RWWMKQSLAH LSQSLKALGS DLTLIKTHNT ISAILDCIRV TGATKVVFNH LYDPVSLVRD HTVKEKLVER GISVQSYNGD LLYEPWEIYC EKGKPFTSFN SYWKKCLDMS IESVMLPPPW RLMPITAAAE AIWACSIEEL GLENEAEKPS NALLTRAWSP GWSNADKLLN EFIEKQLIDY AKNSKKVVGN STSLLSPYLH FGEISVRHVF QCARMKQIIW ARDKNSEGEE SADLFLRGIG LREYSRYICF NFPFTHEQSL LSHLRFFPWD ADVDKFKAWR QGRTGYPLVD AGMRELWATG WMHNRIRVIV SSFAVKFLLL PWKWGMKYFW DTLLDADLEC DILGWQYISG SIPDGHELDR LDNPALQGAK YDPEGEYIRQ WLPELARLPT EWIHHPWDAP LTVLKASGVE LGTNYAKPIV DIDTARELLA KAISRTREAQ IMIGAAPDEI VADSFEALGA NTIKEPGLCP SVSSNDQQVP SAVRYNGSKR VKPEEEEERD MKKSRGFDER ELFSTAESSS SSSVFFVSQS CSLASEGKNL EGIQDSSDQI TTSLGKNGCK // ID NC003070_241 HYPOTHETICAL; PRT; 332 AA. AC NC003070_241; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1191266...1191224, 1190904...1190746, DE 1190639...1190464, 1190306...1190029, 1189939...1189804, DE 1189711...1189600, 1189511...1189417]; Length: 999. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 14 FIRST EXON; p-value: NaN. FT GENSCAN 15 15 AA on splice site: g/ga -> G. FT GENSCAN 16 67 INTERNAL EXON; p-value: NaN. FT GENSCAN 68 68 AA on splice site: g/gt -> G. FT GENSCAN 69 126 INTERNAL EXON; p-value: NaN. FT GENSCAN 127 218 INTERNAL EXON; p-value: NaN. FT GENSCAN 219 219 AA on splice site: tg/g -> W. FT GENSCAN 220 264 INTERNAL EXON; p-value: NaN. FT GENSCAN 265 301 INTERNAL EXON; p-value: NaN. FT GENSCAN 302 302 AA on splice site: g/gc -> G. FT GENSCAN 303 332 LAST EXON; p-value: NaN. SQ SEQUENCE 332 AA; 35572 MW; C85D11E2BDFF556D CRC64; MAKEPVRVLV TGAAGQIGYA LVPMIARGIM LGADQPVILH MLDIPPAAEA LNGVKMELID AAFPLLKGVV ATTDAVEGCT GVNVAVMVGG FPRKEGMERK DVMSKNVSIY KSQAAALEKH AAPNCKVLVV ANPANTNALI LKEFAPSIPE KNISCLTRLD HNRALGQISE RLSVPVSDVK NVIIWGNHSS SQYPDVNHAK VQTSSGEKPV RELVKDDAWL DGEFISTVQQ RGAAIIKARK LSSALSAASS ACDHIRDWVL GTPEGTFVSM GVYSDGSYSV PSGLIYSFPV TCRNGDWSIV QGLPIDEVSR KKMDLTAEEL KEEKDLAYSC LS // ID NC003070_242 HYPOTHETICAL; PRT; 1549 AA. AC NC003070_242; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1191633...1191857, 1191938...1192033, DE 1192213...1192296, 1192412...1192572, 1192717...1192834, DE 1192924...1193059, 1193142...1193320, 1193459...1193673, DE 1194247...1194276, 1198293...1198325, 1198966...1199113, DE 1199196...1199360, 1199458...1199640, 1199709...1200303, DE 1200423...1200535, 1200642...1200862, 1200961...1201226, DE 1202975...1203015, 1203103...1203172, 1203271...1203419, DE 1203494...1203589, 1203697...1203767, 1203843...1203904, DE 1204043...1204106, 1204181...1204265, 1204392...1204518, DE 1204794...1204857, 1205070...1205250, 1205575...1205740, DE 1207356...1207861]; Length: 4650. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 75 FIRST EXON; p-value: NaN. FT GENSCAN 76 107 INTERNAL EXON; p-value: NaN. FT GENSCAN 108 135 INTERNAL EXON; p-value: NaN. FT GENSCAN 136 188 INTERNAL EXON; p-value: NaN. FT GENSCAN 189 189 AA on splice site: tg/g -> W. FT GENSCAN 190 228 INTERNAL EXON; p-value: NaN. FT GENSCAN 229 273 INTERNAL EXON; p-value: NaN. FT GENSCAN 274 274 AA on splice site: g/tt -> V. FT GENSCAN 275 333 INTERNAL EXON; p-value: NaN. FT GENSCAN 334 404 INTERNAL EXON; p-value: NaN. FT GENSCAN 405 405 AA on splice site: ag/a -> R. FT GENSCAN 406 414 INTERNAL EXON; p-value: NaN. FT GENSCAN 415 415 AA on splice site: ag/t -> S. FT GENSCAN 416 425 INTERNAL EXON; p-value: NaN. FT GENSCAN 426 426 AA on splice site: cg/c -> R. FT GENSCAN 427 475 INTERNAL EXON; p-value: NaN. FT GENSCAN 476 530 INTERNAL EXON; p-value: NaN. FT GENSCAN 531 591 INTERNAL EXON; p-value: NaN. FT GENSCAN 592 789 INTERNAL EXON; p-value: NaN. FT GENSCAN 790 790 AA on splice site: c/at -> H. FT GENSCAN 791 827 INTERNAL EXON; p-value: NaN. FT GENSCAN 828 900 INTERNAL EXON; p-value: NaN. FT GENSCAN 901 901 AA on splice site: tg/g -> W. FT GENSCAN 902 989 INTERNAL EXON; p-value: NaN. FT GENSCAN 990 990 AA on splice site: g/ga -> G. FT GENSCAN 991 1003 INTERNAL EXON; p-value: NaN. FT GENSCAN 1004 1026 INTERNAL EXON; p-value: NaN. FT GENSCAN 1027 1027 AA on splice site: a/ct -> T. FT GENSCAN 1028 1076 INTERNAL EXON; p-value: NaN. FT GENSCAN 1077 1108 INTERNAL EXON; p-value: NaN. FT GENSCAN 1109 1131 INTERNAL EXON; p-value: NaN. FT GENSCAN 1132 1132 AA on splice site: ag/g -> R. FT GENSCAN 1133 1152 INTERNAL EXON; p-value: NaN. FT GENSCAN 1153 1153 AA on splice site: g/ag -> E. FT GENSCAN 1154 1173 INTERNAL EXON; p-value: NaN. FT GENSCAN 1174 1174 AA on splice site: ag/c -> S. FT GENSCAN 1175 1202 INTERNAL EXON; p-value: NaN. FT GENSCAN 1203 1244 INTERNAL EXON; p-value: NaN. FT GENSCAN 1245 1245 AA on splice site: g/cc -> A. FT GENSCAN 1246 1265 INTERNAL EXON; p-value: NaN. FT GENSCAN 1266 1266 AA on splice site: aa/a -> K. FT GENSCAN 1267 1326 INTERNAL EXON; p-value: NaN. FT GENSCAN 1327 1381 INTERNAL EXON; p-value: NaN. FT GENSCAN 1382 1382 AA on splice site: a/aa -> K. FT GENSCAN 1383 1549 LAST EXON; p-value: NaN. SQ SEQUENCE 1549 AA; 175545 MW; E1C430C97482B232 CRC64; MASSYVTYST VTPVVSSSNI RGVFRLAFTR RRVTLSPNAR RRILRVSAKA STKNAMEYRK LGDSDLNISE VTMGTMTFGE QNTEKESHEM LSYAIEEGIN CIDTAEAYPI PMKKETQGKT DLYISSWLKS QQRDKIVLAT KVCGYSERSA YIRDSGEILR VDAANIKESV EKSLKRLGTD YIDLLQIHWP DRYVPLFGDF YYETSKWRPS VPFAEQLRAF QDLIVEGKVR YIGVSNETSY GVTEFVNTAK LEGLPKIVSI QNGYSLLVRC RYEVDLVEVC HPKNCNVGLL AYSPLGGGSL SGKYLATDQE ATKNARLNLF PGYMERYKGS LAKEATIQYV EVAKKYGLTP VELALGFVRD RPFVTSTIIG ATSVKQLKED IDAFLMTERP FSQEVMADID AVFKRSGGKT QQIISAQWLS LFSFGRQGAS ALEYGRSLRK LGSSYLSGDD DNGDTKQDDS VANAEDSLVV AKSFPVCDDR HSEIIPCLDR NFIYQMRLKL DLSLMEHYER HCPPPERRFN CLIPPPSGYK VPIKWPKSRD EVWKANIPHT HLAKEKSDQN WMVEKGEKIS FPGGGTHFHY GADKYIASIA NMLNFSNDVL NDEGRLRTVL DVGCGVASFG AYLLASDIMT MSLAPNDVHQ NQIQFALERG IPAYLGVLGT KRLPYPSRSF EFAHCSRCRI DWLQRDGLLL LELDRVLRPG GYFAYSSPEA YAQDEENLKI WKEMSALVER MCWRIAVKRN QTVVWQKPLS NDCYLEREPG TQPPLCRSDA DPDAVAGVSM EACITPYSKH DHKTKGSGLA PWPARLTSSP PRLADFGYST DMFEKDTELW KQQVDSYWNL MSSKVKSNTV RNIMDMKAHM GSFAAALKDK DVWVMNVVSP DGPNTLKLIY DRGLIGTNHN WCEAFSTYPR TYDLLHAWSI FSDIKSKGCS AEDLLIEMDR ILRPTGFVII RDKQSVVESI KKYLQALHWE TVASEKVNTS SELDQDSEDG VNVQTGEEVA VKLEPLRARH PQLHYESKLY MLLQGGTGIP HLKWFGVEGE FNCMVIDLLG PSMEEFFNYC SRSFSLKTVL MLADQMINRV EYMHVKGFLH RDIKPDNFLM GLGRKANQVY IIDYGLAKKY RDLQTHKHIP YRENKNLTGT ARYASVNTHL GIEQSRRDDL ESLGYLLMYF LRGSLPWQGL RAGTKKQKYD KISEKKRLTP VEVLCKNFPP EFTSYFLYVR SLRFEDKPDY SYLKRLFRDL FIREANFKTS YEYSSTICRQ SGKASKGNLD INKLFSELYT VGQDSRERFS GVFEAYTRRN GSGTGVQADQ SSRPRTSENV LASKDTPQKL HQREPLVTLP SRALSSSVLG TARGSDCLHF AARQVHSWII FYPSLFQKLL VKPQECAVCK RVFLSSHQLI SHYNAAHSNR QYSTFSSSPA AAAAAAPTTF RHYTNVNGRN PNPDFQARNQ FDVNYYRRGY LDDQGRFHKG SPHAPAMSPT RKYKFLLPKM PPTTMPETPK LMDLFPETSS GGTRTLPLLC QLDQWRPEDS AVAENGGANS SPIDLSLRL // ID NC003070_243 HYPOTHETICAL; PRT; 220 AA. AC NC003070_243; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1209005...1209059, 1209158...1209258, DE 1209346...1209852]; Length: 663. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 18 FIRST EXON; p-value: NaN. FT GENSCAN 19 19 AA on splice site: g/ac -> D. FT GENSCAN 20 52 INTERNAL EXON; p-value: NaN. FT GENSCAN 53 220 LAST EXON; p-value: NaN. SQ SEQUENCE 220 AA; 24257 MW; A15369C2ECFFDCA2 CRC64; MATVKGLLKG LRYITQIFDE EKDKDMQIGF PTDVKHVAHI GSDGPATNVP SWMGDFKPQE NENGQVVSRA DANNNQIGEG VGLQELLPPT DKPKHKKTRR KSETVSQNGS PPRRNSSASA SDMQPKNTRR HHRSRHGSID SSNDPSVRRR RVVSVTTNDM EGSYPLSDSS THSRKSTSRH RKPKGSGGGE LSMKKTKGKT ENPIVESVDT CNDNNISDKE // ID NC003070_244 HYPOTHETICAL; PRT; 1074 AA. AC NC003070_244; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1214590...1214164, 1214061...1213979, DE 1213896...1213229, 1213157...1212485, 1212407...1212221, DE 1212143...1211190, 1210550...1210318]; Length: 3225. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 142 FIRST EXON; p-value: NaN. FT GENSCAN 143 143 AA on splice site: g/aa -> E. FT GENSCAN 144 170 INTERNAL EXON; p-value: NaN. FT GENSCAN 171 392 INTERNAL EXON; p-value: NaN. FT GENSCAN 393 393 AA on splice site: ag/g -> R. FT GENSCAN 394 617 INTERNAL EXON; p-value: NaN. FT GENSCAN 618 679 INTERNAL EXON; p-value: NaN. FT GENSCAN 680 680 AA on splice site: g/gt -> G. FT GENSCAN 681 997 INTERNAL EXON; p-value: NaN. FT GENSCAN 998 998 AA on splice site: g/gt -> G. FT GENSCAN 999 1074 LAST EXON; p-value: NaN. SQ SEQUENCE 1074 AA; 120496 MW; 850F30FCABE81095 CRC64; MVRHSRRESF FDMVGFEVCP DTDLLWPFGK LDGLDRDEIR ETAYEIFFAA CRSSPGFGGR NALTFYSKHN AGDHQGDGIG GGGGSGSSNG SGFGSLGRKE VLTTPTSRVK RALGLKMLKR SPSRRMSTVG TVVGAVSAPS SPEIMRQQMK VTEQSDTRLR KTLMRTLVGQ TGRRAETIIL PLELLRHVKP SEFGDVHEYQ IWQRRQLKVL EAGLLIHPSI PLEKTNNFAM RLREIIRQSE TKAIDTSKNS DIMPTLCNLV ASLSWRNATP TTDICHWADG YPLNIHLYVA LLQSIFDIRD ETLVLDEIDE LLELMKKTWI MLGITRAIHN LCFTWVLFHQ YIVTSQMEPD LLGASHAMLA EVANDAKKSD REALYVKLLT STLASMQGWT EKRLLSYHDY FQRGNVGLIE NLLPLALSSS KILGEDVTIS QMNGLEKGDV KLVDSSGDRV DYYIRASIKN AFSKVIENMK AEIEETEEGE EEAATMLLRL AKETEDLALR ESECFSPILK RWHLVAAGVA SVSLHQCYGS ILMQYLAGRS TITKETVEVL QTAGKLEKVL VQMVAENSDE CEDGGKGLVR EMVPYEVDSI ILRLLRQWIE EKLQTVQECL SRAKEAETWN PKSKSEPYAQ SAGELMKLAN DAIEEFFEIP IGITEDLVHD LAEGLEKLFQ EYTTFVASCG SKQSYIPTLP PLTRCNRDSK FVKLWKKATP CAASGEELNQ MGEAPGGNHP RPSTSRGTQR LYIRLNTLHF LSSQLHSLNK SLSLNPRVLP ATRKRCRERT KSSSYFEFTQ AGIESACQHV SEVAAYRLIF LDSYSVFYES LYPGDVANGR IKPALRILKQ NLTLMTAILA DKAQALAMKE VMKASFEVVL TVLLAGGHSR VFCRTDHDLI EEDFESLKKV YCTCGEGLIP EEVVDREAET VEGVIQLMGQ PTEQLMEDFS IVTCESSGMG LVGTGQKLPM PPTTGRWNRS DPNTILRVLC YRDDRVANQF LKKSFQLGRI CPFTGKKANR ANKVSFSNHK TKKLQFVNLQ YKRVWWEAGK RFVILRLSTK ALKTIEKNGL EAVSKKAGID LRKK // ID NC003070_245 HYPOTHETICAL; PRT; 175 AA. AC NC003070_245; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1216054...1216121, 1216193...1216527, DE 1217132...1217256]; Length: 528. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 22 FIRST EXON; p-value: NaN. FT GENSCAN 23 23 AA on splice site: ag/c -> S. FT GENSCAN 24 134 INTERNAL EXON; p-value: NaN. FT GENSCAN 135 135 AA on splice site: g/at -> D. FT GENSCAN 136 175 LAST EXON; p-value: NaN. SQ SEQUENCE 175 AA; 19239 MW; 0E4ABF75D17FEB0C CRC64; MRHYILNRNR FFFLQAVNNV EASFSNLVLI VVDLDFFRLG RGGTSGNKFR MSLGLPVAAT VNCADNTGAK NLYIISVKGI KGRLNRLPSA CVGDMVMATV KKGKPDLRKK VLPAVIVRQR KPWRRKDGVF MYFEDNAGVI VNPKGEMKGS AITGPIGKEC ADLWPRIASA ANAIV // ID NC003070_246 HYPOTHETICAL; PRT; 770 AA. AC NC003070_246; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1225165...1225117, 1224365...1224193, DE 1223180...1222488, 1222364...1222173, 1222080...1221955, DE 1219351...1219070, 1219036...1218239]; Length: 2313. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 16 FIRST EXON; p-value: NaN. FT GENSCAN 17 17 AA on splice site: g/ac -> D. FT GENSCAN 18 74 INTERNAL EXON; p-value: NaN. FT GENSCAN 75 305 INTERNAL EXON; p-value: NaN. FT GENSCAN 306 369 INTERNAL EXON; p-value: NaN. FT GENSCAN 370 411 INTERNAL EXON; p-value: NaN. FT GENSCAN 412 505 INTERNAL EXON; p-value: NaN. FT GENSCAN 506 770 LAST EXON; p-value: NaN. SQ SEQUENCE 770 AA; 86041 MW; 42ECAEB382DE6658 CRC64; MVVRKTTYIE NDNSKHDITS STTTFKKPPS IIFPFSWGIF SLDSFPENHS QFGDHKEHIN MSQDVISSHE QLSMDEITSP LTAQIFDFCD SQLFQETFNQ TSEVTSASNG CGYVENNNTN NFPDKSNSGS NQDHEDNNDN ADLSIIFDSQ DDFDNDITAS IDFSSSIQFP ASDQLQEQFD FTGIQLHQPP NTLYSSSSGD LLPPPLSVFE EDCLSSVPSY NLGSINPSSP SCSFLGNTGL PTYMTVTGNM MNTGLGSGFY SGNIHLGSDF KPSHDQLMEI QADNGGLFCP DPIKPIFNPG DHHLQGLDGV ENQNHMVAQP VLPQLGTEIT GLDDPSFNKV GKLSAEQRKE KIHRYMKKRN ERNFSKKIKY ACRKTLADSR PRVRGRFAKN DEFGEPNRQA CSSHHEDDDD DEFLWHLAIE MEGQTHDAQN NVPEHDNPLM NVKNPSSNLF SKKDAQCLSV LCNGKWLQIV RSSPKVSCEA STCPSHILTK VGENYVAESE KMHKKGTLQF TMRANGTPHF VFKLENQKDV YVASLSSNVQ DQNSYMIHLQ RGESSASSSH LVGRINVSTL FSEKVLEREF VLFSSNGENL KIPRTRKNRG LSKKVVHAVK NERRTARLSR TSFIPDLGSW DEQFQAQNYD CLLKNKLPTN LETLAVVVKQ ETIEDEIGGW GLKFLKRSPM FQRSNDASET ETSTSSISMN VVIPSGIHGG PEDGPSSLIE RWKSQGNCDC GGWDLCCSLT LLKGQPRKDQ YFELFIEVKL SFISSNTSIL // ID NC003070_247 HYPOTHETICAL; PRT; 721 AA. AC NC003070_247; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1226748...1226760, 1226851...1226936, DE 1227086...1227157, 1227246...1227308, 1227579...1227665, DE 1227815...1227956, 1228045...1228166, 1228330...1228446, DE 1228705...1228800, 1228896...1228955, 1229053...1229127, DE 1229503...1229622, 1229716...1229793, 1229892...1229976, DE 1230064...1230115, 1231165...1231307, 1231909...1232271, DE 1232604...1232995]; Length: 2166. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 4 FIRST EXON; p-value: NaN. FT GENSCAN 5 5 AA on splice site: a/tt -> I. FT GENSCAN 6 33 INTERNAL EXON; p-value: NaN. FT GENSCAN 34 57 INTERNAL EXON; p-value: NaN. FT GENSCAN 58 78 INTERNAL EXON; p-value: NaN. FT GENSCAN 79 107 INTERNAL EXON; p-value: NaN. FT GENSCAN 108 154 INTERNAL EXON; p-value: NaN. FT GENSCAN 155 155 AA on splice site: g/gt -> G. FT GENSCAN 156 195 INTERNAL EXON; p-value: NaN. FT GENSCAN 196 234 INTERNAL EXON; p-value: NaN. FT GENSCAN 235 266 INTERNAL EXON; p-value: NaN. FT GENSCAN 267 286 INTERNAL EXON; p-value: NaN. FT GENSCAN 287 311 INTERNAL EXON; p-value: NaN. FT GENSCAN 312 351 INTERNAL EXON; p-value: NaN. FT GENSCAN 352 377 INTERNAL EXON; p-value: NaN. FT GENSCAN 378 405 INTERNAL EXON; p-value: NaN. FT GENSCAN 406 406 AA on splice site: g/tg -> V. FT GENSCAN 407 422 INTERNAL EXON; p-value: NaN. FT GENSCAN 423 423 AA on splice site: ag/c -> S. FT GENSCAN 424 470 INTERNAL EXON; p-value: NaN. FT GENSCAN 471 471 AA on splice site: a/at -> N. FT GENSCAN 472 591 INTERNAL EXON; p-value: NaN. FT GENSCAN 592 592 AA on splice site: g/gg -> G. FT GENSCAN 593 721 LAST EXON; p-value: NaN. SQ SEQUENCE 721 AA; 78029 MW; 88DE20DA0DA804BA CRC64; MNCAISGEVP EEPVVSKKSG LLYEKRLIQT HISDYGKCPV TGEPHTLDDI VPIKTGKIVK PKPLHTASIP GLLGTFQTEW DSLMLSNFAL EQQLHTARQE LSHALYQHDA ACRVIARLKK ERDESRQLLA EAERQLPAAP EVATSNAALS NGKRGIDDGE QGPNAKKMRL GISAEVITEL TDCNAALSQQ RKKRQIPKTL ASVDALEKFT QLSSHPLHKT NKPGIFSMDI LHSKDVIATG GIDTTAVLFD RPSGQILSTL TGHSKKVTSI KFVGDTDLVL TASSDKTVRI WGCSEDGNYT SRHTLKDHSA EVTDASENDV NYTAAAFHPD GLILGTGTAQ SIVKIWDVKS QANVAKFGGH NGEITSISFS ENGYFLATAA LDGVRLWDLR KLKNFRTFDF PDANSVEFDH SGSYLGIAAS DISSSLVIPL LVKFPKHVFG INNQVSARKF VSFSKPNFQS NQLLNQNVTQ NLNVVVKSAT TEYTTLIYKG CARQQFSDPS GLYSQALSAM FGSLVSQSTK TRFYKTTTGT STTTITGLFQ CRGDLSNHDC YNCVSRLPVL SDKLCGKTIA SRVQLSGCYL LYEVSGFSQI SGMEMLFKTC GKNNIAGTGF EERRDTAFGV MQNGVVSGHG FYATTYESVY VLGQCEGDVG DTDCSGCVKN ALEKAQVECG SSISGQIYLH KCFIAYSYYP NGVPRRSSSS SSSSSSSSSG SSNSDPSTST G // ID NC003070_248 HYPOTHETICAL; PRT; 310 AA. AC NC003070_248; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1235894...1235454, 1235356...1235184, DE 1235107...1234993, 1234757...1234603, 1234503...1234455]; Length: 933. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 147 FIRST EXON; p-value: NaN. FT GENSCAN 148 204 INTERNAL EXON; p-value: NaN. FT GENSCAN 205 205 AA on splice site: ag/c -> S. FT GENSCAN 206 243 INTERNAL EXON; p-value: NaN. FT GENSCAN 244 294 INTERNAL EXON; p-value: NaN. FT GENSCAN 295 295 AA on splice site: ag/c -> S. FT GENSCAN 296 310 LAST EXON; p-value: NaN. SQ SEQUENCE 310 AA; 34554 MW; 9516D4F50EF6F32F CRC64; MLKSEASLSI YCDSGDGFKS EDPVTGIEEN LERTVTIGDA IDGGEFSFAK HKEEDSGEGE RGVFEEVIKK LGIGVRDELG FEIERPPSPP MHLAAGLGID KFDLYGSEIK FDLPGYDDKN CGDYYKGMLE EYPLHPLLLK NYAKFLEYKG DLSGAEEYYH KCTVVEPSDG VALANYGRLV MKLHQDEAKA MSYFERAVQA SPDDSIVLAA YASFLWEINA DDDDEDDDED DDESSGQGKD EFEADAAGKG KSSLSKTEDG ETLCRYAKAF WSINNDHEKA LFYFEKAVEA SPNDSIILGE YARFLWEIDE // ID NC003070_249 HYPOTHETICAL; PRT; 838 AA. AC NC003070_249; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1237255...1238899, 1240762...1240783, DE 1240806...1240824, 1240997...1241342, 1241424...1241541, DE 1241615...1241676, 1243441...1243511, 1243607...1243742, DE 1243851...1243948]; Length: 2517. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 548 FIRST EXON; p-value: NaN. FT GENSCAN 549 549 AA on splice site: g/ga -> G. FT GENSCAN 550 555 INTERNAL EXON; p-value: NaN. FT GENSCAN 556 556 AA on splice site: ag/a -> R. FT GENSCAN 557 562 INTERNAL EXON; p-value: NaN. FT GENSCAN 563 677 INTERNAL EXON; p-value: NaN. FT GENSCAN 678 678 AA on splice site: g/gt -> G. FT GENSCAN 679 716 INTERNAL EXON; p-value: NaN. FT GENSCAN 717 717 AA on splice site: ag/a -> R. FT GENSCAN 718 737 INTERNAL EXON; p-value: NaN. FT GENSCAN 738 738 AA on splice site: g/ct -> A. FT GENSCAN 739 761 INTERNAL EXON; p-value: NaN. FT GENSCAN 762 806 INTERNAL EXON; p-value: NaN. FT GENSCAN 807 807 AA on splice site: g/ca -> A. FT GENSCAN 808 838 LAST EXON; p-value: NaN. SQ SEQUENCE 838 AA; 94169 MW; 37342C2A7F8D2D16 CRC64; MSMMFPSFQL LELNIISAQD LAPVARKTKT YAVAWVHSER KLTTRVDYNG GTNPTWNDKF VFRVNEEFLY ADTSAVVIEI YALHWFRDVH VGTVRVLISN LIPPNRRPGY RTSNNEYRRT PPPGMRFVAL QVRRTSGRPQ GILNIGVGLI DGSMRSMPLY THMDSSAVGY RDLLGEEDHH LQHLHLNSNK GSSKNPQSPS SRQYQSVISR PELRRTKSDT SSMVVSDLLS RAERSRLANR QPASAIVSSE SETLPTTTDS DEKKSSEYTP PSKNLRVPRQ RYNSIESDLI NPSPMENHHV VVSRRERHDV MPYSSYQQTG KTPRKKTRIE KQRSVKDYDR GRASPYLSKH GTPLRSNIIA STPMRSNGVG STPMRSNIIA MSPMHPNMVG STPMRTPMRS NMVGSTPMRS NIVGSTPIRS NYMATPMRTH HDFGTPVRNL AGRRILTESE LGPSPSEVAD KLAKDRSHET ESSILSEWSI DESSIEGLRS KLERWRTELP PLYDIGSSHI SSTDYDGASV PAATAGGGMS SRRKTPTTKK HNRRHTDGGA SPPRSRKLKS CDMITSEFRE PLSGNFLVCC GSQVVGWPPI GLHRMNSLVN NQAMKAARAE EGDGEKKVVK NDELKDVSMK VNPKVQGLGF VKVNMDGVGI GRKVDMRAHS SYENLAQTLE EMFFGMTGTT CREKVKPLRL LDGSSDFVLT YEDKEGDWML VGDVPWRMFI NSVKRLRIMG TSEASGLAQL RYIKEAEELR LKMQPLELIR RVREIEQEAS AGQETEQQKD VKQTTAVDLS KRLKDFRALN DASSLKALEE WRKRKMERAR QRDLEKTGGV SSSTKTSS // ID NC003070_250 HYPOTHETICAL; PRT; 1774 AA. AC NC003070_250; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1257509...1257374, 1256883...1255276, DE 1255124...1254865, 1254749...1254540, 1254437...1254219, DE 1254146...1253315, 1253180...1253011, 1252924...1252826, DE 1252733...1252646, 1252404...1252239, 1251668...1251461, DE 1251141...1251116, 1248513...1248350, 1248166...1248116, DE 1248002...1247205, 1247132...1246962, 1246239...1246121]; Length: 5325. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 45 FIRST EXON; p-value: NaN. FT GENSCAN 46 46 AA on splice site: g/gg -> G. FT GENSCAN 47 581 INTERNAL EXON; p-value: NaN. FT GENSCAN 582 582 AA on splice site: g/gt -> G. FT GENSCAN 583 668 INTERNAL EXON; p-value: NaN. FT GENSCAN 669 738 INTERNAL EXON; p-value: NaN. FT GENSCAN 739 811 INTERNAL EXON; p-value: NaN. FT GENSCAN 812 1088 INTERNAL EXON; p-value: NaN. FT GENSCAN 1089 1089 AA on splice site: g/gt -> G. FT GENSCAN 1090 1145 INTERNAL EXON; p-value: NaN. FT GENSCAN 1146 1178 INTERNAL EXON; p-value: NaN. FT GENSCAN 1179 1207 INTERNAL EXON; p-value: NaN. FT GENSCAN 1208 1208 AA on splice site: g/tg -> V. FT GENSCAN 1209 1262 INTERNAL EXON; p-value: NaN. FT GENSCAN 1263 1263 AA on splice site: aa/t -> N. FT GENSCAN 1264 1332 INTERNAL EXON; p-value: NaN. FT GENSCAN 1333 1340 INTERNAL EXON; p-value: NaN. FT GENSCAN 1341 1341 AA on splice site: ca/g -> Q. FT GENSCAN 1342 1395 INTERNAL EXON; p-value: NaN. FT GENSCAN 1396 1396 AA on splice site: g/at -> D. FT GENSCAN 1397 1412 INTERNAL EXON; p-value: NaN. FT GENSCAN 1413 1413 AA on splice site: g/tg -> V. FT GENSCAN 1414 1678 INTERNAL EXON; p-value: NaN. FT GENSCAN 1679 1679 AA on splice site: a/tt -> I. FT GENSCAN 1680 1735 INTERNAL EXON; p-value: NaN. FT GENSCAN 1736 1736 AA on splice site: g/cc -> A. FT GENSCAN 1737 1774 LAST EXON; p-value: NaN. SQ SEQUENCE 1774 AA; 195916 MW; BBDC9B33E2393E9D CRC64; MAGDDLVFAV NGEKFEVLSV NPSTTLLEFL RSNTCFKSVK LSCGEGGCGA CIVILSKYDP VLDQVEEYSI NSCLTLLCSL NGCSITTSDG LGNTEKGFHP IHKRFAGFHA SQCGFCTPGM CISLYSALSK AHNSQSSPDY LTALAAEKSI AGNLCRCTGY RPIADACKSF ASDVDIEDLG FNSFWRKGES REEMLKKLPP YNPEKDLITF PDFLKEKIKC QHNVLDQTRY HWSTPGSVAE LQEILATTNP GKDRGLIKLV VGNTGTGYYK EEKQYGRYID ISHIPEMSMI KKDDREIEIG AVVTISKVID ALMEENTSAY VFKKIGVHME KVANHFIRNS GSIGGNLVMA QSKSFPSDIT TLLLAADASV HMINAGRHEK LRMGEYLVSP PILDTKTVLL KVHIPRWIAS STTGLLFETY RAALRPIGSA LPYINAAFLA VVSHDASSSG IIVDKCRLAF GSYGGYHSIR AREVEDFLTG KILSHSVLYE AVRLLKGIIV PSIDTSYSEY KKSLAVGFLF DFLYPLIESG SWDSEGKHID GHIDPTICLP LLSSAQQVFE SKEYHPVGEA IIKFGAEMQA SGEAVYVDDI PSLPHCLHGA FIYSTKPLAW IKSVGFSGNV TPIGVLAVIT FKDIPEVGQN IGYITMFGTG LLFADEVTIS AGQIIALVVA DTQKHADMAA HLAVVEYDSR NIGTPVLSVE DAVKRSSLFE VPPEYQPEPV GDISKGMAEA DRKIRSVELR LGSQYFFYME TQTALALPDE DNCLVVYSST QAPEFTQTVI ATCLGIPEHN VRVITRRVGG GFGGKAIKSM PVATACALAA KKMQRPVRIY VNRKTDMIMA GGRHPLKITY SVGFRSDGKL TALDLNLFID AGSDVDVSLV MPQNIMNSLR KYDWGALSFD IKVCKTNLPS RTSLRAPGEV QGSYIAESII ENVASSLKMD VDVVRRINLH TYESLRKFYK QAAGEPDEYT LPLLWDKLEV SADFRRRAES VKEFNRCNIW RKRGISRVPI IHLVIHRPTP GKVSILNDGS VAVEVAGIEV GQGLWTKVQQ MVAYGLGMIK CEGSDDLLER IRLLQTDTLS MSQSSYTAGI TKLDNSPYID PDFKEISPYV KLQANAQSVD LSARTFYKPE SSSAEYLNYG VGASEVEVDL VTGRTEIIRS DIIYDCGKSL NPAVDLGQKI EGAFVQGIGF FMYEEYTTNE NGLVNEEVHC ATRSAIREAR KQYLSWNCID DDHRERCDLG FELPVPATMP VVKQLCGLES IENLCFFLIK FYSSLTKRTH RSRERQKVDD HPRQCHRIRL ITRLKISPPF LFSVSVFAFH DLYQIMDRQS EVVRILDSPN QQQQQQERQS NNNTLFMGEA RPRRATTFAC LSIRRRRNNN GVSEFEETAR FEQVGDVLYI GSGRRVPYIA IGVFLQVLAW GSMGIFQGAR EVLPSLVACV LLSNLGASIT EVAKDALVAE YGLRYRINGL QSYALMASAA GGVLGNLLGG YLLLTTPPKI SFLVFSALLS LQLVVSLSSK EESFGLPRIA ETSSVLESVK KQISNLKEAI QADEISQPLI WAVVSIAMVP LLSGSVFCYQ TQVLNLDPSV IGMSKVIGQL MLLCLTVVYD RYLKTLPMRP LIHIIQLLYG LSILLDYILV KQINLGFGIS NEVYVLCFSS LAEILAQFKI LPFAVRLAIS AFLGVGLANL IGITSSNYSN LSSGILIQSL AALAPLCFMH LVPMSEPVIE KEGKRAPKFG YVNIGHQPNR MQPGPKLEYS SDSLTSDRFI WILY // ID NC003070_251 HYPOTHETICAL; PRT; 444 AA. AC NC003070_251; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1261410...1261269, 1260977...1260941, DE 1260805...1260745, 1260625...1260524, 1260124...1260011, DE 1259870...1259727, 1259569...1259357, 1259218...1259026, DE 1258898...1258763, 1258289...1258097]; Length: 1335. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 47 FIRST EXON; p-value: NaN. FT GENSCAN 48 48 AA on splice site: g/gt -> G. FT GENSCAN 49 59 INTERNAL EXON; p-value: NaN. FT GENSCAN 60 60 AA on splice site: ag/a -> R. FT GENSCAN 61 80 INTERNAL EXON; p-value: NaN. FT GENSCAN 81 114 INTERNAL EXON; p-value: NaN. FT GENSCAN 115 152 INTERNAL EXON; p-value: NaN. FT GENSCAN 153 200 INTERNAL EXON; p-value: NaN. FT GENSCAN 201 271 INTERNAL EXON; p-value: NaN. FT GENSCAN 272 335 INTERNAL EXON; p-value: NaN. FT GENSCAN 336 336 AA on splice site: g/aa -> E. FT GENSCAN 337 380 INTERNAL EXON; p-value: NaN. FT GENSCAN 381 381 AA on splice site: gc/t -> A. FT GENSCAN 382 444 LAST EXON; p-value: NaN. SQ SEQUENCE 444 AA; 50364 MW; DDE5E80A8B4B85DD CRC64; MAGFSSLIPR TLLKRAVSSA TRNLTAPLAS ISAARNLFIG GGCDENYGEP GVFSHSNGIR KYNTVGNAGA LTGPFHHLLG VNQTNKPAFL RVQSMSYQFV ADSHSSPKRI VKNEDEEDFS DSSKKGNAEN PRKHQIGENI PKKDKIKFLV NTLLDIEDNK EAVYGALDAW VAWERNFPIA SLKIVIASLE KEHQWHRMVQ VIKWILSKGQ GNTMGTYGQL IRALDMDRRA EEAHVIWRKK VGNDLHSVPW QLCLQMMRIY FRNNMLQELV KLFKDLESYD RKPPDKHIVQ TVADAYELLG MLDEKERVVT KYSHLLLGTP SDDKPSRSSR KKKKPELRIP EATTEGAVDA AKAEIQEERK ENLDNHQESE AATEKQFEHG ATEMLRFQIC VRSLMAIRFS TKEGIFRDGL ICGKWEPNGM GRRKRKGLGD INQTSKAKVT FRFR // ID NC003070_252 HYPOTHETICAL; PRT; 1568 AA. AC NC003070_252; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1262122...1262124, 1262555...1262680, DE 1262896...1263039, 1263174...1263319, 1263411...1263567, DE 1263680...1263738, 1263832...1263991, 1264093...1264242, DE 1264309...1264445, 1264532...1264678, 1264759...1264871, DE 1265130...1265215, 1265290...1265391, 1265822...1265992, DE 1266313...1266422, 1266551...1266611, 1266814...1266991, DE 1267075...1267280, 1267394...1267513, 1267770...1267868, DE 1267995...1268282, 1268405...1268557, 1268672...1268821, DE 1268939...1269103, 1269191...1269330, 1269451...1269565, DE 1269644...1269664, 1269770...1269847, 1269936...1270106, DE 1270271...1270423, 1270571...1270747, 1270840...1271130, DE 1271677...1271733, 1271940...1272020, 1272100...1272176, DE 1272261...1272375]; Length: 4707. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 1 FIRST EXON; p-value: NaN. FT GENSCAN 2 43 INTERNAL EXON; p-value: NaN. FT GENSCAN 44 91 INTERNAL EXON; p-value: NaN. FT GENSCAN 92 139 INTERNAL EXON; p-value: NaN. FT GENSCAN 140 140 AA on splice site: ag/g -> R. FT GENSCAN 141 192 INTERNAL EXON; p-value: NaN. FT GENSCAN 193 211 INTERNAL EXON; p-value: NaN. FT GENSCAN 212 212 AA on splice site: ag/t -> S. FT GENSCAN 213 265 INTERNAL EXON; p-value: NaN. FT GENSCAN 266 315 INTERNAL EXON; p-value: NaN. FT GENSCAN 316 360 INTERNAL EXON; p-value: NaN. FT GENSCAN 361 361 AA on splice site: at/g -> M. FT GENSCAN 362 409 INTERNAL EXON; p-value: NaN. FT GENSCAN 410 410 AA on splice site: tg/g -> W. FT GENSCAN 411 447 INTERNAL EXON; p-value: NaN. FT GENSCAN 448 448 AA on splice site: g/tc -> V. FT GENSCAN 449 476 INTERNAL EXON; p-value: NaN. FT GENSCAN 477 510 INTERNAL EXON; p-value: NaN. FT GENSCAN 511 567 INTERNAL EXON; p-value: NaN. FT GENSCAN 568 603 INTERNAL EXON; p-value: NaN. FT GENSCAN 604 604 AA on splice site: ag/c -> S. FT GENSCAN 605 624 INTERNAL EXON; p-value: NaN. FT GENSCAN 625 683 INTERNAL EXON; p-value: NaN. FT GENSCAN 684 684 AA on splice site: g/ga -> G. FT GENSCAN 685 752 INTERNAL EXON; p-value: NaN. FT GENSCAN 753 792 INTERNAL EXON; p-value: NaN. FT GENSCAN 793 825 INTERNAL EXON; p-value: NaN. FT GENSCAN 826 921 INTERNAL EXON; p-value: NaN. FT GENSCAN 922 972 INTERNAL EXON; p-value: NaN. FT GENSCAN 973 1022 INTERNAL EXON; p-value: NaN. FT GENSCAN 1023 1077 INTERNAL EXON; p-value: NaN. FT GENSCAN 1078 1123 INTERNAL EXON; p-value: NaN. FT GENSCAN 1124 1124 AA on splice site: ag/g -> R. FT GENSCAN 1125 1162 INTERNAL EXON; p-value: NaN. FT GENSCAN 1163 1169 INTERNAL EXON; p-value: NaN. FT GENSCAN 1170 1195 INTERNAL EXON; p-value: NaN. FT GENSCAN 1196 1252 INTERNAL EXON; p-value: NaN. FT GENSCAN 1253 1303 INTERNAL EXON; p-value: NaN. FT GENSCAN 1304 1362 INTERNAL EXON; p-value: NaN. FT GENSCAN 1363 1459 INTERNAL EXON; p-value: NaN. FT GENSCAN 1460 1478 INTERNAL EXON; p-value: NaN. FT GENSCAN 1479 1505 INTERNAL EXON; p-value: NaN. FT GENSCAN 1506 1530 INTERNAL EXON; p-value: NaN. FT GENSCAN 1531 1531 AA on splice site: ag/c -> S. FT GENSCAN 1532 1568 LAST EXON; p-value: NaN. SQ SEQUENCE 1568 AA; 177528 MW; 867930E3897B44E8 CRC64; MAASAKVTVG SHVWVEDPDD AWIDGEVEEV NSEEITVNCS GKTVVAKLNN VYPKDPEFPE LGVDDMTKLA YLHEPGVLLN LKCRYNANEI YTYTGNILIA VNPFKRLPHL YGSETMKQYK GTAFGELSPH PFAVADSAYR KMINEGVSQA ILVSGESGAG KTESTKMLMQ YLAYMGGRAE SEGRSVEQQV LESNPVLEAF GNAKTVRNNN SSRFGKFVEI QFDQRGRISG AAIRTYLLER SRVCQVSDPE RNYHCFYMLC AAPEQETERY KLGKPSTFRY LNQSNCYALD GLDDSKEYLA TRKAMDVVGI NSEEQDGIFR VVAAILHLGN IEFAKGEESE ASEPKDEKSR FHLKVAAELF MCDGKALEDS LCKRVMVTRD ESITKSLDPD SAALGRDALA KIVYSKLFDW LVTKINNSIG QDPNSKHIIG VLDIYGFESF KTNRCLTVFS FSRIICSFEQ FCINLTNEKL QQHFNQHVFK MEQEEYTKEE IDWSYIEFID NQDVLDLIEK VTYQTELFLD KNKDYVVGEH QALLSSSDCS FVSSLFPPLP EESSKTSKFS SIGSQFKGVM EAIRISCAGY PTRKPFNEFL TRFRILAPET TKSSYDEVDA CKKLLAKVDL KGFQIGKTKV FLRAGQMAEM DAHRAEVLGH SARIIQRNVL TYQSRKKFLL LQAASTEIQA LCRGQVARVW FETMRREAAS LRIQKQARTY ICQNAYKTLC SSACSIQTGM RAKAARIELQ LRKKRRATII IQSQIRRCLC HQRYVRTKKA AITTQCGWRV KVARRELRNL KMAAKETGAL QDAKTKLENQ VEELTSNLEL EKQMRMEIEE AKSQEIEALQ SVLTDIKLQL RDTQETKSKE ISDLQSVLTD IKLQLRDTQE TKSKEISDLQ SALQDMQLEI EELSKGLEMT NDLAAENEQL KESVSSLQNK IDESERKYEE ISKISEERIK DEVPVIDQSA IIKLETENQK LKALVSSMEE KIDELDRKHD ETSPNITEKL KEDVSFDYEI VSNLEAENER LKALVGSLEK KINESGNNST DEQEEGKYIL KEESLTEDAS IDNERVKKLA DENKDLNDLV SSLEKKIDET EKKYEEASRL CEERLKQALD AETGLIDLKT SMQRLEEKVS DMETAEQIRR QQALVNSASR RMSPQVSFTG APPLENGHQE PLAPIPSRRF GTESFRRSRI ERQPHEFVDV LLKCVSKNIG FSHGKPVAAL TIYKCLMRWK IFEAEKTSIF DRIVPVFGSA IENQEDDNHL AYWLTNTSTL LFLLQRSLRQ QSSTGSSPTK PPQPTSFFGR MTQGFRSTSS PNLSTDVVQQ VDARYPALLF KQQLTAYVET MYGIIRENVK REVSSLLSSC IQSLKESSCD SSVVNSPSKS SEENLPAKSS EENSPKKSSE ENSPKESSGD KSPQKLSDDN SPSKEGQAVK SSEENSPASS WQSIIEFLNY ILITWKKNYV TEPKSTITYD DLTINLCSVL STEQLYRICT LCKDKDDGDH NVSPEVISNL KLLLTNEDEN SRSFLLDDDS SIPFDTDEIS SCMQEKDFAN VKSASELADN PNFLFLKE // ID NC003070_253 HYPOTHETICAL; PRT; 437 AA. AC NC003070_253; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1279523...1280236, 1280399...1280770, DE 1281103...1281330]; Length: 1314. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 238 FIRST EXON; p-value: NaN. FT GENSCAN 239 362 INTERNAL EXON; p-value: NaN. FT GENSCAN 363 437 LAST EXON; p-value: NaN. SQ SEQUENCE 437 AA; 48748 MW; 5507EDCD4B6789C9 CRC64; MYGNNNKKSI NITSMFQNLI PEGSDIFSRR CIWVNGPVIV GAGPSGLAVA AGLKREGVPF IILERANCIA SLWQNRTYDR LKLHLPKQFC QLPNYPFPDE FPEYPTKFQF IQYLESYAAN FDINPKFNET VQSAKYDETF GLWRVKTISN MGQLGSCEFE YICRWIVVAT GENAEKVVPD FEGLEDFGGD VLHAGDYKSG GRYQGKKVLV VGCGNSGMEV SLDLYNHGAN PSMVVRSAVH VLPREIFGKS TFELGVTMMK YMPVWLADKT ILFLARIILG NTDKYGLKRP KIGPLELKNK EGKTPVLDIG ALPKIRSGKI KIVPGIIKFG KGKVELIDGR VLEIDSVILA TGYRSNVPSW LKDNDFFSDD GIPKNPFPNG WKGEAGLYAV GFTRKGLFGA SLDAMSVAHD IANRWKEESK QQKKTAAARH RRCISHF // ID NC003070_254 HYPOTHETICAL; PRT; 359 AA. AC NC003070_254; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1286491...1286410, 1285637...1285515, DE 1285368...1285284, 1284857...1284650, 1284277...1284242, DE 1284161...1284100, 1283995...1283968, 1283872...1283798, DE 1283539...1283482, 1283321...1283257, 1283161...1283067, DE 1282773...1282632, 1282605...1282585]; Length: 1080. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 27 FIRST EXON; p-value: NaN. FT GENSCAN 28 28 AA on splice site: g/tt -> V. FT GENSCAN 29 68 INTERNAL EXON; p-value: NaN. FT GENSCAN 69 69 AA on splice site: g/ga -> G. FT GENSCAN 70 96 INTERNAL EXON; p-value: NaN. FT GENSCAN 97 97 AA on splice site: ag/a -> R. FT GENSCAN 98 166 INTERNAL EXON; p-value: NaN. FT GENSCAN 167 178 INTERNAL EXON; p-value: NaN. FT GENSCAN 179 198 INTERNAL EXON; p-value: NaN. FT GENSCAN 199 199 AA on splice site: ag/c -> S. FT GENSCAN 200 208 INTERNAL EXON; p-value: NaN. FT GENSCAN 209 233 INTERNAL EXON; p-value: NaN. FT GENSCAN 234 252 INTERNAL EXON; p-value: NaN. FT GENSCAN 253 253 AA on splice site: g/gt -> G. FT GENSCAN 254 274 INTERNAL EXON; p-value: NaN. FT GENSCAN 275 305 INTERNAL EXON; p-value: NaN. FT GENSCAN 306 306 AA on splice site: ag/t -> S. FT GENSCAN 307 353 INTERNAL EXON; p-value: NaN. FT GENSCAN 354 359 LAST EXON; p-value: NaN. SQ SEQUENCE 359 AA; 39931 MW; E5ADAA14ED246738 CRC64; MITVVTSRLS LLPPVFSVVN SSSSRSKVLV QSLEPVVHGR GRKADSLQDT YFGVHQEQLY ARKLKPVEGA QWTGIVTTIA IEMLKSNMVE AVVCVQRICI VTGFRSEASS VLRCGLPSAR YGLNILFCGR IFQVSPYRDS HSIVMQSDFL YCSIEISGAA SESGKVVQLK HLDGHIEEVP YFSLPANDLV DVIAPSCYSC FDYTNALADL VIGYMGVPKY SGLNMTDHPQ YITGDRRPFV TETVKADDAA KFGQGPAQPA PLFVGNIIAF ILNLVGPKGL EFARYSLDYH TIRNYLYVNR KWGKQSMLVI SKTISLVNLV RHRQEKMGRF GVFWQRTENK KQSKSAAESK STHVKVMSE // ID NC003070_255 HYPOTHETICAL; PRT; 356 AA. AC NC003070_255; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1290985...1290771, 1290222...1290151, DE 1290064...1289929, 1288830...1288183]; Length: 1071. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 71 FIRST EXON; p-value: NaN. FT GENSCAN 72 72 AA on splice site: ag/g -> R. FT GENSCAN 73 95 INTERNAL EXON; p-value: NaN. FT GENSCAN 96 96 AA on splice site: ag/g -> R. FT GENSCAN 97 141 INTERNAL EXON; p-value: NaN. FT GENSCAN 142 356 LAST EXON; p-value: NaN. SQ SEQUENCE 356 AA; 40876 MW; EA92AACD44B3D5E1 CRC64; MTEAMIRNKP GMASVKDMPL LQDGPPPGGF APVRYARRIS NTGPSAMAMF LAVSGAFAWG MYQVGQGNKI RRALKEEKYA ARRTILPILQ AEEDERFVSE WKKYLEYEAD VMKDVPGWKV GENVYNSGRW MPPATGELRP DVNFVKLNYK IIPWEASLRY ARQDAKEWQD AEEYTHRLSS KPDRVTSQEP TANLWTTPPI GWTKCNYDGT YHSNAPSKAG WLLRDDRGTF LGAAHAIGSI TTNPMESELQ ALVMAMQHCW SRGYRKIYFE GDNKEVSEIV NGRSSNFAVF NWIRDISAWR SKFEECKFTW TRRFSNMAAD ALAKQQLPLN TQFYCYNYIP FVITDFLHRD FATANY // ID NC003070_256 HYPOTHETICAL; PRT; 151 AA. AC NC003070_256; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1291349...1291804]; Length: 456. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 151 SINGLE EXON; p-value: NaN. SQ SEQUENCE 151 AA; 17018 MW; 6EC227EF0C0116E9 CRC64; MVGFKNRYML MEVFLDPDKD LLGEGTPIIL TQFNLSKAIK DSILVNFGEC GLGSSLGSFQ VKYVNPITKL CIVRSSREEH RQVWLAITLV KSIGNCPVIL NLLDISGCIR ACRDTALKCD KEKFEQCSKS LSEEEIRQMN TSLEKIKLLE N // ID NC003070_257 HYPOTHETICAL; PRT; 235 AA. AC NC003070_257; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1292540...1293247]; Length: 708. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 235 SINGLE EXON; p-value: NaN. SQ SEQUENCE 235 AA; 26422 MW; E0463EEFFAFE0F9F CRC64; MRSPRTLEVW KLGTVNYLKS LKLQEKLVSE RKAHQIPDTL LSLQHPPTYT LGKRRTDHNL LIPESELTKI GAELHYTQRG GDITFHGPHQ AILYPIISLR SIGFGARNYV ETLERSMIEF ASIYGVKARA GNKCETGVWV GDRKIGAIGV RISSGITSHG LALNIDPDMK YFEHIVPCGI ADKEVTSLRR ETDTLLPSEE VIHEQLVSCL AKAFSYDDVV WKEDPSLILD TQDKE // ID NC003070_258 HYPOTHETICAL; PRT; 99 AA. AC NC003070_258; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1294142...1293938, 1293620...1293526]; DE Length: 300. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 68 FIRST EXON; p-value: NaN. FT GENSCAN 69 69 AA on splice site: g/gt -> G. FT GENSCAN 70 99 LAST EXON; p-value: NaN. SQ SEQUENCE 99 AA; 11488 MW; 326798C3B240DB89 CRC64; MTNLIGGPPL TIHCKSKQDD LGIHVVPFKQ EYHFKFQPNL WKSTLFFCSF QWDSQFKSFD IYDAQRDQGL LLHTLLTKDV KMVNPIQHNS LPMSLGFMG // ID NC003070_259 HYPOTHETICAL; PRT; 936 AA. AC NC003070_259; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1298902...1298855, 1298201...1298121, DE 1298033...1297944, 1297854...1297705, 1297534...1295940, DE 1295774...1295536, 1295499...1294892]; Length: 2811. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 16 FIRST EXON; p-value: NaN. FT GENSCAN 17 43 INTERNAL EXON; p-value: NaN. FT GENSCAN 44 73 INTERNAL EXON; p-value: NaN. FT GENSCAN 74 123 INTERNAL EXON; p-value: NaN. FT GENSCAN 124 654 INTERNAL EXON; p-value: NaN. FT GENSCAN 655 655 AA on splice site: ct/a -> L. FT GENSCAN 656 734 INTERNAL EXON; p-value: NaN. FT GENSCAN 735 735 AA on splice site: c/ga -> R. FT GENSCAN 736 936 LAST EXON; p-value: NaN. SQ SEQUENCE 936 AA; 105859 MW; AFE31E9A2DEA1F2F CRC64; MEKNAMKLLE EIKSSDAILP VASKYLALDR PDCSHYFLAF AIKVSQWCAK HLNMSVMSME ESQEEEHSNI FFQLLLDYLR FSASSFTAIG KTCFMTDDAS AVTVHKFVSE QLNLTKELIM NSKKVESFSS EIFKAVQVVI DSTVRLCKEY SQTVNREVSE MKTSGHVGKA RMEEGNAVGN LVSMITLGVK SLSELGMLAA RDGGNLVAIL NTSWKGVITL LQLDKQTLVS KVDVGEIILK LISLIKDSLR FAAEAWSCSV KENISATEAR RVFLPVKFYL INAVKVVALF PSQASMVSKD IALCILMISA FKVSLSQQTH GKSASEVMTD LLEKTTVDLL GALLNAAELT QEFRLTLLDS LFVDEFSNQI CKKQSHDSHT KTSLVDILSL SVESATSARD LLLARVVLFQ SVMRYSFELD KDAKLAITTK LQWLLDILAD KEVYSSVLSS QLPMADGSGK IVIWESMYSA LLLSLKTLMI ILSSTPAWEE LETFLLQNLL HPHFLCWQIV MELWCFWVRH ATDDLVVDMI NQLCTFIMSM PSSETPLCPD SVLRRTTKSI CFLLTHSPKS LTVQVYKHIS TESRSDHAPD VYLALLLDGF PLNFLPDRIK NDAKRQIFAD FFNFIEKFDE KPSNSSRYTL LGAPVFTVSA CLRILKMSIS EIDAKTLNFV VALIQKYRNS KDETTKERYS EILSETLSII SRSEQLYTCQ EMDNVITELQ KLFNSETNHH HNHLRLSKYE MSETKKCPKS IAVWELYHML LRKRHWALVH HAVTAFGYFC ARTSCNQLWR FVPEDAALAF DIASGKEAKT ERFMSELKMF LEKEQALLSI TPSEEELELL SKEGTEVKAT VQKLLEGRSQ RSMEVEKRPN KKRKLPEGIC RGMELLQNGV KRINEGLNEL RSDENESEEF QKSLSNQFSC LEDLVSHLLS LTAASD // ID NC003070_260 HYPOTHETICAL; PRT; 212 AA. AC NC003070_260; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1301305...1300667]; Length: 639. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 212 SINGLE EXON; p-value: NaN. SQ SEQUENCE 212 AA; 18313 MW; 4C799597D46A58A4 CRC64; MSRALSVVCV LLAISFVCAR ARQVPGESDE GKTTGHDDTT TMPMHAKAAD QLPPKSVGDK KCIGGVAGVG GFAGVGGVAG VGGLGMPLIG GLGGIGKYGG IGGAAGIGGF HSIGGVGGLG GVGGGVGGLG GVGGGVGGLG GVGGLGGAGL GGVGGVGGGI GKAGGIGGLG GLGGAGGGLG GVGGLGKAGG IGVGGGIGGG HGVVGGVIDP HP // ID NC003070_261 HYPOTHETICAL; PRT; 126 AA. AC NC003070_261; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1301984...1302088, 1302160...1302285, DE 1302382...1302531]; Length: 381. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 35 FIRST EXON; p-value: NaN. FT GENSCAN 36 77 INTERNAL EXON; p-value: NaN. FT GENSCAN 78 126 LAST EXON; p-value: NaN. SQ SEQUENCE 126 AA; 13451 MW; 40C47037507E15A1 CRC64; MYASKLGKVL ISNGKTARSV PLYRTFVSAS PRPLQGKEEA EQCQKVKEAA EAVKEGAKQV KETTEYIQDV ASTTAGRVTK MTKDVTEKVT ETTDTITEKA KGSVSGVLGT AKNATDIIKN KILGGD // ID NC003070_262 HYPOTHETICAL; PRT; 409 AA. AC NC003070_262; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1312312...1312276, 1309925...1309783, DE 1306915...1306791, 1305464...1305011, 1304723...1304620, DE 1304510...1304372, 1304278...1304051]; Length: 1230. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 12 FIRST EXON; p-value: NaN. FT GENSCAN 13 13 AA on splice site: t/tt -> F. FT GENSCAN 14 60 INTERNAL EXON; p-value: NaN. FT GENSCAN 61 101 INTERNAL EXON; p-value: NaN. FT GENSCAN 102 102 AA on splice site: at/g -> M. FT GENSCAN 103 253 INTERNAL EXON; p-value: NaN. FT GENSCAN 254 287 INTERNAL EXON; p-value: NaN. FT GENSCAN 288 288 AA on splice site: ag/g -> R. FT GENSCAN 289 334 INTERNAL EXON; p-value: NaN. FT GENSCAN 335 409 LAST EXON; p-value: NaN. SQ SEQUENCE 409 AA; 46007 MW; E3BAF0EBCE736C7B CRC64; MNGLEKQIGF DAFKLRQKVL TSEPFDRIIN CSMTLDHHYV PNKNIYNVVL NMIIHFKSSK ILNFWISNDF LTNIRSDETE WNQHAVTNPD EVADEVLALT EMSVRNHTER RKLGYFTCGT GNPIDDCWRC DPNWHKNRKR LADCGIGFGR NAIGGRDGRF YVVTDPRDDN PVNPRPGTLR HAVIQDRPLW IVFKRDMVIQ LKQELIVNSF KTIDGRGANV HIANGGCITI QFVTNVIVHG LHIHDCKPTG NAMVMLLGHS DSYMRDKAMQ VTIAYNHFGV GLIQRMPRCR HGYFHVVNND YTHWEMYAIG GSANPTINSQ GNRYAAPKNP FAKEVTKRVD TPASHWKGWN WRSEGDLLQN GAYFTSSGAA ASGSYARASS LSAKSSSLVG HITSDAGALP CRRGRQCSS // ID NC003070_263 HYPOTHETICAL; PRT; 328 AA. AC NC003070_263; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1313661...1314028, 1314463...1314658, DE 1314779...1314922, 1315040...1315204, 1315306...1315419]; Length: 987. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 122 FIRST EXON; p-value: NaN. FT GENSCAN 123 123 AA on splice site: ag/g -> R. FT GENSCAN 124 188 INTERNAL EXON; p-value: NaN. FT GENSCAN 189 236 INTERNAL EXON; p-value: NaN. FT GENSCAN 237 291 INTERNAL EXON; p-value: NaN. FT GENSCAN 292 328 LAST EXON; p-value: NaN. SQ SEQUENCE 328 AA; 36539 MW; 5FAE34A3BA100ED0 CRC64; MQYKNLGKSG LKVSTLSFGA WVTFGNQLDV KEAKSILQCC RDHGVNFFDN AEVYANGRAE EIMGQAIREL GWRRSDIVIS TKIFWGGPGP NDKGLSRKHI VEGTKASLKR LDMDYVDVLY CHRPDASTPI EETVRAMNYV IDKGWAFYWG TSEWSAQQIT EAWGAADRLD LVGPIVEQPE YNMFARHKVE TEFLPLYTNH GIGLTTWSPL ASGVLTGKYN KGAIPSDSRF ALENYKNLAN RSLVDDVLRK VSGLKPIADE LGVTLAQLAI AWCASNPNVS SVITGATRES QIQENMKAVD VIPLLTPIVL DKIEQVIQSK PKRPESYR // ID NC003070_264 HYPOTHETICAL; PRT; 1042 AA. AC NC003070_264; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1316918...1318786, 1318866...1319186, DE 1319284...1319523, 1319604...1319767, 1319851...1319998, DE 1320087...1320209, 1320285...1320372, 1320477...1320652]; Length: 3129. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 623 FIRST EXON; p-value: NaN. FT GENSCAN 624 730 INTERNAL EXON; p-value: NaN. FT GENSCAN 731 810 INTERNAL EXON; p-value: NaN. FT GENSCAN 811 864 INTERNAL EXON; p-value: NaN. FT GENSCAN 865 865 AA on splice site: ag/a -> R. FT GENSCAN 866 914 INTERNAL EXON; p-value: NaN. FT GENSCAN 915 955 INTERNAL EXON; p-value: NaN. FT GENSCAN 956 984 INTERNAL EXON; p-value: NaN. FT GENSCAN 985 985 AA on splice site: g/gt -> G. FT GENSCAN 986 1042 LAST EXON; p-value: NaN. SQ SEQUENCE 1042 AA; 118008 MW; 925E883EF30D36A9 CRC64; MRMEFPGSSN QHLGRDRFNG EVGCGNNCSQ TGEEFSNEFL RDFGAQRRLQ HGGVNRNVEG NYNNRHLVYE DFNRILGLQR VDSNMSEGIN SSNGYFAESN VADSPRKMFQ TAISDVYLPE VLKLLCSFGG RILQRPGDGK LRYIGGETRI ISIRKHVGLN ELMHKTYALC NHPHTIKYQL PGEDLDALIS VCSDEDLLHM IEEYQEAETK AGSQRIRVFL VPSTESSESP KIFHERNMNI NRNTNQQTDI DHYQYVSALN GIVDVSPQKS SSGQSGTSQT TQFGNASEFS PTFHLRDSPT SVHTWEHKDS NSPTFMKPYG NTNAVHFMPK MQIPRNSFGQ QSPPTSPFSV HKRANTDVPY FADQNGFFDP YLAAPNFPQQ NRFFFETTTQ KQKHPEVNLH DRRPSDDIYP HGQAYIGAEK MTLKKNALSD PQLHDESQIN NGLEAFTKQP WKILRKNLRV VATSKWEDSD DIYFNNPEGK RCKELELTKE VPNSWINRDN NPDSFDQATK KQDGSNSNSS FSPNYFSPNH QPAAQITSSD SQDSGSSVFS LSVNTNENYL DCSREKFNGF QHDMSLDILI RSHTSATDQL CSTTKSSDKA DYSSPNTNFP VVFLRQEPMI PRHDLETNSD DSDTQKSLPR EESIHYSGLP LRKVGSRETT FMHTQGSDDF FKSKLLGPQL IVEDVTNEVI SDNLLSATIV PQVNRESDDD HKSYTREKEI TNADHESEME EKYKKSRNTD DSFSEAAMVE IEAGIYGLQI IKNTDLEDLH ELGSGTFGTV YYGKWRGTDV AIKRIKNSCF SGGSSEQARQ TKDFWREARI LANLHHPNVV AFYGVVPDGP GGTMATVTEY MVNGSLRHVL QRKDRLLDRR KKLMITLDSA FGMEYLHMKN IVHFDLKCDN LLVNLRDPQR PICKVGDFGL SRIKRNTLVS GGVRGTLPWM APELLNGSSN RVSEKVDVFS FGIVMWEILT GEEPYANLHC GAIIGGIVNN TLRPPVPERC EAEWRKLMEQ CWSFDPGVRP SFTEIVERLR SMTVALQPKR RT // ID NC003070_265 HYPOTHETICAL; PRT; 373 AA. AC NC003070_265; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1321940...1322017, 1322572...1322698, DE 1322787...1322893, 1323131...1323238, 1323322...1323399, DE 1323505...1323637, 1323731...1323827, 1323904...1324027, DE 1324133...1324241, 1324307...1324390, 1324479...1324555]; Length: 1122. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 26 FIRST EXON; p-value: NaN. FT GENSCAN 27 68 INTERNAL EXON; p-value: NaN. FT GENSCAN 69 69 AA on splice site: g/aa -> E. FT GENSCAN 70 104 INTERNAL EXON; p-value: NaN. FT GENSCAN 105 140 INTERNAL EXON; p-value: NaN. FT GENSCAN 141 166 INTERNAL EXON; p-value: NaN. FT GENSCAN 167 210 INTERNAL EXON; p-value: NaN. FT GENSCAN 211 211 AA on splice site: g/gg -> G. FT GENSCAN 212 242 INTERNAL EXON; p-value: NaN. FT GENSCAN 243 243 AA on splice site: ag/g -> R. FT GENSCAN 244 284 INTERNAL EXON; p-value: NaN. FT GENSCAN 285 320 INTERNAL EXON; p-value: NaN. FT GENSCAN 321 321 AA on splice site: g/ga -> G. FT GENSCAN 322 348 INTERNAL EXON; p-value: NaN. FT GENSCAN 349 349 AA on splice site: g/gt -> G. FT GENSCAN 350 373 LAST EXON; p-value: NaN. SQ SEQUENCE 373 AA; 39252 MW; 963E46126B69FBBD CRC64; MEKATERQRI LLRHLQPSSS SDASLSALIE KTNVNPSEVG DIVVGTVLGP GSQRASECRM AAFYAGFPET VPIRTVNRQC SSGLQAVADV AAAIKAGFYD IGKIVKKFEQ AHNCLLPMGI TSENVAHRFN VSREEQDQAA VDSHRKAASA TASGKFKDEI TPVKTKIVDP KTGDEKPITV SVDDGIRPNT TLSGLAKLKP VFKEDGTTTA GNSSQLSDGA GAVLLMRRNV AMQKGLPILG VFRTFSAVGV DPAIMGVGPA VAIPAAVKAA GLELNDVDLF EINEAFASQF VYCRNKLGLD AEKINVNGGA IAIGHPLGAT GARCVATLLH EMKRRGKDCR FGVVSMCIGS GMGAAAVFER GGGVDELCDV RKV // ID NC003070_266 HYPOTHETICAL; PRT; 1194 AA. AC NC003070_266; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1333425...1333236, 1332729...1332588, DE 1332508...1332388, 1332198...1332136, 1331999...1331869, DE 1331158...1330679, 1330630...1330473, 1330345...1330146, DE 1330035...1329763, 1329644...1329558, 1329419...1329318, DE 1329212...1329147, 1328881...1328721, 1328500...1328456, DE 1328127...1328033, 1327903...1327836, 1327639...1327571, DE 1327490...1327393, 1327305...1327224, 1327121...1327043, DE 1326814...1326636, 1326530...1326339, 1326252...1326193, DE 1326143...1325854, 1325688...1325589, 1325437...1325384]; Length: 3585. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 63 FIRST EXON; p-value: NaN. FT GENSCAN 64 64 AA on splice site: a/cc -> T. FT GENSCAN 65 110 INTERNAL EXON; p-value: NaN. FT GENSCAN 111 111 AA on splice site: gg/c -> G. FT GENSCAN 112 151 INTERNAL EXON; p-value: NaN. FT GENSCAN 152 172 INTERNAL EXON; p-value: NaN. FT GENSCAN 173 215 INTERNAL EXON; p-value: NaN. FT GENSCAN 216 216 AA on splice site: gg/a -> G. FT GENSCAN 217 375 INTERNAL EXON; p-value: NaN. FT GENSCAN 376 376 AA on splice site: ag/t -> S. FT GENSCAN 377 428 INTERNAL EXON; p-value: NaN. FT GENSCAN 429 429 AA on splice site: g/gt -> G. FT GENSCAN 430 495 INTERNAL EXON; p-value: NaN. FT GENSCAN 496 586 INTERNAL EXON; p-value: NaN. FT GENSCAN 587 615 INTERNAL EXON; p-value: NaN. FT GENSCAN 616 649 INTERNAL EXON; p-value: NaN. FT GENSCAN 650 671 INTERNAL EXON; p-value: NaN. FT GENSCAN 672 724 INTERNAL EXON; p-value: NaN. FT GENSCAN 725 725 AA on splice site: aa/g -> K. FT GENSCAN 726 739 INTERNAL EXON; p-value: NaN. FT GENSCAN 740 740 AA on splice site: ag/g -> R. FT GENSCAN 741 771 INTERNAL EXON; p-value: NaN. FT GENSCAN 772 772 AA on splice site: g/aa -> E. FT GENSCAN 773 794 INTERNAL EXON; p-value: NaN. FT GENSCAN 795 817 INTERNAL EXON; p-value: NaN. FT GENSCAN 818 849 INTERNAL EXON; p-value: NaN. FT GENSCAN 850 850 AA on splice site: cg/g -> R. FT GENSCAN 851 877 INTERNAL EXON; p-value: NaN. FT GENSCAN 878 903 INTERNAL EXON; p-value: NaN. FT GENSCAN 904 904 AA on splice site: g/tg -> V. FT GENSCAN 905 963 INTERNAL EXON; p-value: NaN. FT GENSCAN 964 1027 INTERNAL EXON; p-value: NaN. FT GENSCAN 1028 1047 INTERNAL EXON; p-value: NaN. FT GENSCAN 1048 1143 INTERNAL EXON; p-value: NaN. FT GENSCAN 1144 1144 AA on splice site: ag/g -> R. FT GENSCAN 1145 1177 INTERNAL EXON; p-value: NaN. FT GENSCAN 1178 1194 LAST EXON; p-value: NaN. SQ SEQUENCE 1194 AA; 135409 MW; 723345E290D508A5 CRC64; MAQQSLIYSF VARGTVILVE FTDFKGNFTS IAAQCLQKLP SSNNKFTYNC DGHTFNYLVE DGFTYCVVAV DSAGRQIPMS FLERVKEDFN KRYGGGKAAT AQANSLNKEF GSKLKEHMQY CMDHPDEISK LAKVKAQVSE VKGVMMENIE KVLDRGEKIE LLVDKTENLR SQAQDFRTTG TQMRRKMWLQ NMKIKLIVLA IIIALILIIV LSVCHGFRFS LLFTEHQIPT KFRYGQSYQR MEFDIPLPEE LELLEANSHF YEEEDEYLNF EEPPYPYPID GDEEKEEERV AHKEPHVRQS ESSDIKGCKR PRSLISDPIV NLDEVSPASD KRSKIDDNRV EIEDEDWLRF SPVKEVVHVM EEEEEVVIPQ ETMLSSNLVI GICRYASEID GECFPITAPD GGDRVYAKFC RALGDEEVNK LDVKDKSNGL IKDPISVLLQ QSEKEAFNKV LQASSEDQNE TISAETSVMH EKLWVDKYSP SSFTELLSDE QTNREVLLWL KQWDASVFGS EIRSTTEAVL SALKRHSTTS HHQKSDSAFT RKKQFNRWSK ESFGYSKNAE VSNTNTADIN DLWNKKSKLT GPPEQKILLL CGAPGLGKTT LAHIAAKHCG YRVVEINASD ERSASAIETR ILDVVQMNSV TADSRPKCLV IDEIDGALGD GKGAVDVILK MVLAERKHAT GKENVENVKT SSKKDRRTAP LSRPVICICN DLYAPALRPL RQIAKVHIFV QPTVSRVVNR LICVNDRLKY ICNMEGMKAR SFALSALAEY TECDIRSCLN TLQFLYKKKE TINVIDIGSQ VVGRKDMSKS LFDIWKEIFT TRKMKRERSN DASGSGAKNF DFLHSLVSSR GDYDLIFDGI HENILQLHYH DPVMDKTISC LDGLGTSDLL HRYIMRTQQM PLYVLLNIKC TRCRTLLVEK QESLRSWHHK IPPYIGRHLS IKSFVEDSIS PLLHILSPPT LRPVASHLLS DRQKEQLAGL VMLMCSYSLT YKNVKSDPVL SSLREDAASD ALVLALDPHL FDFINFKGHQ FKHHVLALAM KQVLVHELPE DLDCLIFKNM LKTQVEKQKI LQASGGKSGI LNKPEIKKIN QDLAKKTNAA ANESQRTPVT SKPPSVSVGT ATTSKPNSSD VKKASRNALN FFDRFRKSRK DYEDPEDVQN RATAKRDSRP LLFKFNEGFT NAVKRPVRMR EFLL // ID NC003070_267 HYPOTHETICAL; PRT; 220 AA. AC NC003070_267; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1334759...1334948, 1335331...1335472, DE 1335562...1335682, 1335767...1335829, 1335923...1336069]; Length: 663. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 63 FIRST EXON; p-value: NaN. FT GENSCAN 64 64 AA on splice site: a/cc -> T. FT GENSCAN 65 110 INTERNAL EXON; p-value: NaN. FT GENSCAN 111 111 AA on splice site: gg/g -> G. FT GENSCAN 112 151 INTERNAL EXON; p-value: NaN. FT GENSCAN 152 172 INTERNAL EXON; p-value: NaN. FT GENSCAN 173 220 LAST EXON; p-value: NaN. SQ SEQUENCE 220 AA; 24874 MW; F8D984AC0F118548 CRC64; MGQQSLIYSF VARGTVILAE YTEFKGNFTS VAAQCLQKLP SSNNKFTYNC DGHTFNYLAD NGFTYCVVVI ESAGRQIPMA FLERVKEDFN KRYGGGKAST AKANSLNKEF GSKLKEHMQY CADHPEEISK LSKVKAQVTE VKGVMMENIE KVLDRGEKIE LLVDKTENLR SQAQDFRTQG TKMKRKLWFE NMKIKLIVFG IIVALILIII LSVCHGFKCT // ID NC003070_268 HYPOTHETICAL; PRT; 302 AA. AC NC003070_268; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1337763...1337647, 1337555...1337343, DE 1337241...1337086, 1336985...1336563]; Length: 909. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 39 FIRST EXON; p-value: NaN. FT GENSCAN 40 110 INTERNAL EXON; p-value: NaN. FT GENSCAN 111 162 INTERNAL EXON; p-value: NaN. FT GENSCAN 163 302 LAST EXON; p-value: NaN. SQ SEQUENCE 302 AA; 34530 MW; C704BC79EB88C345 CRC64; MMMIQRRGGE RQDSSAAAYN VVHKLPHGDS PYVRAKHVQL VEKDAEAAIE LFWIAIKARD RVDSALKDMA LLMKQQNRAE EAIDAIQSFR DLCSRQAQES LDNVLIDLYK KCGRIEEQVE LLKQKLWMIY QGEAFNGKPT KTARSHGKKF QVTVEKETSR ILGNLGWAYM QLMDYTAAEA VYRKAQLIEP DANKACNLCT CLIKQGKHDE ARSILFRDVL MENKEGSGDP RLMARVQELL SELKPQEEEA AASVSVECEV GIDEIAVVEG LDEFVKEWRR PYRTRRLPIF EEILPLRDQL AC // ID NC003070_269 HYPOTHETICAL; PRT; 107 AA. AC NC003070_269; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1339946...1340112, 1340440...1340596]; DE Length: 324. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 55 FIRST EXON; p-value: NaN. FT GENSCAN 56 56 AA on splice site: aa/t -> N. FT GENSCAN 57 107 LAST EXON; p-value: NaN. SQ SEQUENCE 107 AA; 12484 MW; 0C645205AF82F376 CRC64; MVNLASEEEE EDGGDNINRL RDTAGEAALP SPASLAVSKA TSRSRRRPRK ALNRGNREFF VNPYRVKTLK FKTLEIFEHK RNRILRESEF IIEFLSFQYQ QLLLISR // ID NC003070_270 HYPOTHETICAL; PRT; 664 AA. AC NC003070_270; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1342964...1341339, 1341258...1340890]; DE Length: 1995. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 542 FIRST EXON; p-value: NaN. FT GENSCAN 543 664 LAST EXON; p-value: NaN. SQ SEQUENCE 664 AA; 74361 MW; B1E2B9198582F1FF CRC64; MASSTIDVTK YGHSPVHHAV VTRDYAGLKK LLSALPKMRD PSEVQNEAAS VAEETKADSI AAVIDRRDVV NRDTALHLAV KLGDETSAEM LMAAGADWSL QNEHGWSALQ EAICGREERI AMIIVRHYQP LAWAKWCRRL PRLVATMHRM RDFYMEITFH FESSVIPFIS RVAPSDTYKI WKRGANLRAD MTLAGFDGFR IQRSDQTILF LGDGSEDGKV PSGSLLMISH KDKEIMNALD GAGAAASEEE VRQEVAAMSK TSIFRPGIDV TQAVLFPQLT WRRQEKTEMV GQWKAKVYDM HNVVVSIKSR RVPGAMTDEE LFSNTNQEND TESEDLGDIL TEDEKRQLEL ALKLDSPEES SNGESSRISQ KQNSCSFEDR EIPVTDGNGY CKQEKKGWFS GWRKREEGHR RSSVPPRNSL CVDEKVSDLL GDDDSPSRGG RQIKPGRHST VETVVRNENR GLRDSSKAST SEGSGSSKRK EGNKENEYKK GLRPVLWLSE RFPLQTKELL PLLDILANKV KAIRRLRELM TTKLPSGTFP VKVAIPVIPT IRVLVTFTKF EELEAIEDEF VTPPSSPTSS VKNSPREETQ SSSNPSSSWF QWIKTPSQRP STSSSSGGFN IGKAENDQDP FAIPRGYNWI TAEEKKKKVQ EKNKAKKGKS SQNS // ID NC003070_271 HYPOTHETICAL; PRT; 716 AA. AC NC003070_271; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1345468...1346240, 1346328...1346838, DE 1346941...1347039, 1347237...1347437, 1347526...1347614, DE 1347704...1347803, 1347898...1348053, 1348099...1348320]; Length: 2151. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 257 FIRST EXON; p-value: NaN. FT GENSCAN 258 258 AA on splice site: ag/a -> R. FT GENSCAN 259 428 INTERNAL EXON; p-value: NaN. FT GENSCAN 429 461 INTERNAL EXON; p-value: NaN. FT GENSCAN 462 528 INTERNAL EXON; p-value: NaN. FT GENSCAN 529 557 INTERNAL EXON; p-value: NaN. FT GENSCAN 558 558 AA on splice site: ga/a -> E. FT GENSCAN 559 591 INTERNAL EXON; p-value: NaN. FT GENSCAN 592 643 INTERNAL EXON; p-value: NaN. FT GENSCAN 644 716 LAST EXON; p-value: NaN. SQ SEQUENCE 716 AA; 81088 MW; 7BE72145B3891C6E CRC64; MENSDIDMVI IPDTPDRSVH HREVKRRPHS PVAPLRYQRE EYRNHHLHGR ARPVPEIGDN RESSDTRTES GHRPRASVGN ALFRRTVVEK DKGKSISTDP CAPRVEKNPV LNLNQRNGHV HVAASRYQPS EDIRELRTSN GCSPLRGDHN SFVLPGNSNK GKEKADSGSV PHRETIDLSS GKPQNRGTKR LVRHGCISPH EIAARARQAA DTNSYDTLSV EQELASETAS SIGIREIVPE SDIHGRARGK RPEISSSRVS ILCINANRVA SRDGLEGWVS TRNRNLNMEH EMNHRDESNT RGICSSVTRL DVRETGVVER ESRQQRRRKN GFTTSTASNE PEVTVNRSSG EPSSSRPPRI QNHLRHWHGT QVLEIEDSSP EVRVFRGPRR VENDVSDVNI RQIEADEILA RELQEQLYRE ESLIRHEQID EIIARSMEQE ENSLRASSSR ASTRITRSSN TIAANPRGRS RLEARLQQHS SRRRFNPPQA RAPVRAPARG RGYRLGGASA SLRTALNFSF PIDMGLDSRM DILEELENAI GHSITSSNLL HMDRDFTEDD YELLLALDEN NHRHGGASAN RINNLPESTV QTDNFQETCV ICLETPKIGD TIRHLPCLHK FHKDVRIFTD SNENPLLRTE IRKDGVNHVR FANPQLLDKV FKFIGEFENY HFTLPNRSIS GIILRVSQTF LAIKFELIKR GFTTVYARGM IICVSL // ID NC003070_272 HYPOTHETICAL; PRT; 200 AA. AC NC003070_272; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1349236...1348634]; Length: 603. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 200 SINGLE EXON; p-value: NaN. SQ SEQUENCE 200 AA; 18046 MW; D11619C8EF93A8F5 CRC64; MGLIAGKVRV FVLVFALVTD FTMGEAEFGD EKPLFPHPHP HPLLHKKGFK KEFGDLGGGG GISGGGGFGA GGGWIGGSVG GFGGGIGGGF GGGGFGGGAG KGVDGGFGKG VDGGAGKGVD GGAGKGFDGG VGKGVDGGAG KGFDGGVGKG FEGGIGKGIE GGVGKGFDGG AGKGVDGGAI GGIGGGAGKE IGGGIGGGGH // ID NC003070_273 HYPOTHETICAL; PRT; 941 AA. AC NC003070_273; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1350315...1350330, 1350569...1350635, DE 1350832...1350901, 1351106...1351211, 1351525...1351633, DE 1351712...1351880, 1352070...1352530, 1352637...1352772, DE 1352992...1353123, 1353223...1353443, 1353540...1353798, DE 1353920...1354152, 1354239...1354737, 1354913...1355260]; Length: 2826. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 5 FIRST EXON; p-value: NaN. FT GENSCAN 6 6 AA on splice site: g/tt -> V. FT GENSCAN 7 27 INTERNAL EXON; p-value: NaN. FT GENSCAN 28 28 AA on splice site: aa/a -> K. FT GENSCAN 29 51 INTERNAL EXON; p-value: NaN. FT GENSCAN 52 86 INTERNAL EXON; p-value: NaN. FT GENSCAN 87 87 AA on splice site: t/cc -> S. FT GENSCAN 88 122 INTERNAL EXON; p-value: NaN. FT GENSCAN 123 123 AA on splice site: aa/a -> K. FT GENSCAN 124 179 INTERNAL EXON; p-value: NaN. FT GENSCAN 180 332 INTERNAL EXON; p-value: NaN. FT GENSCAN 333 333 AA on splice site: aa/a -> K. FT GENSCAN 334 378 INTERNAL EXON; p-value: NaN. FT GENSCAN 379 422 INTERNAL EXON; p-value: NaN. FT GENSCAN 423 495 INTERNAL EXON; p-value: NaN. FT GENSCAN 496 496 AA on splice site: ag/g -> R. FT GENSCAN 497 582 INTERNAL EXON; p-value: NaN. FT GENSCAN 583 659 INTERNAL EXON; p-value: NaN. FT GENSCAN 660 660 AA on splice site: ag/g -> R. FT GENSCAN 661 826 INTERNAL EXON; p-value: NaN. FT GENSCAN 827 941 LAST EXON; p-value: NaN. SQ SEQUENCE 941 AA; 102441 MW; EA588E3B591D2F13 CRC64; MVSSAVRVLH SKFFHSCVEL DLNALIHKES LYEDEEFDQH QRQLAALLAS KVFYYLGELN DSLSYALGAG SLFDVSEDSD YIHTLLSKAI DEYAILRSKA VESSEVVEID PRLVAIVERM LDKCITDGKY QQAMGIAIEC RRLDKLEEAI IKSENVQGTL SYCINVSHSF VNQREYRHEV LRLLVNVYQK LASPDYLSIC QCLMFLDEPQ GVASILEKLL RSENKDDALL AFQISFDLVQ NEHQAFLMSV RDRLPAPKTR PVEAIQAVET STAQNENTAG DVQMADETPS QTIVHETDPV DAVYAERLTK AKGILSGETS IQLTLQFLYS HNKSDLLILK TIKQSVEMRN SVCHSATIYA NAIMHAGTTV DTFLRENLGG AGGGGSPYSE GGALYALGLI HANHGEGIKQ FLRDSLRSTS VEVIQHGACL GLGLAALGTA DEDIYDDIKS VLYTDSAVAG EAAGISMGLL LVGTATDKAS EMLAYAHETQ HEKIIRGLAL GIALTVYGRE EGADTLIEQM TRDQDPIIRY GGMYALALAY SGTANNKAIR QLLHFAVSDV SDDVRRTAVL ALGFVLYSDP EQTPRIVSLL SESYNPHVRY GAALAVGISC AGTGLSEAIS LLEPLTSDVV DFVRQGALIA MAMVMVQISE ASDSRVGAFR RQLEKIILDK HEDTMSKMGA ILASGILDAG GRNVTIRLLS KTKHDKVTAV IGLTVFSQFW YWYPLIYFIS LAFSPTAFIG LNYDLKVPKF EFMSHAKPSL FEYPKPTTVA TANTAAKLPT AVLSTSAKAK AKAKKEAEQK AKAENSGNEA GKANAASDEK EAESMQVDST ATTVEKKVEP EATFEILVNP ARVVPSQEKY IKLMEDSRYV PMKLAPSGFV LLRDLRPHEP EVLSLTDAPT STASPAVGAE AAGQAQQAAT TSAMAIDDEP QPPQAFEYAS P // ID NC003070_274 HYPOTHETICAL; PRT; 2972 AA. AC NC003070_274; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1375597...1375382, 1375220...1375028, DE 1374846...1374741, 1374436...1374247, 1374101...1373916, DE 1373833...1373750, 1372280...1369645, 1369545...1369408, DE 1368769...1368538, 1367933...1367789, 1367707...1367612, DE 1367222...1367146, 1366885...1366806, 1366660...1366600, DE 1366469...1366390, 1366298...1366217, 1366127...1366038, DE 1365949...1365872, 1365730...1365611, 1364739...1363860, DE 1363698...1362958, 1361845...1361538, 1361409...1361317, DE 1361178...1361074, 1360535...1360437, 1360352...1360224, DE 1359898...1359719, 1359496...1359388, 1359250...1359126, DE 1357762...1357157, 1357073...1356420]; Length: 8919. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 72 FIRST EXON; p-value: NaN. FT GENSCAN 73 136 INTERNAL EXON; p-value: NaN. FT GENSCAN 137 137 AA on splice site: g/tt -> V. FT GENSCAN 138 171 INTERNAL EXON; p-value: NaN. FT GENSCAN 172 172 AA on splice site: at/g -> M. FT GENSCAN 173 235 INTERNAL EXON; p-value: NaN. FT GENSCAN 236 297 INTERNAL EXON; p-value: NaN. FT GENSCAN 298 325 INTERNAL EXON; p-value: NaN. FT GENSCAN 326 1203 INTERNAL EXON; p-value: NaN. FT GENSCAN 1204 1204 AA on splice site: ag/g -> R. FT GENSCAN 1205 1249 INTERNAL EXON; p-value: NaN. FT GENSCAN 1250 1250 AA on splice site: tg/g -> W. FT GENSCAN 1251 1327 INTERNAL EXON; p-value: NaN. FT GENSCAN 1328 1375 INTERNAL EXON; p-value: NaN. FT GENSCAN 1376 1376 AA on splice site: g/ag -> E. FT GENSCAN 1377 1407 INTERNAL EXON; p-value: NaN. FT GENSCAN 1408 1408 AA on splice site: g/gt -> G. FT GENSCAN 1409 1433 INTERNAL EXON; p-value: NaN. FT GENSCAN 1434 1459 INTERNAL EXON; p-value: NaN. FT GENSCAN 1460 1460 AA on splice site: ag/g -> R. FT GENSCAN 1461 1480 INTERNAL EXON; p-value: NaN. FT GENSCAN 1481 1506 INTERNAL EXON; p-value: NaN. FT GENSCAN 1507 1507 AA on splice site: cg/t -> R. FT GENSCAN 1508 1534 INTERNAL EXON; p-value: NaN. FT GENSCAN 1535 1564 INTERNAL EXON; p-value: NaN. FT GENSCAN 1565 1590 INTERNAL EXON; p-value: NaN. FT GENSCAN 1591 1630 INTERNAL EXON; p-value: NaN. FT GENSCAN 1631 1923 INTERNAL EXON; p-value: NaN. FT GENSCAN 1924 1924 AA on splice site: g/ga -> G. FT GENSCAN 1925 2170 INTERNAL EXON; p-value: NaN. FT GENSCAN 2171 2171 AA on splice site: a/ag -> K. FT GENSCAN 2172 2273 INTERNAL EXON; p-value: NaN. FT GENSCAN 2274 2304 INTERNAL EXON; p-value: NaN. FT GENSCAN 2305 2339 INTERNAL EXON; p-value: NaN. FT GENSCAN 2340 2372 INTERNAL EXON; p-value: NaN. FT GENSCAN 2373 2415 INTERNAL EXON; p-value: NaN. FT GENSCAN 2416 2475 INTERNAL EXON; p-value: NaN. FT GENSCAN 2476 2511 INTERNAL EXON; p-value: NaN. FT GENSCAN 2512 2512 AA on splice site: g/ga -> G. FT GENSCAN 2513 2553 INTERNAL EXON; p-value: NaN. FT GENSCAN 2554 2755 INTERNAL EXON; p-value: NaN. FT GENSCAN 2756 2972 LAST EXON; p-value: NaN. SQ SEQUENCE 2972 AA; 330950 MW; A0DC417293FC00E6 CRC64; MRSSQNGGAM GGRAAGTGGG GPSAPVDKEV DYAQYFCTYS FLYHQKDMLS DRVRMDAYFN AVFQNKHHFE GKTVLDVGTG SGILAIWSAQ AGARKVYAVE ATKMADHARA LVKANNLDHI VEVIEGSVED ISLPEKVDVI ISEWMGYFLL RESMFDSVIS ARDRWLKPTG VMYPSHARMW LAPIKSNIAD RKRNDFDGAM ADWHNFSDEI KSYYGVDMGV LTKPFAEEQE KYYIQTAMWN DLNPQQIIGT PTIVKEMDCL TASVSEIEEV RSNVTSVINM EHTRLCGFGG WFDVQFSGRK EDPAQQEIEL TTAPSEQHCT HWGQQKAKKK ARAPTKEIQT MEISKKVSEE PPSQAGEIAE GDVKAVKETQ ACVHFDKALN LEKVLDKIKS SRQIKCAECN EGVYGKRGTK AKGSKGKKDF SSSDPKSNNK AIWLCLECGC YVCGGVGLPN GPQSHVLRHS RVTRHRLVIQ WENPQLRWCF PCQLLLPVEK EDNGEKKDVL SEVVKLIKGR SLNNLASSDI EDQCSGSGSI TSDIKLEGAV TSDIEARDGY VVRGLVNLGN TCFFNSIMQN LLSLDRLRDH FLKENGSGVG GPLASSLRKL FTETKPEAGL KSVINPRAFF GSFCSKAPQF RGYDQHDSHE LLRCLLDSLS TEESALRKKR GVSDNDEKST TLIESVFGGE TSSIVSCMEC GHSSKVYEPF LDLSLPVPFK KSPPKKPQPV SRAKKAKLPP KRVPKNVSKV SKVSKVLPGM VLSELNSSGK SMAVTADSDT SCSSLAPLDN GPVLETPSVL TLDNNQASES ASQSDTGFDG SWLDFIGPET SGDETNLDMQ EDGIDNVITA EVNQIVPSPN IVANSSVSSG DQTLEGNTER LMQDYEEIAK AEANLDEKDV QAMQSDECPA TSGISAEFSQ ASCIGCDPGI GESSSSVNPW DEEELPLVVA DSQILYMPYK EISCNDKSVE GECEASSSFV TGDHEPQNSD FVDFGGLFDE PETTEGPVFG PPSKAEASGV GFMAFSSESD PEEIDDSDLP VSVERCLGHF TKHEILSDDN AWNCENCSKN LKLQRLREKR KSNEDESRSS NTSNGWVKEN EDEGFGETEI LAVKQDPNDT SCVKDHSSDG RKAARIHSAD ESESKGTQDE DEDSEKVITV KRDATKKVLI NKAPPVLTIH LKRFSQDLRG RLSKLNGHVA FKEVIDLRQY MDSRCSGEDP PVYRLAGLVE HSGTMRGGHY VAYVRGGQRV KETDSSSTAW ERKTIESRVC ETKSREWEIV AMAGVSLKCG DCGTLLKSVE EAQEHAELTS HSNFAESTEA VLNLVCTTCT KPCRSKIESD LHTKRTGHTE FVDKTLETIK PISLEAPKVA MEIDDNASGS GEAAEEMVVP DVDNNILEEL EAMGFPKARA TRALHYSGNA SLEAAVNWVV EHENDPDVDE MPKVPSNSNV GPAKPALTPE EVKLKAQELR ERARKKKEEE EKRMEREREK YLVQERIRIG KELLEAKRME EVNERKRLMF LRKAEKEEEK RAREKIRQKL EEDKAERRRK LGLPPEDPAT AAAKPSVPVV EEKKVTLPIR PATKTEQMRE CLRSLKQAHK EDDAKVKRAF QTLLTYMGNV AKNPDEEKFR KIRLTNQTFQ SPDYSLSIFR NSEERNPFVL NALIRGLTEN ARFESSVRHF ILMLRLGVKP DRLTFPFVLK SNSKLGFRWL GRALHAATLK NFVDCDSFVR LSLVDMYAKT GQLKHAFQVF EESPDRIKKE SILIWNVLIN GYCRAKDMHM ATTLFRSMPE RNSGSWSTLI KGYVDSGELN RAKQLFELMP EKNVVSWTTL INGFSQTGDY ETAISTYFEM LEKGLKPNEY TIAAVLSACS KSGALGSGIR IHGYILDNGI KLDRAIGTAL VDMYAKCGEL DCAATVFSNM NHKDILSWTA MIQGEKPDEV VFLAVLTACL NSSEVDLGLN FFDSMRLDYA IEPTLKHYVL VVDLLGRAGK LNEAHELVEN MPINPDLTTW AALYRACKAH KGYRRAESVS QNLLELDPEL CGSYIFLDKT HASKGNIQDV EKRRLSLQKR IKERSLGWSY IELDGQLNKF SAGDYSHKLT QEIGLKLDEI ISLAIQKGYN PGADWSIHDI EEEEKENVTG IHSEKLALTL GFLRTAPGTT IRIIKNLRIC GDCHSLMKYV KMVRKKVPEW LNSTMWSTPP PPSSYDDGLL RHSPVTKMKE EAESISVAPR LNSAPPPSSN TSVPSPSHRP RNGNSISGGS GEYGHSVGPS AEDFSRQAHV SAELSKKVIN MKELRSLALQ SLPDSPGIRS TVWKLLLGYL PPERSLWSTE LKQKRSQYKH YKDELLTSPD TETIEQIDRD VKRTHPDIPF FSGESSFARS NQESMKNILL VFAKLNQGIR YVQGMNEILA PIFYVFRNDP DEDSSSHAEA DAFFCFVELL SGFRDFYCQQ LDNSVVGIRS AITRLSQLVR KHDEELWRHL EITTKVNPQF YAFRWITLLL TQEFSFFDSL HIWDALLSDP EGKLTGDMLC DAGTGEEEAN RWGFHVEYEV TSTLSNYKHQ SSLPDGQMPS DKTVGGGDDA FNTFFSETGA GKHVPRAVFV DLEPTVIDEV RTGTYRQLFH PEQLISGKED AANNFARGHY TIGKEIVDLC LDRIRKLADN CTGLQGFLVF NAVGGGTGSG LGSLLLERLS VDYGKKSKLG FTVYPSPQVS TSVVEPYNSV LSTHSLLEHT DVSILLDNEA IYDICRRSLS IERPTYTNLN RLVSQVISSL TASLRFDGAL NVDVTEFQTN LVPYPRIHFM LSSYAPVISA EKAFHEQLSV AEITNSAFEP ASMMAKCDPR HGKYMACCLM YRGDVVPKDV NAAVGTIKTK RTIQFVDWCP TGFKCGINYQ PPTVVPGGDL AKVQRAVCMI SNSTSVAEVF SRIDHKFDLM YAKRAFVHWY VGEGMEEGEF SEAREDLAAL EKDYEEVGAE GGDDEDDEGE EY // ID NC003070_275 HYPOTHETICAL; PRT; 435 AA. AC NC003070_275; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1378224...1378112, 1377742...1377664, DE 1377509...1377353, 1377260...1377138, 1377034...1376619, DE 1376524...1376105]; Length: 1308. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 37 FIRST EXON; p-value: NaN. FT GENSCAN 38 38 AA on splice site: at/g -> M. FT GENSCAN 39 64 INTERNAL EXON; p-value: NaN. FT GENSCAN 65 116 INTERNAL EXON; p-value: NaN. FT GENSCAN 117 117 AA on splice site: g/ac -> D. FT GENSCAN 118 157 INTERNAL EXON; p-value: NaN. FT GENSCAN 158 158 AA on splice site: g/ga -> G. FT GENSCAN 159 296 INTERNAL EXON; p-value: NaN. FT GENSCAN 297 435 LAST EXON; p-value: NaN. SQ SEQUENCE 435 AA; 48729 MW; 66F847A15738E367 CRC64; MNNVCVTPEA TYEAVVADPR LFMTSLERLH SLLGTKFMVP IIGGRDLDLH KLFVEVTSRG GINKILNERR WKEVTATFVF PPTATNASYV LRKYYFSLLN NYEQIYFFRS NGQIPPDSMQ SPSARPCFIQ GAIRPSQELQ ALTFTPQPKI NTAEFLGGSL AGSNVVGVID GKFESGYLVT VTIGSEQLKG VLYQLLPQNT VSYQTPQQSH GVLPNTLNIS ANPQGVAGGV TKRRRRRKKS EIKRRDPDHP KPNRSGYNFF FAEQHARLKP LHPGKDRDIS RMIGELWNKL NEDEKLIYQG KAMEDKERYR TEMEDYREKK KNGQLISNAV PLQQRLPEQN VDMAEADLPI DEVEEDDEEG DSSGSSGESE PHDDQSIETD PELEEPSLNP SGPNLNPNPT EIVVAPKEKN GDVVMETSPL KKADEPTVAV TAEQN // ID NC003070_276 HYPOTHETICAL; PRT; 477 AA. AC NC003070_276; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1382803...1382350, 1382248...1381269]; DE Length: 1434. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 151 FIRST EXON; p-value: NaN. FT GENSCAN 152 152 AA on splice site: g/at -> D. FT GENSCAN 153 477 LAST EXON; p-value: NaN. SQ SEQUENCE 477 AA; 54844 MW; 1623726C7968702E CRC64; MRKCLHEDLL VDRASDRISA VDRSIKSKIP FGSITSDDDS NRFVELKLVR KNSGDISNAR VKDETEDLSR DEPNVINNKE LGRFDLMTVQ GMKGKRFVTR RSRNGLKNHR RSGRVNCASL RSISSSEAFD ETAKTNLMQP YDETCSNDDQ KDCGKAGEGT NLSHWPEFEK TVSVNQQVNS NSSAEQSFVR NVEKRSVRDL EELLKEERAA RATVCVELDK ERSAAASAAD EAMAMIHRLQ DEKAAIEMEA RQFQRLVEER STFDAEEMVI LKDILIRRER EKHFLEKEVE AYRQLLEETE ELECSLIKEK NVPEPEHKQN KDCQERRALL VQELDGTVLD MPYREEGNRD KNRDLYKSDS EVAYSRVRDV YMVKDETENI SKKKNLEESS VGKPKESLDE NSIIVSGIAR KLPPLCRPRK KSLSSSGSRR KSMSAVDYER LKIENEVELL RERLKAVQEE REELTRRASL PPLPSKV // ID NC003070_277 HYPOTHETICAL; PRT; 403 AA. AC NC003070_277; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1383911...1384039, 1384339...1384450, DE 1384658...1384740, 1384836...1384901, 1385400...1385500, DE 1385646...1385764, 1386093...1386148, 1386249...1386337, DE 1386409...1386568, 1386640...1386756, 1387023...1387067, DE 1387426...1387560]; Length: 1212. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 43 FIRST EXON; p-value: NaN. FT GENSCAN 44 80 INTERNAL EXON; p-value: NaN. FT GENSCAN 81 81 AA on splice site: g/gg -> G. FT GENSCAN 82 108 INTERNAL EXON; p-value: NaN. FT GENSCAN 109 130 INTERNAL EXON; p-value: NaN. FT GENSCAN 131 163 INTERNAL EXON; p-value: NaN. FT GENSCAN 164 164 AA on splice site: ac/t -> T. FT GENSCAN 165 203 INTERNAL EXON; p-value: NaN. FT GENSCAN 204 204 AA on splice site: a/ta -> I. FT GENSCAN 205 222 INTERNAL EXON; p-value: NaN. FT GENSCAN 223 251 INTERNAL EXON; p-value: NaN. FT GENSCAN 252 252 AA on splice site: ag/c -> S. FT GENSCAN 253 305 INTERNAL EXON; p-value: NaN. FT GENSCAN 306 344 INTERNAL EXON; p-value: NaN. FT GENSCAN 345 359 INTERNAL EXON; p-value: NaN. FT GENSCAN 360 403 LAST EXON; p-value: NaN. SQ SEQUENCE 403 AA; 45950 MW; D11BDDF8420544C9 CRC64; MAFVRYIPCR KIPRNVDQFE LPCLGSLRAF FSTQKLIGDE PVLVRDFIHT ALYDPIQGYF SQRSKSVGVL ERSIKFNQLE GRKAYMKLLE KVYKQSDISW FTPVELFKPW YAHGIAEAIL RTTNLSVPLK IYEIGGGSGT CAKGVLDYIM LNAPERIYKN MSYTSIEISP SLAKIQKETV AQVGSHLSKF RVECRDASDL AGWINFQTEN VEQQPCWVIM LEVLDNLPHD LVYSKSQLSP WMEVLVENKP ESEALSELYK PLEDPLIKRC IEIVEHEDDP VSKPKEIWSK LFPKPRRSWL PTGCLKLLEV LHAKLPKMSL IASDFSFLPD VKVPGERAPL VSTKKDGCSS DYSSYLDAKL DTSAFMDEFG LPSKTRTKDG YNPLLDDFKN TKFYLSVPTH NTK // ID NC003070_278 HYPOTHETICAL; PRT; 499 AA. AC NC003070_278; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1390883...1390776, 1390528...1390450, DE 1390333...1390263, 1390080...1389992, 1389703...1389580, DE 1389486...1389203, 1389009...1388880, 1388714...1388100]; Length: 1500. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 36 FIRST EXON; p-value: NaN. FT GENSCAN 37 62 INTERNAL EXON; p-value: NaN. FT GENSCAN 63 63 AA on splice site: c/ct -> P. FT GENSCAN 64 86 INTERNAL EXON; p-value: NaN. FT GENSCAN 87 115 INTERNAL EXON; p-value: NaN. FT GENSCAN 116 116 AA on splice site: ag/t -> S. FT GENSCAN 117 157 INTERNAL EXON; p-value: NaN. FT GENSCAN 158 251 INTERNAL EXON; p-value: NaN. FT GENSCAN 252 252 AA on splice site: gg/g -> G. FT GENSCAN 253 295 INTERNAL EXON; p-value: NaN. FT GENSCAN 296 499 LAST EXON; p-value: NaN. SQ SEQUENCE 499 AA; 56906 MW; 0A523AFE8FD103C4 CRC64; MVAKLSIGVI VLLICTLSLL FSANIGSNRE PTRPSKINVE ELWESAKSGG WRPSSAPRSD WPPPTKETNG YLRVRCNGGL NQQRSAICNA VLAARIMNAT LVLPELDANS FWHDDSGFQG IYDVEHFIET LKYDVKIVGK IPDVHKNGKT KKIKAFQIRP PRDAPIEWYL TTALKAMREH SAIYLTPFSH RLAEEIDNPE YQRLRCRVNY HALRFKPHIM KLSESIVDKL RSQGHFMSIH LRFEMDMLAF AGCFDIFNPE EQKILRKYRK ENFADKRLIY NERRAIGKCP LTPEEVGLIL RAMRFDNSTR IYLAAGELFG GEQFMKPFRT LFPRLDNHSS VDPSEELSAT SQGLIGSAVD YMVCLLSDIF MPTYDGPSNF ANNLLGHRLY YGFRTTIRPD RKALAPIFIA REKGKRAGFE EAVRRVMLKT NFGGPHKRVS PESFYTNSWP ECFCQMNPKK SSDKCPPNNV IEILDSRLES IRDPDSTSQT NSTVTGLER // ID NC003070_279 HYPOTHETICAL; PRT; 1356 AA. AC NC003070_279; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1398987...1398439, 1397576...1397469, DE 1397123...1397010, 1396626...1396521, 1395823...1395462, DE 1395376...1395144, 1395070...1395007, 1394925...1394230, DE 1394147...1394013, 1393926...1393795, 1393709...1393479, DE 1393191...1392256, 1392182...1392055, 1391949...1391673]; Length: 4071. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 183 FIRST EXON; p-value: NaN. FT GENSCAN 184 219 INTERNAL EXON; p-value: NaN. FT GENSCAN 220 257 INTERNAL EXON; p-value: NaN. FT GENSCAN 258 292 INTERNAL EXON; p-value: NaN. FT GENSCAN 293 293 AA on splice site: g/tt -> V. FT GENSCAN 294 413 INTERNAL EXON; p-value: NaN. FT GENSCAN 414 490 INTERNAL EXON; p-value: NaN. FT GENSCAN 491 491 AA on splice site: ag/t -> S. FT GENSCAN 492 512 INTERNAL EXON; p-value: NaN. FT GENSCAN 513 744 INTERNAL EXON; p-value: NaN. FT GENSCAN 745 789 INTERNAL EXON; p-value: NaN. FT GENSCAN 790 833 INTERNAL EXON; p-value: NaN. FT GENSCAN 834 910 INTERNAL EXON; p-value: NaN. FT GENSCAN 911 1222 INTERNAL EXON; p-value: NaN. FT GENSCAN 1223 1264 INTERNAL EXON; p-value: NaN. FT GENSCAN 1265 1265 AA on splice site: ag/g -> R. FT GENSCAN 1266 1356 LAST EXON; p-value: NaN. SQ SEQUENCE 1356 AA; 152470 MW; DBEF212ACFB22ABE CRC64; MSIPDPNSSA SLTVSPSLST ASETPVTPVN TVRPPPSQPP PAPPPLPPPT YRPIAPLRHP NPFQQQSAYS NNLYAHSIPV RRQIQDPSAV LYPFALPGRG FSARPVRGFV ADPSVTAGNL SGYPPRPSFT YDPGPYEQRQ MESLLQQFIR ERNPQIRPLP RLGLGSPVGL GPIRASPQFL QPRVRITEGS SSLYSLGRSW LKNGAHVGIQ VRTLYAAVRP QRSGIMKPLP KPLPVDLTTE TSVPDDPDEE SADEDKEITR RTVEEDQEVQ RKNNSYSGTI RRPMMLEERL KKVSYSPRNK EHRSQFSRET EDKLRMAGNE WINGYLEAIL DSQAQGIEET QQKPQASVNL REGDGQYFNP TKYFVEEVVT GVDETDLHRT WLKVVATRNS RERNSRLENM CWRIWHLTRK KKQLEWEDSQ RIANRRLERE QGRRDATEDL SEDLSEGEKG DGLGEIVQPE TPRRQLQRNL SNLEIWSDDK KENRLYVVLI SLHGLVRGEN MELGSDSDTG GQVKYVVELA RALARMPGVY RVDLFTRQIC SSEVDWSYAE PTEMLTTAED CDGDETGESS GAYIIRIPFG PRDKYLNKEI LWPFVQEFVD GALAHILNMS KVLGEQIGKG KPVWPYVIHG HYADAGDSAA LLSGALNVPM VLTGHSLGRN KLEQLLKQGR QSKEDINSTY KIKRRIEAEE LSLDAAELVI TSTRQEIDEQ WGLYDGFDVK LEKVLRARAR RGVNCHGRFM PRMAVIPPGM DFTNVEVQED TPEGDGDLAS LVGGTEGSSP KAVPTIWSEV MRFFTNPHKP MILALSRPDP KKNITTLLKA FGECRPLREL ANLTLIMGNR DDIDELSSGN ASVLTTVLKL IDKYDLYGSV AYPKHHKQSD VPDIYRLAAN TKGVFINPAL VEPFGLTLIE ALHNGLLVDP HDQEAIANAL LKLVSEKNLW HECRINGWKN IHLFSWPEHC RTYLTRIAAC RMRHPQWQTD ADEVAAQDDE FSLNDSLKDV QDMSLRLSMD GDKPSLNGSL EPNSADPVKQ IMSRMRTPEI KSKPELQGKK QSDNLGSKYP VLRRRERLVV LAVDCYDNEG APDEKAMVPM IQNIIKAVRS DPQMAKNSGF AISTSMPLDE LTRFLKSAKI QVSEFDTLIC SSGSEVYYPG GEEGKLLPDP DYSSHIDYRW GMEGLKNTVW KLMNTTAVGG EARNKGSPSL IQEDQASSNS HCVAYMIKDR SKVMRVDDLR QKLRLRGLRC HPMYCRNSTR MQIVPLLASR SQALRYLFVR WRLNVANMYV VVGDRGDTDY EELISGTHKT VIVKGLVTLG SDALLRSTDL RDDIVPSESP FIGFLKVDSP VKEITDIFKQ LSKATA // ID NC003070_280 HYPOTHETICAL; PRT; 242 AA. AC NC003070_280; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1400968...1401111, 1402066...1402650]; DE Length: 729. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 48 FIRST EXON; p-value: NaN. FT GENSCAN 49 242 LAST EXON; p-value: NaN. SQ SEQUENCE 242 AA; 27290 MW; B9509ABFCA78CBB8 CRC64; MKDSVCEECK QNPWKYKCPG CSIRSCALPC VKAHKQRTGC TGKRKFTDGA KAPFKELDIK APLRKQLAKV VILEYPVIHV YLPSQSYEFK VIKDFNTTPN PNDSLYDGHG CTNGITFREE EIEEDDIDSF EPEVLGLMKQ MNYNPCLRVS EKSKAEGVGT NNSNPQVDTT EQEDAGNMEL EFEQGLIDTY SDLFAEMNPG DYFNFECEFA KGLDSDDNCN LQNLDTDFIA DGLDLEEGEI VE // ID NC003070_281 HYPOTHETICAL; PRT; 801 AA. AC NC003070_281; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1410672...1410618, 1410317...1410184, DE 1410032...1409951, 1409822...1409755, 1409440...1409351, DE 1409004...1408930, 1408693...1408616, 1408199...1408047, DE 1407204...1407058, 1406957...1406871, 1406641...1406540, DE 1406446...1406338, 1406056...1405956, 1405879...1405742, DE 1405654...1405559, 1405233...1405109, 1404748...1404616, DE 1404325...1404200, 1404111...1403605]; Length: 2406. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 18 FIRST EXON; p-value: NaN. FT GENSCAN 19 19 AA on splice site: g/gt -> G. FT GENSCAN 20 63 INTERNAL EXON; p-value: NaN. FT GENSCAN 64 90 INTERNAL EXON; p-value: NaN. FT GENSCAN 91 91 AA on splice site: g/gc -> G. FT GENSCAN 92 113 INTERNAL EXON; p-value: NaN. FT GENSCAN 114 143 INTERNAL EXON; p-value: NaN. FT GENSCAN 144 168 INTERNAL EXON; p-value: NaN. FT GENSCAN 169 194 INTERNAL EXON; p-value: NaN. FT GENSCAN 195 245 INTERNAL EXON; p-value: NaN. FT GENSCAN 246 294 INTERNAL EXON; p-value: NaN. FT GENSCAN 295 323 INTERNAL EXON; p-value: NaN. FT GENSCAN 324 357 INTERNAL EXON; p-value: NaN. FT GENSCAN 358 393 INTERNAL EXON; p-value: NaN. FT GENSCAN 394 394 AA on splice site: g/tc -> V. FT GENSCAN 395 427 INTERNAL EXON; p-value: NaN. FT GENSCAN 428 473 INTERNAL EXON; p-value: NaN. FT GENSCAN 474 505 INTERNAL EXON; p-value: NaN. FT GENSCAN 506 546 INTERNAL EXON; p-value: NaN. FT GENSCAN 547 547 AA on splice site: ag/g -> R. FT GENSCAN 548 591 INTERNAL EXON; p-value: NaN. FT GENSCAN 592 633 INTERNAL EXON; p-value: NaN. FT GENSCAN 634 801 LAST EXON; p-value: NaN. SQ SEQUENCE 801 AA; 88835 MW; 5A95D0A3F43BE4AE CRC64; MAMQAGVQTS KVLILLGAGV SGSIVLRHGR LSDLIAQLQD LLNGAQGVES TPFKYDGALL AAQIRQLANE IKELTMTNPV TIFNGDSNSS GYASYLVPAA AVGAMGYCYM WWKSTRKHLS QKLATLDWKV EEQNETSKMI LSDVTEMRSS ISQIGFDFKQ LNEMISGIDV TLSGLWHLCQ VAGVKDSTST KVFQGLRFLT EGKEDANVIH KPVMAKEIMV MGEKTKVTAA GRSRVHRAFP GGISWWISSL GKMSIVPKET VEVIAQSIGI TNLLPEAALM LAPDVEYRVR EIMQEAIKCM RHSKRTTLTA SDVDGALNLR NVEPIYGFAS GGPFRFRKAI GHRDLFYTDD REVDFKDVIE APLPKAPLDT EIVCHWLAIE GVQPAIPENA PLEVIRAPAE TKIHEQKDGP LIDVRLPVKH VLSRELQLYF QKIAELAMSK SNPPLYKEAL VSLASDSGLH PLVPYFTNFI ADEVSNGLND FRLLFNLMHI VRSLLQNPHI HIEPYLHQLM PSVVTCLVSR KLGNRFADNH WELRDFAANL VSLICKRYGT VYITLQSRLT RTLVNALLDP KKALTQHYGA IQGLAALGHT VVRLLILSNL EPYLSLLEPE LNAEKQKNQM KIYEAWRVYG ALLRAAGLCI HGRLKIFPPL PSPSPSFLHK GKGKGKIIST DPHKRKLSVD SSENQSPQKR LITMDGPDGV HSQDQSGSAP MQVDNPVEND NPPQNSVQPS SSEQASDANE SESRNGKVKE SGRSRAITMK AILDQIWKDD LDSGRLLVKL HELYGDRILP FIPSTEMSVF L // ID NC003070_282 HYPOTHETICAL; PRT; 488 AA. AC NC003070_282; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1411215...1411622, 1411914...1412053, DE 1412159...1412510, 1412655...1412903, 1413016...1413141, DE 1413239...1413430]; Length: 1467. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 136 FIRST EXON; p-value: NaN. FT GENSCAN 137 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 183 AA on splice site: gg/g -> G. FT GENSCAN 184 300 INTERNAL EXON; p-value: NaN. FT GENSCAN 301 383 INTERNAL EXON; p-value: NaN. FT GENSCAN 384 425 INTERNAL EXON; p-value: NaN. FT GENSCAN 426 488 LAST EXON; p-value: NaN. SQ SEQUENCE 488 AA; 53494 MW; 885BAD5DAF0A80C9 CRC64; MDVGRCFLFL LLPSFFFLPS QTQSTDSFTS VLVSQNGLDF VKNLLVNKAI ASIIPLQIPR IEKSMKIPFL GGIDVVVSNL TIYELDVASS YVKLGETGVV IVASGTTCNL SMNWHYSYST WLPPIEISDQ GIASVQVQGM EIGLSLGLKS DEGGLKLSLS ECGCHVEDIT IELEGGASWF YQGMVNAFKD QIGSSVESTI AKKLTEGVSD LDSFLQSLPK EIPVDDNADL NVTFTSDPIL RNSSITFEID GLFTKGETNQ VLKSFFKKSV SLVICPGNSK MLGISVDEAV FNSAAALYYN ADFVQWVVDK IPEQSLLNTA RWRFIIPQLY KKYPNQDMNL NISLSSPPLV KISEQYVGAN VNADLVINVL DANQVIPVAC ISLMIRGSGA LRVMGNNLGG SVSLEDFSMS LKWSNIGNLH LHLLQPIVWT VIQTVFVPYA NDHLEKGFPL PIMHGFTLQN AEIICSESEI TVCSDVAYLD SSQQPQWL // ID NC003070_283 HYPOTHETICAL; PRT; 603 AA. AC NC003070_283; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1418290...1418174, 1418045...1417973, DE 1417389...1417325, 1416977...1416842, 1416778...1416749, DE 1416166...1415985, 1415832...1415674, 1415562...1415449, DE 1415374...1415196, 1415091...1414992, 1414821...1414645, DE 1414561...1414416, 1414301...1414155, 1414054...1413868]; Length: 1812. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 39 FIRST EXON; p-value: NaN. FT GENSCAN 40 63 INTERNAL EXON; p-value: NaN. FT GENSCAN 64 64 AA on splice site: g/gt -> G. FT GENSCAN 65 85 INTERNAL EXON; p-value: NaN. FT GENSCAN 86 130 INTERNAL EXON; p-value: NaN. FT GENSCAN 131 131 AA on splice site: g/gt -> G. FT GENSCAN 132 140 INTERNAL EXON; p-value: NaN. FT GENSCAN 141 141 AA on splice site: g/ag -> E. FT GENSCAN 142 201 INTERNAL EXON; p-value: NaN. FT GENSCAN 202 254 INTERNAL EXON; p-value: NaN. FT GENSCAN 255 292 INTERNAL EXON; p-value: NaN. FT GENSCAN 293 351 INTERNAL EXON; p-value: NaN. FT GENSCAN 352 352 AA on splice site: tg/g -> W. FT GENSCAN 353 385 INTERNAL EXON; p-value: NaN. FT GENSCAN 386 444 INTERNAL EXON; p-value: NaN. FT GENSCAN 445 492 INTERNAL EXON; p-value: NaN. FT GENSCAN 493 493 AA on splice site: gg/c -> G. FT GENSCAN 494 541 INTERNAL EXON; p-value: NaN. FT GENSCAN 542 542 AA on splice site: aa/a -> K. FT GENSCAN 543 603 LAST EXON; p-value: NaN. SQ SEQUENCE 603 AA; 65649 MW; A6DAC1EC7F96E1A1 CRC64; MNRLRGRGTP ILGSALVPQL KKKALNSLVA VQDSYLSTKD LFERHRVVFT VGTSIASVAT AWIGYSLRHY NETRINQRLE SIENAMKNTQ ELERGELKKL VDPVGSRFTT TIATAGTTLI LGFVSGKTVI GMGWVGEAGY ESLFCVLVEK KEREETMERK MYKSTVFPIC CLLFALFDRG NALYGSSSPV LQLTPSNFKS KVLNSNGVVL VEFFAPWCGH CQSLTPTWEK VASTLKGIAT VAAIDADAHK SVSQDYGVRG FPTIKVFVPG KPPIDYQGAR DAKSISQFAI KQIKALLKDR LDGKTSGTKN GGGSSEKKKS EPSASVELNS SNFDELVTES KELWIVEFFA PWCGHCKKLA PEWKKAANNL KGKVKLGHVN CDAEQSIKSR FKVQGFPTIL VFGSDKSSPV PYEGARSASA IESFALEQLE SNAGPAEVTE LTGPDVMEDK CGSAAICFVS FLPDILDSKA EGRNKYLEML LSVADKFKKD PYGFVWVAAG KQPDLEKRVG VGGYGYPAMV ALNAKKGAYA PLKSGFEVKH LKDFVKEAAK GGKGNLPIDG TMEIVKTEAW DGKDGEVVDA DEFSLEDLMG NDDEASTESK DDL // ID NC003070_284 HYPOTHETICAL; PRT; 412 AA. AC NC003070_284; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1421441...1421378, 1421129...1420972, DE 1420880...1420824, 1420563...1420447, 1420408...1420103, DE 1420031...1419786, 1419657...1419367]; Length: 1239. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 21 FIRST EXON; p-value: NaN. FT GENSCAN 22 22 AA on splice site: g/at -> D. FT GENSCAN 23 74 INTERNAL EXON; p-value: NaN. FT GENSCAN 75 93 INTERNAL EXON; p-value: NaN. FT GENSCAN 94 132 INTERNAL EXON; p-value: NaN. FT GENSCAN 133 234 INTERNAL EXON; p-value: NaN. FT GENSCAN 235 316 INTERNAL EXON; p-value: NaN. FT GENSCAN 317 412 LAST EXON; p-value: NaN. SQ SEQUENCE 412 AA; 45655 MW; BCA469F12E0E9955 CRC64; MSDTQHVQSS LVSIRSSDKI EDAFRKMKVN ETGVEELNPY PDRPGERDCQ FYLRTGLCGY GSSCRYNHPT HLPQDVAYYK EELPERIGQP DCEYFLKTGA CKYGPTCKYH HPKDRNGAQP VMFNVIGLPM RLSNVEDLFE FLLQGEKPCP YYLRTGTCRF GVACKFHHPQ PDNGHSTAYG MSSFPAADLR YASGLTMMST YGTLPRPQVP QSYVPILVSP SQGFLPPQGW APYMAASNSM YNVKNQPYYS GSSASMAMAV ALNRGLSESS DQPECRFFMN TGTCKYGDDC KYSHPGVRIS QPPPSLINPF VLPARPGQPA CGNFRSYGFC KFGPNCKFDH PMLPYPGLTM ATSLPTPFAS PVTTHQRISP TPNRSDSKSL SNGKPDVKKE SSETEKPDNG EVQDLSEDAS SP // ID NC003070_285 HYPOTHETICAL; PRT; 515 AA. AC NC003070_285; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1423404...1423464, 1425576...1425861, DE 1425902...1426108, 1426188...1426329, 1427898...1427955, DE 1428070...1428113, 1428207...1428388, 1428562...1428574, DE 1429399...1429857, 1430290...1430385]; Length: 1548. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 20 FIRST EXON; p-value: NaN. FT GENSCAN 21 21 AA on splice site: t/gt -> C. FT GENSCAN 22 115 INTERNAL EXON; p-value: NaN. FT GENSCAN 116 116 AA on splice site: cg/a -> R. FT GENSCAN 117 184 INTERNAL EXON; p-value: NaN. FT GENSCAN 185 185 AA on splice site: ag/t -> S. FT GENSCAN 186 232 INTERNAL EXON; p-value: NaN. FT GENSCAN 233 251 INTERNAL EXON; p-value: NaN. FT GENSCAN 252 252 AA on splice site: g/at -> D. FT GENSCAN 253 266 INTERNAL EXON; p-value: NaN. FT GENSCAN 267 326 INTERNAL EXON; p-value: NaN. FT GENSCAN 327 327 AA on splice site: ag/t -> S. FT GENSCAN 328 331 INTERNAL EXON; p-value: NaN. FT GENSCAN 332 484 INTERNAL EXON; p-value: NaN. FT GENSCAN 485 515 LAST EXON; p-value: NaN. SQ SEQUENCE 515 AA; 58776 MW; 8E587A5639246FCD CRC64; MHEKIKISMI WKNDNNGRNL CRRYVTTISI TRLSSPKDST FSLFPNSEMK LVEKTTTTEQ DNGEDFCRTI IEVSEVNRNV FQAPGGEADP FRVVSGEELH LIPPLNFSMV DNGIFRLSVS AQSCKSSFHI HDILYTIYQT FEIADFRFFR ICEDTCARSP TQRAISSSLN PMESGFSSLV LKATSSLGEF SEWKNHILLQ CVPDLDNEIS LHLWNSKHQK QGPLTNGLSK TLEPFVNIPD HKIRMALKVL LDEKNHPVLI HCKRGKHRTG CLVGCLRKLQ KWCLTSIFDE YQRFAAAKAR VSDQRFMEIF DVSSFSHIPM SFSCSISVSI KKDYAARLYL IETLLGLEEA DKFFKSIPSN MIDYSTLLTS YARSDDVKPD NVTANTVLKA DVKAIKMFMR MWVDEEGIKL ERDRIVEMAK VYVRVCLEIY GNVVWNAREL LQTLWDDFKK CEEVYRTAVI SLSKLDDVEG AEDIYGEGEK KEQNGPLGVS VWTVFFMGQV GRSIMDNGPK NVHAQ // ID NC003070_286 HYPOTHETICAL; PRT; 323 AA. AC NC003070_286; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1432694...1432590, 1432517...1432291, DE 1432139...1431806, 1431723...1431418]; Length: 972. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 35 FIRST EXON; p-value: NaN. FT GENSCAN 36 110 INTERNAL EXON; p-value: NaN. FT GENSCAN 111 111 AA on splice site: ag/a -> R. FT GENSCAN 112 222 INTERNAL EXON; p-value: NaN. FT GENSCAN 223 323 LAST EXON; p-value: NaN. SQ SEQUENCE 323 AA; 36678 MW; FC4B8DA190FA285C CRC64; MESFPIINLE KLNGEERAIT MEKIKDACEN WGFFECVNHG ISLELLDKVE KMTKEHYKKC MEERFKESIK NRGLDSLRSE VNDVDWESTF YLKHLPVSNI SDVPDLDDDY RTLMKDFAGK IEKLSEELLD LLCENLGLEK GYLKKVFYGS KRPTFGTKVS NYPPCPNPDL VKGLRAHTDA GGIILLFQDD KVSGLQLLKD GEWVDVPPVK HSIVVNLGDQ LEVITNGKYK SVEHRVLSQT DGEGRMSIAS FYNPGSDSVI FPAPELIGKE AEKEKKENYP RFVFEDYMKL YSAVKFQAKE PRFEAMKAME TTVANNVGPL ATA // ID NC003070_287 HYPOTHETICAL; PRT; 653 AA. AC NC003070_287; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1437344...1435383]; Length: 1962. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 653 SINGLE EXON; p-value: NaN. SQ SEQUENCE 653 AA; 72173 MW; 6D6B9C9DFE47850A CRC64; MPSKLKKAIG AVKDQTSISL AKVANGATGG GDLTTLEVAI LKATSHDEEV PIDDRLVTEI LGIISSKKSH AASCAAAIGR RIGRTRNWIV ALKSLVLVLR IFQDGDPYFP REVLHAMKRG AKILNLSSFR DDSNSCPWDF TAFVRTFALY LDERLDCFLT GKLQRRYTNR EQTGRISTNS TTRSRFNPKA GIKSHEPAVR DMKPVMLLDK ITYWQKLLDR AIATRPTGDA KANRLVKMSL YAVMQESFDL YRDISDGLAL LLDSFFHLQY QSCINAFQAC VRASKQFEEL NAFYDLSKSI GIGRTSEYPS IQKISLELLE TLQEFLKDQS SFPASSGLYP SPNSFLPPPP SSKDSAVSSS LDFGDSTIDT SERYSDYGSF RSTSLEDLMS RTEAGTSSPP MSCHSEPYGG GRDDPNGNNF DTVSTKSLPN NPSVSASNLI LDLLSLDDVS NTAEAEDVED KKKQDDSKAE TFDPWEALML RDDPKKKIET IEEEPSTAED HQRDSGNWLL ALEETATQVQ GNNSMAIVPF GLDDPMPAFQ AATDQYNPFL EEPVAQLATA GEPMITFGGL ALTGFQPEPT FQVNVPDDFE PSSTPTFKAT ETLPMKCDPF TTFESFGFGE TFSENGGVNQ QSVLQEQQIW LQNQKKIIAK HLS // ID NC003070_288 HYPOTHETICAL; PRT; 470 AA. AC NC003070_288; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1442667...1442610, 1442252...1442083, DE 1441399...1441216, 1441125...1441002, 1440645...1440445, DE 1440332...1440233, 1439657...1439478, 1438898...1438810, DE 1438709...1438595, 1438514...1438323]; Length: 1413. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: NaN. FT GENSCAN 20 20 AA on splice site: a/at -> N. FT GENSCAN 21 76 INTERNAL EXON; p-value: NaN. FT GENSCAN 77 137 INTERNAL EXON; p-value: NaN. FT GENSCAN 138 138 AA on splice site: g/at -> D. FT GENSCAN 139 178 INTERNAL EXON; p-value: NaN. FT GENSCAN 179 179 AA on splice site: gg/a -> G. FT GENSCAN 180 245 INTERNAL EXON; p-value: NaN. FT GENSCAN 246 246 AA on splice site: ag/t -> S. FT GENSCAN 247 279 INTERNAL EXON; p-value: NaN. FT GENSCAN 280 339 INTERNAL EXON; p-value: NaN. FT GENSCAN 340 368 INTERNAL EXON; p-value: NaN. FT GENSCAN 369 369 AA on splice site: at/g -> M. FT GENSCAN 370 407 INTERNAL EXON; p-value: NaN. FT GENSCAN 408 470 LAST EXON; p-value: NaN. SQ SEQUENCE 470 AA; 52357 MW; C94F4A734C2366AD CRC64; MKTCPTRCDH VEMMYSHVRN LKQRRVRKEE LSICNRSHRL HKHNFANMNQ IFTQSNKTLN SIRRIEESSQ NLDRLVSSSP IMWVTNTVLL YRPNSMNRLT FSYPTRLAHS RKASSFSRFF RSSKRKKRVT TLSTKKPDDD HEISPVPPEK FSADLGWLSA FPHVSVASMA NFLFGYHIGV MNGPIVSIAR ELGFEGNSIL EGLVVSIFIA GAFIGSIVAG PLVDKFGYRR TFQIFTIPLI LGALVSAQAH SLDEILCGRF LVGLGIGVNT VLVPIYISEV GRLDDAKVVI RNIWGGSEVE KAVEDFQSVM KNSGSNLNSR WLELLDKPHS RGHVKVAFNA VSMFLIVYAV GFPLDEDLSQ SLSILGTLMY IFSFAIGAGP VTGLIIPELS SNRTRGKIMG FSFSVHWVSN FLVGLFFLDL VEKYGVGTVY ASFGSVSLLA AAFSHLFTVE TKGRSLEEIE LSLNSRDDLS // ID NC003070_289 HYPOTHETICAL; PRT; 98 AA. AC NC003070_289; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1444982...1445223, 1446086...1446140]; DE Length: 297. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 80 FIRST EXON; p-value: NaN. FT GENSCAN 81 81 AA on splice site: ag/a -> R. FT GENSCAN 82 98 LAST EXON; p-value: NaN. SQ SEQUENCE 98 AA; 12085 MW; E91E583361B9DB69 CRC64; MEEFSSSEII YDDVIQRLFC DDVTQRLFWM MSYKGYFGIC FEYREQRTTV KIREITERRR LGEVSDQEKS PRYIKFQYDF RLDLQNDMKT NRILDPVI // ID NC003070_290 HYPOTHETICAL; PRT; 166 AA. AC NC003070_290; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1448447...1448140, 1448055...1447953, DE 1447846...1447757]; Length: 501. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 102 FIRST EXON; p-value: NaN. FT GENSCAN 103 103 AA on splice site: ag/t -> S. FT GENSCAN 104 137 INTERNAL EXON; p-value: NaN. FT GENSCAN 138 166 LAST EXON; p-value: NaN. SQ SEQUENCE 166 AA; 18754 MW; 01B65F521121AC59 CRC64; MKEEDTNVTR FCKATSACKD AAFYYLEGFD WNLEDAISGF LGDQLPPLKI RATPRRVNEW RERSRSPLRR RYSTTSVSEF TLKMRQDEIT ITKSSDTGAL ATSLDDSGRE LHDSDDISHG EKIVSIANPV NQEFKEEGSS VQATKICEST SIEIHLPRPD PYSDSD // ID NC003070_291 HYPOTHETICAL; PRT; 644 AA. AC NC003070_291; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1452857...1452508, 1452418...1452294, DE 1451755...1451664, 1451536...1451450, 1450866...1450576, DE 1450173...1449712, 1449614...1449236, 1449060...1448912]; Length: 1935. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 116 FIRST EXON; p-value: NaN. FT GENSCAN 117 117 AA on splice site: ag/a -> R. FT GENSCAN 118 158 INTERNAL EXON; p-value: NaN. FT GENSCAN 159 159 AA on splice site: a/tt -> I. FT GENSCAN 160 189 INTERNAL EXON; p-value: NaN. FT GENSCAN 190 218 INTERNAL EXON; p-value: NaN. FT GENSCAN 219 315 INTERNAL EXON; p-value: NaN. FT GENSCAN 316 469 INTERNAL EXON; p-value: NaN. FT GENSCAN 470 595 INTERNAL EXON; p-value: NaN. FT GENSCAN 596 596 AA on splice site: g/gg -> G. FT GENSCAN 597 644 LAST EXON; p-value: NaN. SQ SEQUENCE 644 AA; 71439 MW; 0C7D9A02762AFADC CRC64; MSFLGAGRLA GKEAAYFFQE SKHAVNRLAE KSPATGKKLP SSPPDPPEIQ PDVLPEILRH SLPSKIYGRP PDPSSLSQFS KWALESDPNA TVSISPDVLN PLRGYVSLPQ VTFGRRRWDL PESENSVLAS TANELRRDRY GTPVNPEKLK AAGEGLQHIG KAFAAATIII FGSATLVFGT AASKLDMRNA DDIRTKGKDL FQPKLESMKE QVEPLRTWTV RLTMSNQRKR SNDEREEEDD EDAEGIGEWE RAYVDDRSWE ELQEDESGLL RPIDNSAIYH AQYRRRLRML SAAAAGTRIQ KGLIRYLYIV IDFSRAAAEM DFRPSRMAIM AKHVEAFIRE FFDQNPLSQI GLVSIKNGVA HTLTDLGGSP ETHIKALMGK LEALGDSSLQ NALELVHEHL NQVPSYGHRE VLILYSALCT CDPGDIMETI QKCKKSKLRC SVIGLSAEMF ICKHLCQETG GLYSVAVDEV HLKDLLLEHA PPPPAIAEFA IANLIKMGFP QRAAEGSMAI CSCHKEVKIG AGYMCPRCKA RVCDLPTECT ICGLTLVSSP HLARSYHHLF PIAPFDEVPA LSSLNDNRRK LGKSCFGCQQ SLIGAGNKPV PCVTCRKCKH YFCLDCDIYI HESLHNCPGC ESIHRPKSVS LMEE // ID NC003070_292 HYPOTHETICAL; PRT; 315 AA. AC NC003070_292; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1458530...1458347, 1457640...1457617, DE 1457517...1457221, 1455134...1455007, 1454234...1454094, DE 1453450...1453277]; Length: 948. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 61 FIRST EXON; p-value: NaN. FT GENSCAN 62 62 AA on splice site: g/ag -> E. FT GENSCAN 63 69 INTERNAL EXON; p-value: NaN. FT GENSCAN 70 70 AA on splice site: g/at -> D. FT GENSCAN 71 168 INTERNAL EXON; p-value: NaN. FT GENSCAN 169 169 AA on splice site: g/ct -> A. FT GENSCAN 170 211 INTERNAL EXON; p-value: NaN. FT GENSCAN 212 258 INTERNAL EXON; p-value: NaN. FT GENSCAN 259 315 LAST EXON; p-value: NaN. SQ SEQUENCE 315 AA; 35375 MW; 80B2F2C07D4508A7 CRC64; MARHTAALKI GLALLGLSMA GYILGPPLYW HLTEALAAVS ASSCPSCPCE CSTYSAVTIP KELSNASFAD CAKHDPEVNE DTEKNYAELL TEELKLREAE SLEKHKRADM GLLEAKKVTS SYQKEADKCN SGMETCEEAR EKAELALAEQ KKLTSRWEER ARQKGWREAS RIHVERRRFS SKPSGENREF LPSQPTFPVV DAGEILPDKR KDYVELILNN TIINAYVKKG DFLAVEKILK VMKKENVTYN AATYDFDWQG KVKEARKLRT DLEDKRMVLF DEWGSKGLVQ NVVTYTAMIS SGLSKAGKSN ESFGF // ID NC003070_293 HYPOTHETICAL; PRT; 501 AA. AC NC003070_293; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1459090...1459977, 1460062...1460196, DE 1460303...1460545, 1460996...1461124, 1461394...1461504]; Length: 1506. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 296 FIRST EXON; p-value: 0.000. FT GENSCAN 297 341 INTERNAL EXON; p-value: 0.000. FT GENSCAN 342 422 INTERNAL EXON; p-value: 0.000. FT GENSCAN 423 465 INTERNAL EXON; p-value: 0.000. FT GENSCAN 466 501 LAST EXON; p-value: 0.000. SQ SEQUENCE 501 AA; 57408 MW; 4C0BF0C98F0D05D1 CRC64; MAEAKIEEIG CEDRISVLPE DLLVVILDLL PTKDVVATMI LSKRWLSIWT MVRTLEYTDD MDDESKKSVW WFLNKSLQLH KAPVIDSLCM ELGPQCPTTD DVDIGKWVAK AVDCLVMTLT IKLLWSAGPT SLPKSLYSCT SLSELTLSDQ ILVNVPSSAY LPSLTELELI CVVYKDEDSL VSFLSSCPVL EFLFVLRKID DNVKTFTVKV PSLLELTYKN LCSDVVDNTD RCLVVNAPAV NTCQITDYSL ESFSIEDMPC LQDATIDVDE AYHPDDKFLT SFSSVLSLRM HLSDAMVMRC TTINFSRLIK LSIYPYGPDM LETLLRLLGN APKLKEFLVD YKFVYNPEDL PWSWKQPSHV PECLSSQLEI FEWRDYGDRI IEEEFLTYVL ANSKRLKTAT ISLRLNLEDP ELIIEEIKDL PRPRWTADVR NIETCKWNKW SIVVNVILVN IRLKLTVIRG QKTNKIKRLH LLLTVKESAM DVPSNLESRR RLTVLSFQIH Y // ID NC003070_294 HYPOTHETICAL; PRT; 728 AA. AC NC003070_294; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1462855...1462870, 1463151...1463527, DE 1463648...1464089, 1464147...1465323, 1465447...1465621]; Length: 2187. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 5 FIRST EXON; p-value: 0.000. FT GENSCAN 6 6 AA on splice site: g/ag -> E. FT GENSCAN 7 131 INTERNAL EXON; p-value: 0.000. FT GENSCAN 132 278 INTERNAL EXON; p-value: 0.000. FT GENSCAN 279 279 AA on splice site: a/ag -> K. FT GENSCAN 280 670 INTERNAL EXON; p-value: 0.000. FT GENSCAN 671 671 AA on splice site: ag/a -> R. FT GENSCAN 672 728 LAST EXON; p-value: 0.000. SQ SEQUENCE 728 AA; 79788 MW; 51912740E5F40550 CRC64; MIDVQEEEDS ESSSSNPRFL NEMAMEISVD LINQLKVSLR KEAKLTSVDD CSDSSFPSLP TSEEAIAELD ASAPYLRCRN CKGKLLRGIE SLICVFCGNQ QRTSDNPPDP IKFTSTSAYK WFLTSLNLDG SEMVEPLKET DGSSRGATKA PPSKGIALSK FLDLEIQWSA LEEKSDDGQS VQKKNPLNLG GINLDDYFVE RRGDLSKVEQ AESKPVEDDD FKDPRSLSLF DSVKSQGVVG SQQHDNVGLF DKKDAPKSVV SSGEHENLSL FAGRDAQEKD ENLSLFEGKE DAQRTSSSKV DESFGFFEGK DAQRTSSSKD DESFGMFEGK KDAQRNSSSK EDESFGMFEG KEDAQRNSSS KENENFGFFE GAPLSNADLK SFDDKIVAAS SDWDSDFQSA DQNLSQKKID GDPFVSSPVD LAAHMDSVFG SGKDLLYAQP ADSSTAYVSK AGDWLQDDLF GNVTGEAQTN DSAVHDKNEG QIVGGNGNSS MDIDWIGDDL WQTNEKKSIE KTPTDVNDDD DDDWNDFASS ANSKTPNNPL SQTMESSQFE IFYGHAQDKN GVKEQSVDEK QNTDTSVMSD IGKCQEDDLF GTWDSFTSST ILQTSLQPPT IHANPSGEKN PEMNLFGENN NNRDLDFDSI SRSDFFSESS GGKTNSEEVK VIPSGTSTLD RPSDPDGSKD QTVDLVVGTT TTVPKSMSDV AEELMSQMHD LSFMLETKLS VPPISKTE // ID NC003070_295 HYPOTHETICAL; PRT; 339 AA. AC NC003070_295; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1469678...1470697]; Length: 1020. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 339 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 339 AA; 37743 MW; 552650D1E6B9DE17 CRC64; MNWTRGKTLG RGSTATVSAA TCHESGETLA VKSAEFHRSE FLQREAKILS SLNSPYVIGY RGCEITREPF HNNGEATTYS LLMEYAPYGT LTDVATKNGG FIDEARVVKY TRQILLGLEY IHNSKGIAHC DIKGSNVLVG ENGEAKIADF GCAKWVEPEI TEPVRGTPAF MAPEAARGER QGKESDIWAV GCTVIEMVTG SQPWIGADFT DPVSVLYRVG YLGELPELPC SLTEQAKDFL GKCLKKEATE RWTASQLLNH PFLVNKEPEL VTGLVTNSPT SVTDQMFWRS VEEEVSEDRS SWWECHEDER IGVLSWIGHV VVESTWDLDG EDWITVRRN // ID NC003070_296 HYPOTHETICAL; PRT; 829 AA. AC NC003070_296; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1476066...1476021, 1475725...1475630, DE 1475538...1474595, 1474374...1474252, 1474042...1473920, DE 1473816...1473670, 1473478...1473393, 1473306...1473237, DE 1473130...1473008, 1472832...1472602, 1472476...1472372, DE 1472281...1472129, 1472035...1471947, 1471859...1471779, DE 1471695...1471623]; Length: 2490. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 15 FIRST EXON; p-value: 0.000. FT GENSCAN 16 16 AA on splice site: g/ta -> V. FT GENSCAN 17 47 INTERNAL EXON; p-value: 0.000. FT GENSCAN 48 48 AA on splice site: g/gt -> G. FT GENSCAN 49 362 INTERNAL EXON; p-value: 0.000. FT GENSCAN 363 403 INTERNAL EXON; p-value: 0.000. FT GENSCAN 404 444 INTERNAL EXON; p-value: 0.000. FT GENSCAN 445 493 INTERNAL EXON; p-value: 0.000. FT GENSCAN 494 521 INTERNAL EXON; p-value: 0.000. FT GENSCAN 522 522 AA on splice site: ac/g -> T. FT GENSCAN 523 545 INTERNAL EXON; p-value: 0.000. FT GENSCAN 546 586 INTERNAL EXON; p-value: 0.000. FT GENSCAN 587 663 INTERNAL EXON; p-value: 0.000. FT GENSCAN 664 698 INTERNAL EXON; p-value: 0.000. FT GENSCAN 699 749 INTERNAL EXON; p-value: 0.000. FT GENSCAN 750 778 INTERNAL EXON; p-value: 0.000. FT GENSCAN 779 779 AA on splice site: ag/g -> R. FT GENSCAN 780 805 INTERNAL EXON; p-value: 0.000. FT GENSCAN 806 806 AA on splice site: gg/g -> G. FT GENSCAN 807 829 LAST EXON; p-value: 0.000. SQ SEQUENCE 829 AA; 93528 MW; 9B2D79CB116A3763 CRC64; MELRSRNKAI RPSTEVVVDL EEGTGINPDE EPYAISSDDD SIGSEFQGDE EEEEELEEVV ANDDLPNPVP VLAIVNLPRA SKKRKKPDAR KEKVVLLWET WEKEQNSWID EHMSEDVDLD QHNAVIAETA EPPSDLIMPL LRYQKEFLAW ATKQEQSVAG GILADEMGMG KTIQAISLVL ARREVDRAQF GEAAGCTLVL CPLVAVSQWL NEIARFTSPG STKVLVYHGA KRAKNIKEFM NYDFVLTTYS TVESEYRRNI MPSKVQCAYC SKSFYPKKLV IHLRYFCGPS AVKTAKQSKQ KRKKTSDSSS QQGKEADAGE DKKLKKSKKK TKQTVEKDQL GSDDKEKSLL HSVKWNRIIL DEAHYIKERR SNTARAVFAL EATYRWALSG TPLQNRVGEL YSLFCLIVNN NLCGSVFSGG VCSAHQSCPH CPHNAVRHFC WWNKYVAKPI TVYGSFGLGK RAMILLKHKV LKDILLRRTK LGRAADLALP PRIITLRRDT LDVKEFDYYE SLYKNSQAEF NTYIEAGTLM NNYAHIFDLL TRLRQAVDHP YLVVYSNSSG ANANLVDENK SEQECGLCHD PAEDYVVTSC AHVFCKACLI GFSASLGKVT CPTCSKLLTV DWTTKADTEH KASKTTLKGF RASSILNRIK LDDFQTSTKI EALREEIRFM VERDGSAKAI VFSQFTSFLD LINYTLGKCG VSCVQLVGSM TMAARDTAIN KFKEDPDCRV FLMSLKAGGV ALNLTVASHV FMMDPWWNPA VERQAQDRIH RIGQYKPIRV VRFIIENTVE ERILRLQKKK ELVFEGTVGG SQEAIGKLTE EDMRFLFTT // ID NC003070_297 HYPOTHETICAL; PRT; 398 AA. AC NC003070_297; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1481040...1481027, 1480781...1480754, DE 1478820...1477666]; Length: 1197. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 4 FIRST EXON; p-value: 0.000. FT GENSCAN 5 5 AA on splice site: ac/g -> T. FT GENSCAN 6 14 INTERNAL EXON; p-value: 0.000. FT GENSCAN 15 398 LAST EXON; p-value: 0.000. SQ SEQUENCE 398 AA; 30323 MW; 18FA4745DB1E7103 CRC64; MNFQTRLIIT SSSIMELKGV TCLLLSLVLL NSCVECVLGD GSVVGPARFR DDDCRWGRRC AGRGRFGRGG GGGFGGGRGS GGGIGGGGGQ GGGFGAGGGV GGGAGGGLGG GGGAGGGGGG GIGGGSGHGG GFGAGGGVGG GAGGGIGGGG GAGGGGGGGV GGGSGHGGGF GAGGGVGGGA GGIGGGGGAG GGGGGGVGGG SGHGSGFGAG GGIGGGAGGG VGGGGGGGGG GGGGGGANGG SGHGSGFGAG GGVGGGVGGG AGGGGGGGGG GGGGANGGSG HGSGFGAGGG VGGGVGGGAG GGGGGGGGGG GGVGGGSGHG GGFGAGGGLG GGAGGGLGGG GGAGGGGGGG LGHGGGVGGG HGGGVGIGIG IGIGVGVGGG SGQGSGSGSG SGGGGGRH // ID NC003070_298 HYPOTHETICAL; PRT; 441 AA. AC NC003070_298; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1482680...1484005]; Length: 1326. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 441 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 441 AA; 47776 MW; 0919B8C6D7C6ADD6 CRC64; MLLNISSSPI SHRIPHFLSD FNNPTSNFPP KSKTHLPKSN LSTLSNHSLY GTKNRAFYKN KRNPYNRTQA LGRFDFGSLE SVLEASAVLT AIIVVHETGH FLAASLQGIR VSKFAIGFGP ILAKFNSNNV EYSLRAFPLG GFVGFPDNDP DSDIPVDDRN LLKNRPILDR VIVVSAGIVA NVIFAYAIIF TQVVSVGLPV QESFPGVLVP DVKSFSAASR DGLLPGDVIL AVDGTELSNS GSDSVSKVVD VVKRNPEHNV LLRIERGKES FEIRITPDKS FDGTGKIGVQ LSPNVRFGKV RPKNIPETFS FAGREFFGLS YNVLDSLKQT FLNFSQTASK VAGPVAIIAV GAEVARSNAD GLYQFAALLN LNLAVINLLP LPALDGGTLA LILLEAVRGG RKLPLEVEQG IMSSGIMLVL FLGLFLIVKD TLNLDFIKEM L // ID NC003070_299 HYPOTHETICAL; PRT; 808 AA. AC NC003070_299; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1486705...1484279]; Length: 2427. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 808 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 808 AA; 90171 MW; F4B8DA7AB620B86A CRC64; MATRGSRSEK VKRIFQQFDG NHDGGLNREE MAALVVAVNP RVKFSDEQIN AILDEVFRTY AEFIDPNKGL TYDGLLRTYD DGAGDVDRDF DALGLELNAD ETTIKGSEAA SSSSITDERA VEAQKKQRTA AWAVSPNHGI VFDETWKLVD DLEILVKRLK SKQEKDGKLK ADNNNNNVDA FSDAGWSREL GPSSEISEKR IYWEESSHDY GVFVKELGVL RSKADGARSR EEAFDGHMAI GRVLYEHQLF KEALVSFKRA CELQPTDVRP HFKAGNCLYV LGKCKESKDE FLLALEAAES GGNQWAYLLP QIYVNLGIAL EGEGMVLSAC EYYREAAILC PTHFRALKLL GSALFGVGEY RAAVKALEEA IYLKPDYADA HCDLASSLHS MGEDERAIEV FQRAIDLKPG HVDALYNLGG LYMDLGRFQR ASEMYTRVLT VWPNHWRAQL NKAVSLLGAG ETEEAKRALK EALKLTNRVE LHDAISHLKH LQKKKGKNNG NGNGGEGPFI VVEPSKFKTV GEKTTLRPDL ATALQIRAFQ RVTRLGKCDV EAVRKEMRDN DVPVSYSGSG GPTKSIRKPN LEEILRRLLS SLKPDTFQGA IKAINEKILA LLDDSGSGRV DMGMFYAVIA PLCGGHSDKR KRVAFDALLW RPVNEGSSQI TKTDAVKYIK LLRAIYIPSH GMSEMLEVHG EEEAESSVTV TFNQFLAMFD DPDWGFGIMS TILKLEANDR NRHGNQVCSV CRYPVIGSRF KEVKARFSLC NQCYGEGKVP PSFKQEEYKF REYESEAEAM KAKCVCFSMQ SHKKAIAT // ID NC003070_300 HYPOTHETICAL; PRT; 808 AA. AC NC003070_300; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1496618...1496524, 1493590...1493494, DE 1493400...1493327, 1493239...1493028, 1492951...1492888, DE 1492804...1492732, 1492647...1492527, 1492282...1492135, DE 1492048...1491939, 1491607...1491491, 1490863...1490771, DE 1490042...1489944, 1489685...1489589, 1489385...1488908, DE 1488823...1488560, 1488478...1488389, 1488297...1488219, DE 1488124...1488018, 1487678...1487670]; Length: 2427. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 31 FIRST EXON; p-value: 0.000. FT GENSCAN 32 32 AA on splice site: ac/g -> T. FT GENSCAN 33 64 INTERNAL EXON; p-value: 0.000. FT GENSCAN 65 88 INTERNAL EXON; p-value: 0.000. FT GENSCAN 89 89 AA on splice site: ca/g -> Q. FT GENSCAN 90 159 INTERNAL EXON; p-value: 0.000. FT GENSCAN 160 160 AA on splice site: g/gt -> G. FT GENSCAN 161 180 INTERNAL EXON; p-value: 0.000. FT GENSCAN 181 181 AA on splice site: ag/t -> S. FT GENSCAN 182 205 INTERNAL EXON; p-value: 0.000. FT GENSCAN 206 245 INTERNAL EXON; p-value: 0.000. FT GENSCAN 246 246 AA on splice site: g/tc -> V. FT GENSCAN 247 294 INTERNAL EXON; p-value: 0.000. FT GENSCAN 295 295 AA on splice site: ca/g -> Q. FT GENSCAN 296 331 INTERNAL EXON; p-value: 0.000. FT GENSCAN 332 332 AA on splice site: g/at -> D. FT GENSCAN 333 370 INTERNAL EXON; p-value: 0.000. FT GENSCAN 371 371 AA on splice site: g/tc -> V. FT GENSCAN 372 401 INTERNAL EXON; p-value: 0.000. FT GENSCAN 402 402 AA on splice site: g/aa -> E. FT GENSCAN 403 434 INTERNAL EXON; p-value: 0.000. FT GENSCAN 435 435 AA on splice site: g/gt -> G. FT GENSCAN 436 466 INTERNAL EXON; p-value: 0.000. FT GENSCAN 467 467 AA on splice site: ag/g -> R. FT GENSCAN 468 626 INTERNAL EXON; p-value: 0.000. FT GENSCAN 627 714 INTERNAL EXON; p-value: 0.000. FT GENSCAN 715 744 INTERNAL EXON; p-value: 0.000. FT GENSCAN 745 770 INTERNAL EXON; p-value: 0.000. FT GENSCAN 771 771 AA on splice site: g/gc -> G. FT GENSCAN 772 806 INTERNAL EXON; p-value: 0.000. FT GENSCAN 807 808 LAST EXON; p-value: 0.000. SQ SEQUENCE 808 AA; 92737 MW; 16D1B6F3371E1483 CRC64; MCLLHFDGYG KFKSCGFLFG TPKIIISIGW LTMWNIPESK GMSHPSVTEA ERLKLVSEGC NPKALYQKEV KRDPQALFGE VANTHIALQT LDKTISSLEM ELAAARSVQE SLQNGAPLSD DMGKKQPQEQ RRFLMVVGIN TAFSSRKRRD SIRATWMPQG EKRKRLEEEK GIIIRFVIGH SATTGGILDR AIEAEDRKHG DFLRLDHVEG YLELSGKTKT YFSTAFSMWD ADFYVKVDDD VHVNIVIFVL WSRGVRYHEP EYWKFGENGN KYFRHATGQL YAISRDLASY ISINQHVLHK YANEDVSLGA WFIGIDVKHI DDRRLCCGTP PDCEWKAQAG NICVASFDWS CSGICRSADR IKEVHRRCGE VETLVRELGD AEDGNSDEYW NQFIEPFAES EETKIRYNIT TRNCMDKIFS RLKDILPVLL VDFPGDLGWP FIGNMLSFLR AFKTSDPDSF TRTLIKRYGP KGIYKAHMFG NPSIIVTTSD TCRRVLTDDD AFKPGWPTST MELIGRKSFV GISFEEHKRL RRLTAAPVNG HEALSTYIPY IEENVITVLD KWTKMGEFEF LTHLRKLTFR IIMYIFLSSE SENVMDALER EYTALNYGVR AMAVNIPGFA YHRALKARKT LVAAFQSIVT ERRNQRKQNI LSNKKDMLDN LLNVKDEDGK TLDDEEIIDV LLMYLNAGHE SSGHTIMWAT VFLQEHPEVL QRAKAEQEMI LKSRPEGQKG LSLKETRKME FLSQVVDETL RVITFSLTAF REAKTDVEMN GYLIPKGWKV LTWFRDVHID PEVFPDPRKF DPARWDTN // ID NC003070_301 HYPOTHETICAL; PRT; 540 AA. AC NC003070_301; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1501774...1501689, 1501596...1501424, DE 1501329...1501166, 1500913...1500803, 1500678...1500599, DE 1500484...1500331, 1500248...1500155, 1499978...1499890, DE 1499778...1499705, 1499617...1499455, 1499313...1499228, DE 1498918...1498835, 1498745...1498565, 1498439...1498356]; Length: 1623. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 28 FIRST EXON; p-value: 0.000. FT GENSCAN 29 29 AA on splice site: ag/g -> R. FT GENSCAN 30 86 INTERNAL EXON; p-value: 0.000. FT GENSCAN 87 87 AA on splice site: g/tc -> V. FT GENSCAN 88 141 INTERNAL EXON; p-value: 0.000. FT GENSCAN 142 178 INTERNAL EXON; p-value: 0.000. FT GENSCAN 179 204 INTERNAL EXON; p-value: 0.000. FT GENSCAN 205 205 AA on splice site: ag/t -> S. FT GENSCAN 206 256 INTERNAL EXON; p-value: 0.000. FT GENSCAN 257 287 INTERNAL EXON; p-value: 0.000. FT GENSCAN 288 288 AA on splice site: a/gc -> S. FT GENSCAN 289 317 INTERNAL EXON; p-value: 0.000. FT GENSCAN 318 341 INTERNAL EXON; p-value: 0.000. FT GENSCAN 342 342 AA on splice site: ga/a -> E. FT GENSCAN 343 396 INTERNAL EXON; p-value: 0.000. FT GENSCAN 397 424 INTERNAL EXON; p-value: 0.000. FT GENSCAN 425 425 AA on splice site: ag/t -> S. FT GENSCAN 426 452 INTERNAL EXON; p-value: 0.000. FT GENSCAN 453 453 AA on splice site: gg/a -> G. FT GENSCAN 454 513 INTERNAL EXON; p-value: 0.000. FT GENSCAN 514 540 LAST EXON; p-value: 0.000. SQ SEQUENCE 540 AA; 60036 MW; 9CD0091114654B06 CRC64; MQAVKRSRRH VEEEPTMVEP KTKYDRQLRI WGEVGQAALE EASICLLNCG PTGSEALKNL VLGGVGSITV VDGSKVQFGD LGNNFMVDAK SVGQSKAKSV CAFLQELNDS VNAKFIEENP DTLITTNPSF FSQFTLVIAT QLVEDSMLKL DRICRDANVK LVLVRSYGLA GFVRISVKEH PIIDSKPDHF LDDLRLNNPW PELKSFVETI DLNVSEPAAA HKHIPYVVIL VKMAEEWAQS HSGNLPSTRE EKKEFKDLVK SKMVSTDEDN YKEAIEAAFK VFAPRGISSE VQKLINDSCA EVNSNSSAFW VMVAALKEFV LNEGGGEAPL EGSIPDMTSS TEHYINLQKI YLAKAEADFL VIEERVKNIL KKIGRDPSSI PKPTIKSFCK NARKLKLCRY RMVEDEFRNP SVTEIQKYLA DEDYSGAMGF YILLRAADRF AANYNKFPGQ FDGGMDEDIS RLKTTALSLL TDLGCNGSVL PDDLIHEMCR FGASEIHVVS AFVGGIASQE VIKLVTKQFV PMLGTYIFNG IDHKSQLLKL // ID NC003070_302 HYPOTHETICAL; PRT; 240 AA. AC NC003070_302; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1504853...1504768, 1503260...1502961, DE 1502850...1502514]; Length: 723. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 28 FIRST EXON; p-value: 0.000. FT GENSCAN 29 29 AA on splice site: ag/g -> R. FT GENSCAN 30 128 INTERNAL EXON; p-value: 0.000. FT GENSCAN 129 129 AA on splice site: ag/g -> R. FT GENSCAN 130 240 LAST EXON; p-value: 0.000. SQ SEQUENCE 240 AA; 26740 MW; E5F8E13B282ADFC6 CRC64; MIDALLVSSY YSINQIFTLA LIGNYHLLRS AFLGDRNVFK VSSTPFAQVG YSSKTIECKE SRIGKQPIAV PSNVTIALEG QDLKVKGPLG ELALTYPREV ELTKEESGFL RVKKTVETRR ANQMHGLFRT LTDNMVVGVS KGFEKKLILV GVGYRATVDG KELVLNLGFS HPVKMQIPDS LKVKVEENTR ITVSGYDKSE IGQFAATVRK WRPPEPYKGK GVKYSDEIVR RKEGKAGKKK // ID NC003070_303 HYPOTHETICAL; PRT; 959 AA. AC NC003070_303; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1505641...1505992, 1506116...1507449, DE 1507528...1507805, 1507885...1507916, 1508023...1508432, DE 1508528...1509001]; Length: 2880. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 117 FIRST EXON; p-value: 0.000. FT GENSCAN 118 118 AA on splice site: g/ct -> A. FT GENSCAN 119 562 INTERNAL EXON; p-value: 0.000. FT GENSCAN 563 654 INTERNAL EXON; p-value: 0.000. FT GENSCAN 655 655 AA on splice site: tg/g -> W. FT GENSCAN 656 665 INTERNAL EXON; p-value: 0.000. FT GENSCAN 666 666 AA on splice site: a/gg -> R. FT GENSCAN 667 802 INTERNAL EXON; p-value: 0.000. FT GENSCAN 803 959 LAST EXON; p-value: 0.000. SQ SEQUENCE 959 AA; 107208 MW; 0709B834A76FAFC0 CRC64; MGFLVMIREV SMAKAIRVVL LCVSVLWVVP KECACRSNFS RNSSSSSSSS LRPLRQRPSS VNVGALFTYD SFIGRAAKPA VKAAMDDVNA DQSVLKGIKL NIIFQDSNCS GFIGTMGALQ LMENKVVAAI GPQSSGIAHM ISYVANELHV PLLSFGATDP TLSSLQFPYF LRTTQNDYFQ MHAIADFLSY SGWRQVIAIF VDDECGRNGI SVLGDVLAKK RSRISYKAAI TPGADSSSIR DLLVSVNLME SRVFVVHVNP DSGLNVFSVA KSLGMMASGY VWIATDWLPT AMDSMEHVDS DTMDLLQGVV AFRHYTIESS VKRQFMARWK NLRPNDGFNS YAMYAYDSVW LVARALDVFF RENNNITFSN DPNLHKTNGS TIQLSALSVF NEGEKFMKII LGMNHTGVTG PIQFDSDRNR VNPAYEVLNL EGTAPRTVGY WSNHSGLSVV HPETLYSRPP NTSTANQRLK GIIYPGEVTK PPRGWVFPNN GKPLRIGVPN RVSYTDYVSK DKNPPGVRGY CIDVFEAAIE LLPYPVPRTY ILYGDGKRNP SYDNLVNEVV ADNFDVAVGD ITIVTNRTRY VDFTQPFIES GLVVVAPVKE AKSSPWSFLK PFTIEMWAVT GGFFLFVGAM VWILEHRFNQ EFRGPPRRQL ITIFWFSFST MFFSHRENTV SSLGRFVLII WLFVVLIINS SYTASLTSIL TIRQLTSRIE GIDSLVTSNE PIGVQDGTFA RNYLINELNI LPSRIVPLKD EEQYLSALQR GPNAGGVAAI VDELPYIEVL LTNSNCKFRT VGQEFTRTGW GFAFQRDSPL AVDMSTAILQ LSEEGELEKI HRKWLNYKHE CSMQISNSED SQLSLKSFWG LFLICGITCF MALTVFFWRV FWQYQRLLPE SADEERAGEV SEPSRSGRGS RAPSFKELIK VVDKREAEIK EILKQKSSKK LKSTQSAAGT SQSQHGEIT // ID NC003070_304 HYPOTHETICAL; PRT; 209 AA. AC NC003070_304; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1509612...1509662, 1510396...1510791, DE 1511302...1511484]; Length: 630. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 17 FIRST EXON; p-value: 0.000. FT GENSCAN 18 149 INTERNAL EXON; p-value: 0.000. FT GENSCAN 150 209 LAST EXON; p-value: 0.000. SQ SEQUENCE 209 AA; 23172 MW; A110EC9F9FA9CCD1 CRC64; MEQAAKILQL SIKQHLMVFP VNDSQNSDLK QNINRRRKKE KMGALGKLID VALFVYFVSM AIIAPLIDGQ TSLPSGIYPA FLTDLKSKYI ADFGDYLLME KPHFLVGLVW HELLFLWPLS IANVYAILAG KSWFGTTCLL YGASLVTSMA AILGDMIGSG KASDRLLMMY VPFMGFGILA VLRGLVYRST KNTGSSGKRS TIMPRRKLA // ID NC003070_305 HYPOTHETICAL; PRT; 160 AA. AC NC003070_305; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1511942...1512262, 1512494...1512655]; DE Length: 483. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 107 FIRST EXON; p-value: 0.000. FT GENSCAN 108 160 LAST EXON; p-value: 0.000. SQ SEQUENCE 160 AA; 17837 MW; 56934F542742A18F CRC64; MGALGKLINI SLFFFFALMA INVPLLNGQI LFPGIYPKLL TDLKDWYSSE FNDYLFIEKP LFFVGLVWHE IIFLLPLSIV NIYAILTSKS WFGTTSLLYG ASFLTSMAAI LGDMIGSEKV TNKLLLAYLP FVGLAILAML RGLVTCSTKR STVLARRKLA // ID NC003070_306 HYPOTHETICAL; PRT; 637 AA. AC NC003070_306; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1516993...1516782, 1516703...1516586, DE 1516008...1515820, 1515450...1515037, 1514789...1514589, DE 1514523...1514349, 1514144...1513993, 1513919...1513824, DE 1513743...1513387]; Length: 1914. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 70 FIRST EXON; p-value: 0.000. FT GENSCAN 71 71 AA on splice site: gc/g -> A. FT GENSCAN 72 110 INTERNAL EXON; p-value: 0.000. FT GENSCAN 111 173 INTERNAL EXON; p-value: 0.000. FT GENSCAN 174 311 INTERNAL EXON; p-value: 0.000. FT GENSCAN 312 378 INTERNAL EXON; p-value: 0.000. FT GENSCAN 379 436 INTERNAL EXON; p-value: 0.000. FT GENSCAN 437 437 AA on splice site: g/ga -> G. FT GENSCAN 438 487 INTERNAL EXON; p-value: 0.000. FT GENSCAN 488 519 INTERNAL EXON; p-value: 0.000. FT GENSCAN 520 637 LAST EXON; p-value: 0.000. SQ SEQUENCE 637 AA; 70501 MW; 8C5DC8357855DD76 CRC64; MNNADSNNHN YNHEDNNNEG FLRDDEFDSP NTKSGSENQE GGSGNDQDPL HPNKKKRYHR HTQLQIQEME AFFKECPHPD DKQRKQLSRE LNLEPLQVKF WFQNKRTQMK NHHERHENSH LRAENEKLRN DNLRYREALA NASCPNCGGP TAIGEMSFDE HQLRLENARL REEIDRISAI AAKYVGKPVS NYPLMSPPPL PPRPLELAMG NIGGEAYGNN PNDLLKSITA PTESDKPVII DLSVAAMEEL MRMVQVDEPL WKSLVLDEEE YARTFPRGIG PRPAGYRSEA SRESAVVIMN HVNIVEILMD VMSAEFQVPS PLVPTRETYF ARYCKQQGDG SWAVVDISLD SLQPNPPARC RRRASGCLIQ ELPNGYSKVT WVEHVEVDDR GVHNLYKHMV STGHAFGAKR WVAILDRQCE RLASVMATNI SSGEVGGAED VRVMTRKSVD DPGRPPGIVL SAATSFWIPV PPKRVFDFLR DENSRNEWDI LSNGGVVQEM AHIANGRDTG NCVSLLRVNS ANSSQSNMLI LQESCTDPTA SFVIYAPVDI VAMNIVLNGG DPDYVALLPS GFAILPDGNA NSGAPGGDGG SLLTVAFQIL VDSVPTAKLS LGSVATVNNL IACTVERIKA SMSCETA // ID NC003070_307 HYPOTHETICAL; PRT; 396 AA. AC NC003070_307; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1521201...1521413, 1521503...1521688, DE 1521782...1521947, 1522097...1522411, 1523645...1523775, DE 1523842...1524021]; Length: 1191. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 71 FIRST EXON; p-value: 0.000. FT GENSCAN 72 133 INTERNAL EXON; p-value: 0.000. FT GENSCAN 134 188 INTERNAL EXON; p-value: 0.000. FT GENSCAN 189 189 AA on splice site: g/ga -> G. FT GENSCAN 190 293 INTERNAL EXON; p-value: 0.000. FT GENSCAN 294 294 AA on splice site: g/aa -> E. FT GENSCAN 295 337 INTERNAL EXON; p-value: 0.000. FT GENSCAN 338 396 LAST EXON; p-value: 0.000. SQ SEQUENCE 396 AA; 44340 MW; EA77ADFF9E5116E0 CRC64; MAIKNILALV VLLSVVGVSV AIPQLLDLDY YRSKCPKAEE IVRGVTVQYV SRQKTLAAKL LRMHFHDCFV RGCDGSVLLK SAKNDAERDA VPNLTLKGYE VVDAAKTALE RKCPNLISCA DVLALVARDA VAVIGGPWWP VPLGRRDGRI SKLNDALLNL PSPFADIKTL KKNFANKGLN AKDLVVLSGK GDSDPSMNPS YVRELKRKCP PTDFRTSLNM DPGSALTFDT HYFKVVAQKK GLFTSDSTLL DDIETKNYVQ TQAILPPVFS SFNKDFSDSM VKLGFVQILT GKNEFFLIFR SKSPFETLRR NSENFFRSSS CSSSHTIRSD GEGEEANILK SNPVQLKSIS IGEGYSINDD ELELTAYFNV DRRREKNEND DRFWFSFLEE EKIRKK // ID NC003070_308 HYPOTHETICAL; PRT; 393 AA. AC NC003070_308; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1527168...1526956, 1526866...1526681, DE 1526587...1526422, 1526272...1525958, 1525325...1525215, DE 1525120...1524930]; Length: 1182. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 71 FIRST EXON; p-value: 0.000. FT GENSCAN 72 133 INTERNAL EXON; p-value: 0.000. FT GENSCAN 134 188 INTERNAL EXON; p-value: 0.000. FT GENSCAN 189 189 AA on splice site: g/ga -> G. FT GENSCAN 190 293 INTERNAL EXON; p-value: 0.000. FT GENSCAN 294 294 AA on splice site: g/tt -> V. FT GENSCAN 295 330 INTERNAL EXON; p-value: 0.000. FT GENSCAN 331 331 AA on splice site: g/ct -> A. FT GENSCAN 332 393 LAST EXON; p-value: 0.000. SQ SEQUENCE 393 AA; 44011 MW; 7B18C5B92054DB8A CRC64; MAIKNILALV VLLSVVGVSV AIPQLLDLDY YRSKCPKAEE IVRGVTVQYV SRQKTLAAKL LRMHFHDCFV RGCDGSVLLK SAKNDAERDA VPNLTLKGYE VVDAAKTALE RKCPNLISCA DVLALVARDA VAVIGGPWWP VPLGRRDGRI SKLNDALLNL PSPFADIKTL KKNFANKGLN AKDLVVLSGK GDSDPSMNPS YVRELKRKCP PTDFRTSLNM DPGSALTFDT HYFKVVAQKK GLFTSDSTLL DDIETKNYVQ TQAILPPVFS SFNKDFSDSM VKLGFVQILT GKNVKTVKEE KITVESDSDL LWTISIKKTE LSYKTLHLRP ANNTCTITVV YLYYEIKISL IVCLCGASLI CRIQNCDWML VRYWNLLVRY MRKDKIQKVN DYD // ID NC003070_309 HYPOTHETICAL; PRT; 256 AA. AC NC003070_309; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1528894...1528933, 1530184...1530335, DE 1530429...1530591, 1530855...1531270]; Length: 771. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 13 FIRST EXON; p-value: 0.000. FT GENSCAN 14 14 AA on splice site: a/ga -> R. FT GENSCAN 15 64 INTERNAL EXON; p-value: 0.000. FT GENSCAN 65 118 INTERNAL EXON; p-value: 0.000. FT GENSCAN 119 119 AA on splice site: g/gg -> G. FT GENSCAN 120 256 LAST EXON; p-value: 0.000. SQ SEQUENCE 256 AA; 27447 MW; CDC71A8FBFF3F996 CRC64; MSAKTSCSSV KNQRNAERDA TPNLTVRGFG FIDAIKSVLE AQCPGIVSCA DIIALASRDA VVFTGGPNWS VPTGRRDGRI SNAAEALANI PPPTSNITNL QTLFANQGLD LKDLVLLSGA HTIGVSHCSS FTNRLYNFTG RGGQDPALDS EYAANLKSRK CPSLNDNKTI VEMDPGSRKT FDLSYYQLVL KRRGLFQSDS ALTTNPTTLS NINRILTGSV GSFFSEFAKS MEKMGRINVK TGSAGVVRRQ CSVANS // ID NC003070_310 HYPOTHETICAL; PRT; 333 AA. AC NC003070_310; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1534304...1533942, 1532833...1532729, DE 1532596...1532483, 1532363...1532263, 1532207...1532018, DE 1531933...1531805]; Length: 1002. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 121 FIRST EXON; p-value: 0.000. FT GENSCAN 122 156 INTERNAL EXON; p-value: 0.000. FT GENSCAN 157 194 INTERNAL EXON; p-value: 0.000. FT GENSCAN 195 227 INTERNAL EXON; p-value: 0.000. FT GENSCAN 228 228 AA on splice site: ca/a -> Q. FT GENSCAN 229 291 INTERNAL EXON; p-value: 0.000. FT GENSCAN 292 333 LAST EXON; p-value: 0.000. SQ SEQUENCE 333 AA; 36628 MW; 7336F27160C9B9BC CRC64; MTIEPTQSPS SEPEVHSGED FVHIDDPRPT GDISLSDSIV NVEKDELLDE AAEEEFRGSD SVFSGGDGGG ADDDGGECSS EATKVELPEE LAKSVVILTC ESTADGGSCD VYLVGTAHVS KIASHLEVFP GAEFRVAYEE AIKYGGKVIL GDRPVQITLK RTWAKMPLWH KVKFLYSILF QAVFLPGAEE LEKMLKDMDN VDMVTLVIQE MSKEFPTLMD TIVHERDQNA GSSKYREELT NYHRLHILIN EMRRYMASSL LRVASDHSSV VAVIGKGHIN GIKKNWKQPI TMNDLMEIPS DKSVFTLKRI ISSVAVAVAG TAIVSGILLS RRK // ID NC003070_311 HYPOTHETICAL; PRT; 171 AA. AC NC003070_311; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1537116...1537010, 1536516...1536450, DE 1535886...1535584, 1535391...1535353]; Length: 516. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 35 FIRST EXON; p-value: 0.000. FT GENSCAN 36 36 AA on splice site: ag/g -> R. FT GENSCAN 37 58 INTERNAL EXON; p-value: 0.000. FT GENSCAN 59 159 INTERNAL EXON; p-value: 0.000. FT GENSCAN 160 171 LAST EXON; p-value: 0.000. SQ SEQUENCE 171 AA; 20140 MW; 2294D1179A2CA985 CRC64; MRGCVFVERP LPSSQNHTDS YLLPPVCVSQ DTSRFRINLG DLSPELYVRT RHGVRGWWID GRHLFLRDVL RAQETFRPWQ KSGGLASVYT FNTREIHRDP CQRPVTFYMQ HVSSSSHDGT IKSVYKQAYE NCTYDPVTSP RKIHEIRVFS RRLDPNIRQI LPFDFRIDQQ N // ID NC003070_312 HYPOTHETICAL; PRT; 416 AA. AC NC003070_312; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1542322...1542191, 1542113...1542024, DE 1541459...1541317, 1540321...1540081, 1539969...1539781, DE 1539693...1539383, 1539295...1539151]; Length: 1251. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 44 FIRST EXON; p-value: 0.000. FT GENSCAN 45 74 INTERNAL EXON; p-value: 0.000. FT GENSCAN 75 121 INTERNAL EXON; p-value: 0.000. FT GENSCAN 122 122 AA on splice site: cg/a -> R. FT GENSCAN 123 202 INTERNAL EXON; p-value: 0.000. FT GENSCAN 203 265 INTERNAL EXON; p-value: 0.000. FT GENSCAN 266 368 INTERNAL EXON; p-value: 0.000. FT GENSCAN 369 369 AA on splice site: aa/g -> K. FT GENSCAN 370 416 LAST EXON; p-value: 0.000. SQ SEQUENCE 416 AA; 46445 MW; E08694A91A7978FD CRC64; MEEGKGTEKK GCIITVIIVC IVLTVGLDIV AGFVGLQAQA AQQYVKHDKL ECKAPSKTAF VLGIIAVSCL ATAHQGLGIF TEPNQTKLNH LNSNRSQLNQ TKLNQTLHPT HKLFYNPNGS GRNDASDYSA YNFPLDLDSS SSSCSSSFID RNWEFSLGSL PLNNEDSSSS FEILQIFEDH TKNYSDQQVL TLQPDYLDII PKDSSCSGFK SYETKEDIEN VLMNTFDGAE AAVLDLKQYN PEEDLILTEL IDELARDNSE TTPTNTLPTS ASGMSSDIHK DYNTEALANA SSQDYFINQM IGSKAKEETN DLAILPNALV QLDCGQSQLI LTDEMLPWEN QTFVPVKGFS PKDREEAKKR YFEKKKKRKF GKQIRYESRK STADTKKRMK GRFTKAGADY DYDPRANDIN KGKIQT // ID NC003070_313 HYPOTHETICAL; PRT; 58 AA. AC NC003070_313; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1543884...1544036, 1544271...1544294]; DE Length: 177. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 51 FIRST EXON; p-value: 0.000. FT GENSCAN 52 58 LAST EXON; p-value: 0.000. SQ SEQUENCE 58 AA; 6572 MW; B1AC0B4998B85BD7 CRC64; MSFSGEYGGG SVFGNGRKRR PPPLGSVDPS PLRKKRRAAV NTWRDRRLHA RIKRFSFG // ID NC003070_314 HYPOTHETICAL; PRT; 360 AA. AC NC003070_314; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1547708...1547082, 1546336...1546187, DE 1545562...1545257]; Length: 1083. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 209 FIRST EXON; p-value: 0.000. FT GENSCAN 210 259 INTERNAL EXON; p-value: 0.000. FT GENSCAN 260 360 LAST EXON; p-value: 0.000. SQ SEQUENCE 360 AA; 38168 MW; 6EBF87C681ED902D CRC64; MRITQNVKLL LFFFFFISFL FIAVSAGESK CECSHEDDEA NKAGAKKYKI AAIPSVLAAG VIGVMFPLLG KFFPSLKPET TFFFVTKAFA AGVILATGFM HVLPEGYEKL TSPCLKGEAW EFPFTGFIAM VAAILTLSVD SFATSYFHKA HFKTSKRIGD GEEQDAGGGG GGGDELGLHV HAHGHTHGIV GVESGESQVQ LHRTRVVAQV LEVGIIVHSV VIGISLGASQ SPDTAKALFA ALMFHQCFEG LGLGGCIAQG NFNCMSITIM SIFFSVTTPV GIAVGMAISS SYDDSSPTAL IVQGVLNAAS AGILIYMSLV DFLAADFMHP KMQSNTRLQI MAHISLLVGA GVMSLLAKWA // ID NC003070_315 HYPOTHETICAL; PRT; 431 AA. AC NC003070_315; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1552433...1552048, 1551969...1551798, DE 1551417...1551211, 1551119...1550860, 1550770...1550680, DE 1550314...1550135]; Length: 1296. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 128 FIRST EXON; p-value: 0.000. FT GENSCAN 129 129 AA on splice site: ta/t -> Y. FT GENSCAN 130 186 INTERNAL EXON; p-value: 0.000. FT GENSCAN 187 255 INTERNAL EXON; p-value: 0.000. FT GENSCAN 256 341 INTERNAL EXON; p-value: 0.000. FT GENSCAN 342 342 AA on splice site: gc/g -> A. FT GENSCAN 343 372 INTERNAL EXON; p-value: 0.000. FT GENSCAN 373 431 LAST EXON; p-value: 0.000. SQ SEQUENCE 431 AA; 47935 MW; 81C7D478044E27FC CRC64; MKIISLSISI GIAIIAVLAS KTLFKTHPEA FGIKAISYSF KKSLCDHHHH HHHHHHHHHR HKPSDTKRKV SICDDFPKNI PPLDTDTTSY LCVDKNGCCN FTTVQSAVDA VGNFSQRRNV IWINSGMYYE KVVIPKTKPN ITLQGQGFDI TAIAWNDTAY SANGTFYCAT VQVFGSQFVA KNISFMNVAP IPKPGDVGAQ AVAIRIAGDE SAFVGCGFFG AQDTLHDDRG RHYFKDCYIQ GSIDFIFGNA KSLYQDCRII SMANQLSPGS KAVNGAVTAN GRSSKDENSG FSFVNCTIGG TGHVWLGRAW RPYSRVVFVS TTMTDVIAPE GWNNFNDPSR DATIFYGEYN CSGPGADMSK RAPYVQKLNE TQIRVRTLPV RFNISTYDMR SQRRAPPPEY SPPIYDLFHL DSHSISRTDE EIVYLSSQQI S // ID NC003070_316 HYPOTHETICAL; PRT; 936 AA. AC NC003070_316; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1554855...1554899, 1554951...1556585, DE 1556657...1556866, 1556944...1557078, 1557175...1557327, DE 1557424...1557631, 1558158...1558582]; Length: 2811. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 15 FIRST EXON; p-value: 0.000. FT GENSCAN 16 560 INTERNAL EXON; p-value: 0.000. FT GENSCAN 561 630 INTERNAL EXON; p-value: 0.000. FT GENSCAN 631 675 INTERNAL EXON; p-value: 0.000. FT GENSCAN 676 726 INTERNAL EXON; p-value: 0.000. FT GENSCAN 727 795 INTERNAL EXON; p-value: 0.000. FT GENSCAN 796 796 AA on splice site: g/ta -> V. FT GENSCAN 797 936 LAST EXON; p-value: 0.000. SQ SEQUENCE 936 AA; 106462 MW; B918991A24A27573 CRC64; MEEATKVSSD VPQAKFLEKI KYCDDLLQEV TKEDTVMEKE EEDTIFDGGF VKVEKEGINK KYDDDDDEKA EKQLKSLEDA LQLHDVKHKE LTEVKEAFDG LGLELENSRK KMIELEDRIR ISALEAEKLE ELQKQSASEL EEKLKISDER YSKTDALLSQ ALSQNSVLEQ KLKSLEELSE KVSELKSALI VAEEEGKKSS IQMQEYQEKV SKLESSLNQS SARNSELEED LRIALQKGAE HEDIGNVSTK RSVELQGLFQ TSQLKLEKAE EKLKDLEAIQ VKNSSLEATL SVAMEKERDL SENLNAVMEK LKSSEERLEK QAREIDEATT RSIELEALHK HSELKVQKTM EDFSSRDTEA KSLTEKSKDL EEKIRVYEGK LAEACGQSLS LQEELDQSSA ENELLADTNN QLKIKIQELE GYLDSEKETA IEKLNQKDTE AKDLITKLKS HENVIEEHKR QVLEASGVAD TRKVEVEEAL LKLNTLESTI EELEKENGDL AEVNIKLNQK LANQGSETDD FQAKLSVLEA EKYQQAKELQ ITIEDLTKQL TSERERLRSQ ISSLEEEKNQ VNEIYQSTKN ELVKLQAQLQ VDKSKSDDMV SQIEKLSALV AEKSVLESKF EQVEIHLKEE VEKVAELTSK LQEHKHKASD RDVLEEKAIQ LHKELQASHT AISEQKEALS HKHSELEATL KKSQEELDAK KSVIVHLESK LNELEQKVKL ADAKSKETES TGKEEEVEVK SRDSDLSFSN PKQTKIKKNL DAASSSGHVM IQKAETWHLM TLKIALGVAL VSVILVKLTM ADHNNTPPFD LTKLDHYIKY QPREEAEDFF VHVEVKVLGK GSSPLEISFS TSVYEFVWED EDCYELVELY EFFTEDAGID AFEAQFLVND LILYVNKTTR PLDEDFTGVF KLMAEVTLKP VQLNHAGSQK TESQQP // ID NC003070_317 HYPOTHETICAL; PRT; 61 AA. AC NC003070_317; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1559645...1559615, 1559516...1559375, DE 1559001...1558989]; Length: 186. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 10 FIRST EXON; p-value: 0.000. FT GENSCAN 11 11 AA on splice site: g/ga -> G. FT GENSCAN 12 57 INTERNAL EXON; p-value: 0.000. FT GENSCAN 58 58 AA on splice site: tg/c -> C. FT GENSCAN 59 61 LAST EXON; p-value: 0.000. SQ SEQUENCE 61 AA; 6344 MW; FA42DF10F7DF36A5 CRC64; MSQYDHNQSA GANPPPPMST CTSPPPPIGY PTNQPSHGSV AQGKVETKSK GDGFFKGCLI V // ID NC003070_318 HYPOTHETICAL; PRT; 383 AA. AC NC003070_318; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1564004...1563924, 1563502...1563389, DE 1563292...1563164, 1563079...1563011, 1562732...1562577, DE 1562404...1562368, 1562286...1562204, 1562026...1561952, DE 1561622...1561536, 1561430...1561312, 1561194...1561142, DE 1561038...1560890]; Length: 1152. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 27 FIRST EXON; p-value: 0.000. FT GENSCAN 28 65 INTERNAL EXON; p-value: 0.000. FT GENSCAN 66 108 INTERNAL EXON; p-value: 0.000. FT GENSCAN 109 131 INTERNAL EXON; p-value: 0.000. FT GENSCAN 132 183 INTERNAL EXON; p-value: 0.000. FT GENSCAN 184 195 INTERNAL EXON; p-value: 0.000. FT GENSCAN 196 196 AA on splice site: g/gt -> G. FT GENSCAN 197 223 INTERNAL EXON; p-value: 0.000. FT GENSCAN 224 248 INTERNAL EXON; p-value: 0.000. FT GENSCAN 249 277 INTERNAL EXON; p-value: 0.000. FT GENSCAN 278 316 INTERNAL EXON; p-value: 0.000. FT GENSCAN 317 317 AA on splice site: ag/t -> S. FT GENSCAN 318 334 INTERNAL EXON; p-value: 0.000. FT GENSCAN 335 335 AA on splice site: g/at -> D. FT GENSCAN 336 383 LAST EXON; p-value: 0.000. SQ SEQUENCE 383 AA; 41705 MW; 4A9F14818C221D02 CRC64; MEVGFKALLD DLDVLEKSLS DPALINKELS SEVVDSNPYS RLMALQRMGI VDNYERIREF SVAIVGIGGV GSVAAEMLTR CGIGRLLLYD YDTVELANMN RLFFRPDQVG MTKTDAAVQT LAEINPDVVL ESFTMNITTV QGFETFTSSL TNKSFCPSKE GGSGVDLVLS CVDNYEARMA VNQACNELRQ TWMESGVSED AVSGHIQLLV PGETACFACA PPLVVASGID ERTLKREGVC AASLPTTMGY NSLKDFFPTM KMRPNPQCSN VACLERQKEY MLAKPERDAA AKAKMEADAS TTIDEGPLHD DNEWNISVVD DENEKDTTKA ASSSDTLPEG LTRELPVADE YEKAIAIASG SGETEEEDDL EDLKKQLEAL NAA // ID NC003070_319 HYPOTHETICAL; PRT; 324 AA. AC NC003070_319; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1564815...1564860, 1565328...1565362, DE 1565579...1565699, 1565988...1566104, 1566341...1566489, DE 1566611...1566727, 1566825...1567009, 1567069...1567273]; Length: 975. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 15 FIRST EXON; p-value: 0.000. FT GENSCAN 16 16 AA on splice site: g/ct -> A. FT GENSCAN 17 27 INTERNAL EXON; p-value: 0.000. FT GENSCAN 28 67 INTERNAL EXON; p-value: 0.000. FT GENSCAN 68 68 AA on splice site: g/tt -> V. FT GENSCAN 69 106 INTERNAL EXON; p-value: 0.000. FT GENSCAN 107 107 AA on splice site: g/ca -> A. FT GENSCAN 108 156 INTERNAL EXON; p-value: 0.000. FT GENSCAN 157 195 INTERNAL EXON; p-value: 0.000. FT GENSCAN 196 256 INTERNAL EXON; p-value: 0.000. FT GENSCAN 257 257 AA on splice site: ac/g -> T. FT GENSCAN 258 324 LAST EXON; p-value: 0.000. SQ SEQUENCE 324 AA; 36247 MW; 5D96800B82F48EF7 CRC64; MGYGNRASSK TPAISALLVT LDGPHVKMKK TVKFSNFLLV QHVEELSEYT RFGLWWIFLG VASSIGLVFS SGVPLSSILP QVQIEAILWG LGTALGELPP YFISRAASLS GGKMKELETC SGDDNGFIAK RVNQIKSWLL SHSQYLNFFT ILILASVPNP LFDLAGIMCG QFEKPFWEFF LATLIGKAII KTHIQTVFII CVCNNQLLDW VENELIYILS FVPGFASALP ELTAKLRLMK EKYLIASPPV SSDINVTYLD DFFLGSGQEM GSFFCIRVER SRVADAFEFL RPDCDFNCTE IPKEATRRRT RCLDQQVVAD LKEI // ID NC003070_320 HYPOTHETICAL; PRT; 348 AA. AC NC003070_320; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1572279...1572172, 1571203...1571157, DE 1571075...1570987, 1570378...1570245, 1570085...1569417]; Length: 1047. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 36 FIRST EXON; p-value: 0.000. FT GENSCAN 37 51 INTERNAL EXON; p-value: 0.000. FT GENSCAN 52 52 AA on splice site: ca/g -> Q. FT GENSCAN 53 81 INTERNAL EXON; p-value: 0.000. FT GENSCAN 82 82 AA on splice site: a/gc -> S. FT GENSCAN 83 126 INTERNAL EXON; p-value: 0.000. FT GENSCAN 127 348 LAST EXON; p-value: 0.000. SQ SEQUENCE 348 AA; 39885 MW; 2743C1CFF7BED18D CRC64; MGKKEQKDHH SVVESDDKVE AVLHLLRKHS PLTLKQVFRI KQDYQKLHTQ KQLTRLVVFT LEVAISTMSR NVEQFVILFD ASFFKSASAF MNILVTTLKI VAEYYPCRLF KTFVIDPPSL FSYLWKGIRT FVDLSTATMI VSMQDFQDSF DYDDFSSSYP SRVSSLRFDT SSLKSTDKIG SCASSRFAFT VSRDGLDTVK PWCLTLTDTS STKLGHNTGA YISPLNARSF SFASPAARSE PFGGPRRSFF ASTPMPARTT DRHSIGTLRD PRIPRPSFFQ SPAIFFRRES HVSKSEKPRD SFVQFLKFYR RPYDEMTYRS KMRPPLGGLV SIVSTQIRRR HVSLSQRF // ID NC003070_321 HYPOTHETICAL; PRT; 1183 AA. AC NC003070_321; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1573064...1573146, 1576931...1577095, DE 1577343...1579194, 1579313...1579439, 1579533...1579618, DE 1580029...1580267, 1580624...1580710, 1581183...1581282, DE 1581377...1582189]; Length: 3552. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 27 FIRST EXON; p-value: 0.000. FT GENSCAN 28 28 AA on splice site: cg/c -> R. FT GENSCAN 29 82 INTERNAL EXON; p-value: 0.000. FT GENSCAN 83 83 AA on splice site: cg/g -> R. FT GENSCAN 84 700 INTERNAL EXON; p-value: 0.000. FT GENSCAN 701 742 INTERNAL EXON; p-value: 0.000. FT GENSCAN 743 743 AA on splice site: t/at -> Y. FT GENSCAN 744 771 INTERNAL EXON; p-value: 0.000. FT GENSCAN 772 850 INTERNAL EXON; p-value: 0.000. FT GENSCAN 851 851 AA on splice site: gg/a -> G. FT GENSCAN 852 879 INTERNAL EXON; p-value: 0.000. FT GENSCAN 880 880 AA on splice site: ag/g -> R. FT GENSCAN 881 913 INTERNAL EXON; p-value: 0.000. FT GENSCAN 914 1183 LAST EXON; p-value: 0.000. SQ SEQUENCE 1183 AA; 131315 MW; 8AF277255BE1D1F0 CRC64; MKGSINRSIG QISLVLRIGF NSQPFCPRGG VTLSAAGETG DSGFSKLLNS LRRFLKDSSP ITFIPPSFPI GSGLILFPTP RIRQNYQKRS RMVVSDSESS DEFMKPPPRR SGVDRKTLGA KEKFVRKRDR VEHDRNGYVR RNNEASGSFM KMNKLDIFEF DEYDGFDSAN LMRKRFDNGS VGVRGRSSFA SRRVDSSVGR SGSGREGLFD RRRNTFVNGT CSASSQEDSS SESDSDEPMR VQGINGVLKV KVNNKTNTLA ASINPRDAEI YERPPSSRKA QRRENVVVKP PFRKSNNVDN NSESEESDMS RKSKRKKSEY SKPKKEFNTK SKSTFPELVN PDVREERRGR RGGGTDKQRL RERIKGMLTD AGWTIDYKPR RNQSYLDAVY VNPSGTAYWS IIKAYDALLK QLKDEGVDAR PRKDTAAVAS VSEEIVNKLA RKAKKTRSEM TKKWKQNSSG SDSENKSEGG AYTDTSEERI RSSIKLGGKS TKKGRNGADW DELHKKSKRS LYYNNARPSC GSDSHYLHGR KTKKIGRCTL LVRSSKDKKN PAINGFNPYS GKRTLLSWLI ESGVVQLRQK VQYMRRRGAK VMLEGWITRE GIHCDCCSKI LTVSRFEIHA GSKSCQPFQN IYLESGASLL QCQVRAWNMQ KDATNLALHQ VDTDGDDPND DACGICGDGG DLICCDGCPS TYHQNCLGMQ VLPSGDWHCP NCTCKFCDAA VASGGKDGNF ISLLSCGMCE RRYHQLCLND EAHKVQSFGS ASSFCGPKCL ELFEKLQKYL GVKTEIEGGY SWSLIHRVDT DSDTNSQMSA QRIENNSKLA VGLAIMDECF LPIVDRRSGV DLIRNVLYNC GSNFNRINYT GFYTAILERG DEIISAASLR FHGMQLAEMP FIGTRHIYRR QGMCRRLFDA IESAMRSLKV EKLVIPAIPD FLHAWTGNFG FTPLDDSVRK EMRSLNTLVF PGIDMLQKPL LHEENIIAPA AAGDAMISEV ETEKKSEFTS SVEIGPYAVE GDEFVADAAN CYKDILASDE DNILVSVETA MGTICKPKDE LSRHFRGEES GISSSPCQIT LKSGTKHVLG HICDDTGSSC EDGLTDVNVE ADASLLSQEI QQASASFKVE NNLSLSISGR GSSDLSSISQ EVKSEQTSSN LDGVPSCKDY NILVPGAKLD KSKDDAFADG FLL // ID NC003070_322 HYPOTHETICAL; PRT; 192 AA. AC NC003070_322; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1583626...1583569, 1583346...1582964, DE 1582883...1582746]; Length: 579. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: 0.000. FT GENSCAN 20 20 AA on splice site: a/at -> N. FT GENSCAN 21 147 INTERNAL EXON; p-value: 0.000. FT GENSCAN 148 192 LAST EXON; p-value: 0.000. SQ SEQUENCE 192 AA; 21550 MW; DDDD10086F136F4F CRC64; MNFSPTLVHH HMKSKPQCQN EKLRQGQTSS LFDRRGFLKC VVGASSFMAT IEFSGLQAQA SEEKLDEGEG VVGAFKTLFD PNERTKSGKE LPKAYLKSAR EVVKTMRESL KENPKDNAKF RRSADAAKES IRDYLSNWRG QKTVAGEESY VELENVIRAL AKFYSKAGPS APLPDEVKTE ILDDLNKAEE FL // ID NC003070_323 HYPOTHETICAL; PRT; 131 AA. AC NC003070_323; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1584398...1584587, 1584896...1585059, DE 1585160...1585184, 1585272...1585288]; Length: 396. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 63 FIRST EXON; p-value: 0.000. FT GENSCAN 64 64 AA on splice site: a/ct -> T. FT GENSCAN 65 118 INTERNAL EXON; p-value: 0.000. FT GENSCAN 119 126 INTERNAL EXON; p-value: 0.000. FT GENSCAN 127 127 AA on splice site: g/gc -> G. FT GENSCAN 128 131 LAST EXON; p-value: 0.000. SQ SEQUENCE 131 AA; 14316 MW; 0E1E81EBFFC6238B CRC64; MKRCPVPDDY DARRVGPCIK RILRKSGYNG PVTITAVGSL SKVPRDILEV VSSTGISLYH EVATDPGPLE EEEDSCSETP GPASWICSVC RRICGPGIAG QGVDNFITHL STREHELKRQ RFTNPKGHLA N // ID NC003070_324 HYPOTHETICAL; PRT; 471 AA. AC NC003070_324; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1587267...1586907, 1586722...1586157, DE 1586089...1586029, 1585930...1585503]; Length: 1416. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 120 FIRST EXON; p-value: 0.000. FT GENSCAN 121 121 AA on splice site: g/gt -> G. FT GENSCAN 122 309 INTERNAL EXON; p-value: 0.000. FT GENSCAN 310 329 INTERNAL EXON; p-value: 0.000. FT GENSCAN 330 330 AA on splice site: g/at -> D. FT GENSCAN 331 471 LAST EXON; p-value: 0.000. SQ SEQUENCE 471 AA; 53370 MW; 582512E495010DDA CRC64; MDSMDIDQSN IGESPHLLLR PVSPLESGEG LPYAPENWPN PGDTWHWKVG PRISGKGYFV DRYLYPPKYL PGLDTEILRK NKVFRSRLSL QRYIRVHFPE ADVQKFFASF SWSIPCRDGQ GVLPQKQVQL PVYSSDEDPM RDDGSDTAVC KAGNEKCRSL MPQCEAETLP AMPCDICCGE RKFCVDCCCI LCCKLISLEH GGYSYIKCEA VVSEGHICGH VAHMNCALRA YLAGTIGGSM GLDTEYYCRR CDAKKDLFPH VNKFLEICQT VEYQGDVEKI LNLGICILRG AQRDNAKELL NCIESTVIKL KCGTSLEDLW NDDTPTIWSD YSDSGEAREN DTLQSLQDVT PIGPIPFNHE AEMHKLEEEI GEVLRALRKA QEFEYQIAEG KLHAQKECLS DLYRQLEKEK SELSRRVSGT DANSLMTNVL KRLDQIRKEV TKLKEMEEVA KGFGRTPRGV LEEYFHLNIE D // ID NC003070_325 HYPOTHETICAL; PRT; 222 AA. AC NC003070_325; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1590084...1590752]; Length: 669. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 222 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 222 AA; 24485 MW; 94C0A7A16A3570CA CRC64; MWKNFHLCFP SNLTKPSSSP SGATSDDPNR PSILLINNFN LLYDDSSAAH RRLSKPLIHD VEPSSTFTAS TSTAANSSSS SASYDDSDNY GFAPDDDSPP PDLTAVLASR RFFFSSPGCS NSITDSPDLR CRDNYDTATR LLTGGTAVKH YVQSPDPYND FRRSMQEMID AVTNAGDLRR YEFLHELLLS YLSLNAADTH KFIIRAFADI LVSLLSDGHR IS // ID NC003070_326 HYPOTHETICAL; PRT; 269 AA. AC NC003070_326; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1595730...1595674, 1595425...1595320, DE 1595185...1595132, 1595042...1594922, 1594806...1594539, DE 1594429...1594226]; Length: 810. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 19 FIRST EXON; p-value: 0.000. FT GENSCAN 20 54 INTERNAL EXON; p-value: 0.000. FT GENSCAN 55 55 AA on splice site: g/tg -> V. FT GENSCAN 56 72 INTERNAL EXON; p-value: 0.000. FT GENSCAN 73 73 AA on splice site: g/ga -> G. FT GENSCAN 74 112 INTERNAL EXON; p-value: 0.000. FT GENSCAN 113 113 AA on splice site: gt/c -> V. FT GENSCAN 114 202 INTERNAL EXON; p-value: 0.000. FT GENSCAN 203 269 LAST EXON; p-value: 0.000. SQ SEQUENCE 269 AA; 29862 MW; 5D502113E628FD3A CRC64; MAMAALNQLF VVLASKPEQE KITPEESRAI VSCHFKALWT AGFASGVGGG LTWQVTKKLK KPKGLERVAL AAGVAASTFV VAWNWSSSKY AVSSLDHILS QDATRMQKEL VNVLVRSNRG EAWRWQLMSK HFYPESVYGD EGDKPQMRWR RRTTFTEIAS SYDDVNATKS QRNPNGLPNP SHRRISGGSD ASKTKQTLQN SSGNSDGEMA EEDVLDIVFG CSEATESIPA PVISKLASKT QTRKQKRAQR RQRLKNREAS TNTPQYELA // ID NC003070_327 HYPOTHETICAL; PRT; 318 AA. AC NC003070_327; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1597824...1597132, 1597051...1596998, DE 1596582...1596373]; Length: 957. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 231 FIRST EXON; p-value: 0.000. FT GENSCAN 232 249 INTERNAL EXON; p-value: 0.000. FT GENSCAN 250 318 LAST EXON; p-value: 0.000. SQ SEQUENCE 318 AA; 36027 MW; 4DFE86B326E95A10 CRC64; MKGKILVSPS EKSMTSSTTT TTTTTTTVST EDWERRDSNC YFPDCRKDAN CSCEICLDSL NATLDLMPLS VQKSSLTKLS SASNFKSTVE STPTSFDPTV VTTPASVSRP ILKLMISSPK KKLTKKSKVF ENEEQRKTTK KERRLLIVVF LKVILVIGLV LGLELGFSWV LEEVFKPEFT EEIVRNAYER TQADLDLGVK LRLLEDEFKG FVTSRKFSRC TGSDSKWKIN QDGPMLNSKC VLYKSAIEEL DPNTWVLEYS QSSVMDDSSS LLSLTIDIME HLVFRVAKNV NRERYWMFSS SRRLYREAES KASAMTPT // ID NC003070_328 HYPOTHETICAL; PRT; 108 AA. AC NC003070_328; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1600213...1600333, 1600677...1600882]; DE Length: 327. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 40 FIRST EXON; p-value: 0.000. FT GENSCAN 41 41 AA on splice site: g/at -> D. FT GENSCAN 42 108 LAST EXON; p-value: 0.000. SQ SEQUENCE 108 AA; 11686 MW; FC7A5281AAA87CE5 CRC64; MDCLCLIVTA GVPISIPINR TLAISLPRAC GIPGVPVQCK DSQTSDPEGT RELFTATSSS NSKTFCSEFV SKRFSGLLTS HNSQGLLLSV RPLLRQLRKL LMVNHINS // ID NC003070_329 HYPOTHETICAL; PRT; 1048 AA. AC NC003070_329; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1605541...1605504, 1604676...1603875, DE 1603786...1603622, 1603587...1602052, 1601961...1601356]; Length: 3147. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 12 FIRST EXON; p-value: 0.000. FT GENSCAN 13 13 AA on splice site: cc/a -> P. FT GENSCAN 14 280 INTERNAL EXON; p-value: 0.000. FT GENSCAN 281 335 INTERNAL EXON; p-value: 0.000. FT GENSCAN 336 847 INTERNAL EXON; p-value: 0.000. FT GENSCAN 848 1048 LAST EXON; p-value: 0.000. SQ SEQUENCE 1048 AA; 118441 MW; D691BC369C445A3F CRC64; MVTISGVTTS TRPLSSSSAM SVSGYKSDDE YSVIADKGEI GFIDYQNDGS SGCYNPFDEG PVVVSVPFPF KKEKPQSVTV GETSFDSFTV KNTMDEPVDL WTKIYASNPE DSFTLSILKP PSKDSDLKER QCFYETFTLE DRMLEPGDTL TIWVSCKPKD IGLHTTVVTV DWGSDRVERV VFLLAEDKIS SSLTSNRPYS RSRRAPKKDF AVDDYVKGSR PSKVVERSFR NRLPLYEIPK EIREMIENKE FPDDLNEGLT ARNYANYYKT LLIMEELQLE EDMRAYDMEN VSMKRRGIYL SLEVPGLAER RPSLVHGDFI FVRHAYDDGT DHAYQITFCF TTANMFNHLS FRGYKIEINF SQGFVHRVEA DEVHMKFASE FHQRHTAGSV YNVRFTYNRI NTRRLYQAVD AAEMLDPNFL FPSLHSGKRM IKTKPFVPIS PALNAEQICS IEMVLGCKGA PPYVIHGPPG TGKTMTLVEA IVQLYTTQRN ARVLVCAPSN SAADHILEKL LCLEGVRIKD NEIFRLNAAT RSYEEIKPEI IRFCFFDELI FKCPPLKALT RYKLVVSTYM SASLLNAEGV NRGHFTHILL DEAGQASEPE NMIAVSNLCL TETVVVLAGD PRQLGPVIYS RDAESLGLGK SYLERLFECD YYCEGDENYV TKLVKNYRCH PEILDLPSKL FYDGELVASK EDTDSVLASL NFLPNKEFPM VFYGIQGCDE REGNNPSWFN RIEISKVIET IKRLTANDCV QEEDIGVITP YRQQVMKIKE VLDRLDMTEV KVGSVEQFQG QEKQVIIIST VRSTIKHNEF DRAYCLGFLS NPRRFNVAIT RAISLLVIIG NPHIICKDMN WNKLLWRCVD NNAYQGCGLP EQEEFVEEPF KQEGSSNGPQ YPPEAEWNNS GELNNGGANE NGEWSDGWNN NGGTKEKNEW SDGWNSNGGG TKKKDEWSDG WDNNGGTNGI NQEGSSNAPQ DPQEAEWNDS GEVKNGGTKE KDVRSDGWNN NGGKNEKEEC CDGWKDGGSG EEIKNGGKFE TRGDFVAKEE DEWSDGWK // ID NC003070_330 HYPOTHETICAL; PRT; 559 AA. AC NC003070_330; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1611155...1611140, 1610760...1610673, DE 1610589...1610485, 1610396...1609601, 1609508...1609315, DE 1609206...1609098, 1609015...1608867, 1608779...1608557]; Length: 1680. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 5 FIRST EXON; p-value: 0.000. FT GENSCAN 6 6 AA on splice site: g/ag -> E. FT GENSCAN 7 34 INTERNAL EXON; p-value: 0.000. FT GENSCAN 35 35 AA on splice site: ag/c -> S. FT GENSCAN 36 69 INTERNAL EXON; p-value: 0.000. FT GENSCAN 70 70 AA on splice site: gg/a -> G. FT GENSCAN 71 335 INTERNAL EXON; p-value: 0.000. FT GENSCAN 336 399 INTERNAL EXON; p-value: 0.000. FT GENSCAN 400 400 AA on splice site: ga/c -> D. FT GENSCAN 401 436 INTERNAL EXON; p-value: 0.000. FT GENSCAN 437 485 INTERNAL EXON; p-value: 0.000. FT GENSCAN 486 486 AA on splice site: tg/g -> W. FT GENSCAN 487 559 LAST EXON; p-value: 0.000. SQ SEQUENCE 559 AA; 64213 MW; AE459A9396C4DF24 CRC64; MVQEDEKLSK NWEQQARQRR MNYENPRIID VQNYSIFVAT WNVAGRSPPS DLNLDEWLHS SAPADIYVLG FQEIVPLNAG NVLGAEDNGP AQKWLSLIRK TLNNRPGTSG TSGYHTPSPI PVPMAELDAD FSGSTRQKNS TFFHRRSFQT PSSTWNDPSI PQPGLDRRFS VCDRVFFSHR PSDFDPSFRG SSSSHRPSDY SRRPSDYSRR PSDYSRRPSD YSRRPSDSRP SDYSRPSDYY SRPSDYSRPS DFSRSSDDDN GLGDSPSTVL YSPGSAANEN GYRIPWNSSQ YCLVASKQMV GVFLTIWVKS ELREHVKNMK VSCVGRGLMG YLGNKGSISI SMLLHQTSFC FVCTHLTSGQ KEGDELKRNS DVMEILKKTR FPRVKSSEEE KSPENILQHD RVIWLGDLNY RIALSYRSAK ALVEMQNWRA LLENDQLRIE QKRGHVFKGW NEGKIYFPPT YKYSRNSDRY SGDDLHPKEK RRTPAWCDRI LWFGEGLHQL SYVRGESRFS DHRPVYGIFC AEVESAHNRI KRTTSYSASR VQAEELLPYS RGYTELSFF // ID NC003070_331 HYPOTHETICAL; PRT; 1578 AA. AC NC003070_331; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1623480...1623334, 1623212...1621047, DE 1620998...1620816, 1620728...1618817, 1618361...1618224, DE 1615599...1615531, 1614230...1614109]; Length: 4737. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 49 FIRST EXON; p-value: 0.000. FT GENSCAN 50 771 INTERNAL EXON; p-value: 0.000. FT GENSCAN 772 832 INTERNAL EXON; p-value: 0.000. FT GENSCAN 833 1469 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1470 1470 AA on splice site: g/aa -> E. FT GENSCAN 1471 1515 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1516 1516 AA on splice site: g/aa -> E. FT GENSCAN 1517 1538 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1539 1539 AA on splice site: c/aa -> Q. FT GENSCAN 1540 1578 LAST EXON; p-value: 0.000. SQ SEQUENCE 1578 AA; 178414 MW; 80EFCE39E80E4376 CRC64; MVWLCHHAFF MPFSLSLSAH LSLFHFLKPV KIKNFSKNSR TSSSSSQTRI FVFALMECIG KRVKSRSWQR LQAVNKRKKM ETVAPVTSPP KKRRQKKPKN YDSDIEDITP TCNDSVPPPQ VSNMYSVPNN SVKESFSRIM RDLNVEKKSG PSSSRLTDGS EQNPCLKERS FRVSDLGVEK KCSPEITDLD VGIPVPRFSK LKDVSEQKNT CLMQKSSPEI ADLDLVISVP SSSVLKDVSE EIRFLKDKCS PEIRGLVLEK SVPGEIEILS DSESETEARR RASAKKKLFE ESSRIVESIS DGEDSSSETD EEEEENQDSE DNNTKDNVTV ESLSSEDPSS SSSSSSSSSS SSSSSSSDDE SYVKEVVGDN RDDDDLRKAS SPIKRVSLVE RKALVRYKRS GSSLTKPRER DNKIQKLNHR EEEKKERQRE VVRVVTKQPS NVVYTCAHCG KENTGNPESH SSFIRPHSIR DEIEDVNNFA STNVSKYEDS VSINSGKTTG APSRPEVENP ETGKELNTPE KPSISRPEIF TTEKAIDVQV PEEPSRPEIY SSEKAKEVQA PEMPSRPEVF SSEKAKEIQV PEMPSIPEIQ NSEKAKEVQA NNRMGLTTPA VAEGLNKSVV TNEHIEDDSD SSISSGDGYE SDPTLKDKEV KINNHSDWRI LNGNNKEVDL FRLLVNSVWE KGQLGEEDEA DELVSSAEDQ SQEQAREDHR KYDDAGLLII RPPPLIEKFG VEEPQSPPVV SEIDSEEDRL WEELAFFTKS NDIGGNELFS NEQGKSSLKC LQVEKNISAN ETPAAQCKKG KHDLCIDLEV GLKCMHCGFV EREIRSMDVS EWGEKTTRER RKFDRFEEEE GSSFIGKLGF DAPNNSLNEG CVSSEGTVWD KIPGVKSQMY PHQQEGFEFI WKNLAGTIML NELKDFENSD ETGGCIMSHA PGTGKTRLTI IFLQAYLQCF PDCKPVIIAP ASLLLTWAEE FKKWNISIPF HNLSSLDFTG KENSAALGLL MQKNATARSN NEIRMVKIYS WIKSKSILGI SYNLYEKLAG VKDEDKKTKM VREVKPDKEL DDIREILMGR PGLLVLDEAH TPRNQRSCIW KTLSKVETQK RILLSGTPFQ NNFLELCNVL GLARPKYLER LTSTLKKSGM TVTKRGKKNL GNEINNRGIE ELKAVMLPFV HVHKGSILQS SLPGLRECVV VLNPPELQRR VLESIEVTHN RKTKNVFETE HKLSLVSVHP SLVSRCKISE KERLSIDEAL LAQLKKVRLD PNQSVKTRFL MEFVELCEVI KEKVLVFSQY IDPLKLIMKH LVSRFKWNPG EEVLYMHGKL EQKQRQTLIN EFNDPKSKAK VFLASTKACS EGISLVGASR VILLDVVWNP AVERQAISRA YRIGQKRIVY TYHLVAKGTP EGPKYCKQAQ KDRISELVFA CSSRHDKGKE KIAEAVTEDK VLDTMVEHSK LGDMFDNLIV QPKEADLVEE EHALIIATNE LLIIIQEQHQ QQRLQYDDHF SSRVACIRDL LTGSKEYSNN DYTHIILYFK CKIINRRKQE TEEFSSIVFG KTSFVNTLFE RALRIYEFGI KSISNFSL // ID NC003070_332 HYPOTHETICAL; PRT; 691 AA. AC NC003070_332; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1625097...1625201, 1625526...1625576, DE 1625697...1625817, 1626230...1626351, 1626509...1626610, DE 1626841...1626915, 1627328...1627407, 1627603...1627753, DE 1627827...1628057, 1628159...1628437, 1628528...1628641, DE 1628793...1628857, 1630015...1630418, 1630514...1630689]; Length: 2076. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 35 FIRST EXON; p-value: 0.000. FT GENSCAN 36 52 INTERNAL EXON; p-value: 0.000. FT GENSCAN 53 92 INTERNAL EXON; p-value: 0.000. FT GENSCAN 93 93 AA on splice site: g/gt -> G. FT GENSCAN 94 133 INTERNAL EXON; p-value: 0.000. FT GENSCAN 134 167 INTERNAL EXON; p-value: 0.000. FT GENSCAN 168 192 INTERNAL EXON; p-value: 0.000. FT GENSCAN 193 218 INTERNAL EXON; p-value: 0.000. FT GENSCAN 219 219 AA on splice site: ag/c -> S. FT GENSCAN 220 269 INTERNAL EXON; p-value: 0.000. FT GENSCAN 270 346 INTERNAL EXON; p-value: 0.000. FT GENSCAN 347 439 INTERNAL EXON; p-value: 0.000. FT GENSCAN 440 477 INTERNAL EXON; p-value: 0.000. FT GENSCAN 478 498 INTERNAL EXON; p-value: 0.000. FT GENSCAN 499 499 AA on splice site: tg/g -> W. FT GENSCAN 500 633 INTERNAL EXON; p-value: 0.000. FT GENSCAN 634 634 AA on splice site: g/ag -> E. FT GENSCAN 635 691 LAST EXON; p-value: 0.000. SQ SEQUENCE 691 AA; 77950 MW; D5F120B26318A0F5 CRC64; MGFIVGVVIG LLVGIAIIIG FVKLENSRSK LRSELLTWLN HHLTKIWPYV DEAASELIKA SVEPVLEQYR PAIVASLTFS KLTLGTVAPQ FTGVSVIDGD KNGITLELDM QWDGNPNIVL GVKTLVGVSL PIQVKNIGFT GVFRLIFRPL VEDFPCFGAV SVSLREKKKL DFTLKVVGGD ISAIPGLSEA IEETIRDAVE DSITWPVRKV IPIIPGDYSD LELKPVGMLE VKLVQAKNLT NKDLVGKSDP FAKMFIRPLR EKTKRSKTIN NDLNPIWNEH FEFVVEDAST QHLVVRIYDD EGVQASELIG CAQIRLCELE PGKVKDVWLK LVKDLEIQRD TKNRGEVHLE LLYIPYGSGN GIVNPFVTSS MTSLERVLKN DTTDEENASS RKRKDVIVRG VLSVTVISAE EIPIQDLMGK ADPYVVLSMK KSGAKSKTRV VNDSLNPVWN QTFDFVVEDG LHDMLVLEVW DHDTFGKDYI GRCILTLTRV IMEEEYKDWF HFYAYDMTRQ VEAHHFCGHI NEDMRQCLIY DGPDANARLI GLEYIVTEKL FMTLPDDEKK LWHTHEWEVK GGFLFMPGVP EAIQRQDLEK VAKTYGKVYH FWQVDLGHQL PIGLPNIMMA VTRDGQLYPE MIKETEKQFG VSIDKERESR AYMKGPDHGI HPLANGGGKG LKLELREVDI KPVESVPRVF V // ID NC003070_333 HYPOTHETICAL; PRT; 717 AA. AC NC003070_333; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1635693...1634917, 1634776...1634603, DE 1634094...1633933, 1633760...1633618, 1633190...1633131, DE 1633052...1632971, 1632420...1632343, 1632076...1631915, DE 1631841...1631578, 1631493...1631389, 1631271...1631125]; Length: 2154. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 259 FIRST EXON; p-value: 0.000. FT GENSCAN 260 317 INTERNAL EXON; p-value: 0.000. FT GENSCAN 318 371 INTERNAL EXON; p-value: 0.000. FT GENSCAN 372 418 INTERNAL EXON; p-value: 0.000. FT GENSCAN 419 419 AA on splice site: aa/t -> N. FT GENSCAN 420 438 INTERNAL EXON; p-value: 0.000. FT GENSCAN 439 439 AA on splice site: ag/c -> S. FT GENSCAN 440 466 INTERNAL EXON; p-value: 0.000. FT GENSCAN 467 492 INTERNAL EXON; p-value: 0.000. FT GENSCAN 493 546 INTERNAL EXON; p-value: 0.000. FT GENSCAN 547 634 INTERNAL EXON; p-value: 0.000. FT GENSCAN 635 669 INTERNAL EXON; p-value: 0.000. FT GENSCAN 670 717 LAST EXON; p-value: 0.000. SQ SEQUENCE 717 AA; 79334 MW; B6FEE75650769ED1 CRC64; MASMDPEGID GVRMTWNVWP RTKVEASKCV IPVAACISPI RYHRDIPSVE YAPLRCRICT AALNPFARVD FLAKIWICPI CFQRNHFPPH YHVMSETNVP CELYPQYTTV EYTLPNPSQP TGVGNFDQTG AVSGQPSPSV FVFVLDTCMI EEEFGYAKSA LKQAIGLLPE NALVGFVSFG TQAHVHELGF SDLTKVYVFR GDKEISKDQV LEQLGLGASG RRNPVGGFPM GRDNSANFGY SGVNRFLLPA SDCEFTIDLL LEELQTDQWP VQAGRRQSRC TGVAISVATG LLGACFPGTG ARIVALIGGP CSEGPGTIVS KDLSEPLRSH KDLDKDAAPF YKKAEKFYDA LANQLVNQGH VLDLFASALD QVGVAEMKAA VERTGGLVVL SESFGHSVFK DSFKRVFEDG EESLGLCFND QSSAPGGVNN QLYLQFMTSY QNSKGKTLQR VTTVTRQWVD TGLSTEELVQ GFDQETAAVV VARLASLKME TEEGFDATRW LDRNLIRLCS KFGDYRKDDP ASFTLNPNFS LFPQFTFNLR RSQFVQVFNN SPDETAYNRM LLNRENISNA AVMIQPSLTT YSFNSLPQPA LLDVASIGAD RILLLDSYIS VVVFHGMTIA QWRNLGYQNQ PEHQAFAQLL EAPQEDAQMI IRDRFPVPRL VVCDQHGSQA RFLLAKLNPS ATYNNASEMN AGSDIIFTDD VSLQVFFQHL QKLAVQS // ID NC003070_334 HYPOTHETICAL; PRT; 455 AA. AC NC003070_334; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1637862...1636495]; Length: 1368. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 455 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 455 AA; 51191 MW; CFCFA2B93EA3F547 CRC64; MAQPHFLLVT FPAQGHVNPS LRFARRLIKT TGARVTFATC LSVIHRSMIP NHNNVENLSF LTFSDGFDDG VISNTDDVQN RLVHFERNGD KALSDFIEAN QNGDSPVSCL IYTILPNWVP KVARRFHLPS VHLWIQPAFA FDIYYNYSTG NNSVFEFPNL PSLEIRDLPS FLSPSNTNKA AQAVYQELMD FLKEESNPKI LVNTFDSLEP EFLTAIPNIE MVAVGPLLPA EIFTGSESGK DLSRDHQSSS YTLWLDSKTE SSVIYVSFGT MVELSKKQIE ELARALIEGG RPFLWVITDK LNREAKIEGE EETEIEKIAG FRHELEEVGM IVSWCSQIEV LRHRAIGCFL THCGWSSSLE SLVLGVPVVA FPMWSDQPAN AKLLEEIWKT GVRVRENSEG LVERGEIMRC LEAVMEAKSV ELRENAEKWK RLATEAGREG GSSDKNVEAF VKSLF // ID NC003070_335 HYPOTHETICAL; PRT; 497 AA. AC NC003070_335; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1639535...1640313, 1640385...1640650, DE 1641900...1642348]; Length: 1494. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 259 FIRST EXON; p-value: 0.000. FT GENSCAN 260 260 AA on splice site: tg/g -> W. FT GENSCAN 261 348 INTERNAL EXON; p-value: 0.000. FT GENSCAN 349 349 AA on splice site: g/tt -> V. FT GENSCAN 350 497 LAST EXON; p-value: 0.000. SQ SEQUENCE 497 AA; 56691 MW; 73DECA44658A04E6 CRC64; MRYFSVLMSE HYIINWQIHK SVRLFSTSPY LTLGSVVEDL PEEDGSYIGD ILLFDPAKEE LVTVRDKTIP EKLVNSRVVG ASHGWTFFSD RCNHNSVCIS DLFTPMASKS NTKIIPLPLL TTMIYGQTEA VWNVAMSSSS PHQDNEEEDC VVAINFLGSQ LSVCRPGRDH GWTNKQIPFI CSENSNLMYS KRDQRFYLPA PGGNYLCSWD LHFDNDPKFN ELVFLNLPEL PQSEWELLNS CFKEDHWVES PSGQSFLVKW YSHIPSQRYK DPILMVLRED EETEEGTRNM CYTEDIGDLC IFLSKSDPFC VVASSCPGLK PNSIYLMGRC FAVYDLTTRT ARHFKAPKVE RGYNVSPLKV TTGSAQTQLD TRKLDSHYTV FLTKRRRGGH GYVKNFFGLA SGQSRKPRQR NHDRVCFGCQ RIENVADTYG TISPVTEKSP SMGSSHHFFI HELFGNCFVR DRDYLFLGRV KQKNISNFAA GRQNFLNPSA KCQIRAG // ID NC003070_336 HYPOTHETICAL; PRT; 469 AA. AC NC003070_336; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1647082...1645673]; Length: 1410. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 469 SINGLE EXON; p-value: 0.000. SQ SEQUENCE 469 AA; 52814 MW; 77C6088342C78D76 CRC64; MAPPHFLLVT FPAQGHVNPS LRFARRLIKR TGARVTFVTC VSVFHNSMIA NHNKVENLSF LTFSDGFDDG GISTYEDRQK RSVNLKVNGD KALSDFIEAT KNGDSPVTCL IYTILLNWAP KVARRFQLPS ALLWIQPALV FNIYYTHFMG NKSVFELPNL SSLEIRDLPS FLTPSNTNKG AYDAFQEMME FLIKETKPKI LINTFDSLEP EALTAFPNID MVAVGPLLPT EIFSGSTNKS VKDQSSSYTL WLDSKTESSV IYVSFGTMVE LSKKQIEELA RALIEGKRPF LWVITDKSNR ETKTEGEEET EIEKIAGFRH ELEEVGMIVS WCSQIEVLSH RAVGCFVTHC GWSSTLESLV LGVPVVAFPM WSDQPTNAKL LEESWKTGVR VRENKDGLVE RGEIRRCLEA VMEEKSVELR ENAKKWKRLA MEAGREGGSS DKNMEAFVED ICGESLIQNL CEAEEVKVK // ID NC003070_337 HYPOTHETICAL; PRT; 1700 AA. AC NC003070_337; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1660877...1660834, 1658890...1658806, DE 1658637...1658490, 1658398...1658298, 1658204...1658083, DE 1657980...1657887, 1657750...1657640, 1657555...1657439, DE 1657354...1657262, 1657157...1657066, 1656920...1656899, DE 1656691...1656554, 1656466...1656390, 1656293...1656200, DE 1655747...1655630, 1655456...1655376, 1655243...1655119, DE 1654668...1654587, 1654499...1654450, 1654155...1654018, DE 1653916...1653822, 1653050...1652966, 1652878...1652718, DE 1652636...1652513, 1652423...1652326, 1652175...1652109, DE 1651983...1651832, 1651590...1651436, 1651323...1651255, DE 1651170...1650981, 1650886...1650727, 1650575...1650456, DE 1650333...1650215, 1649961...1649903, 1649759...1649530, DE 1649425...1648625, 1648548...1648498, 1648411...1648226, DE 1648127...1647879]; Length: 5103. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 14 FIRST EXON; p-value: 0.000. FT GENSCAN 15 15 AA on splice site: at/a -> I. FT GENSCAN 16 43 INTERNAL EXON; p-value: 0.000. FT GENSCAN 44 92 INTERNAL EXON; p-value: 0.000. FT GENSCAN 93 93 AA on splice site: t/gt -> C. FT GENSCAN 94 126 INTERNAL EXON; p-value: 0.000. FT GENSCAN 127 166 INTERNAL EXON; p-value: 0.000. FT GENSCAN 167 167 AA on splice site: cg/t -> R. FT GENSCAN 168 198 INTERNAL EXON; p-value: 0.000. FT GENSCAN 199 235 INTERNAL EXON; p-value: 0.000. FT GENSCAN 236 274 INTERNAL EXON; p-value: 0.000. FT GENSCAN 275 305 INTERNAL EXON; p-value: 0.000. FT GENSCAN 306 335 INTERNAL EXON; p-value: 0.000. FT GENSCAN 336 336 AA on splice site: tg/g -> W. FT GENSCAN 337 343 INTERNAL EXON; p-value: 0.000. FT GENSCAN 344 389 INTERNAL EXON; p-value: 0.000. FT GENSCAN 390 414 INTERNAL EXON; p-value: 0.000. FT GENSCAN 415 415 AA on splice site: tg/g -> W. FT GENSCAN 416 446 INTERNAL EXON; p-value: 0.000. FT GENSCAN 447 485 INTERNAL EXON; p-value: 0.000. FT GENSCAN 486 486 AA on splice site: g/ct -> A. FT GENSCAN 487 512 INTERNAL EXON; p-value: 0.000. FT GENSCAN 513 513 AA on splice site: g/tg -> V. FT GENSCAN 514 554 INTERNAL EXON; p-value: 0.000. FT GENSCAN 555 581 INTERNAL EXON; p-value: 0.000. FT GENSCAN 582 582 AA on splice site: g/cc -> A. FT GENSCAN 583 598 INTERNAL EXON; p-value: 0.000. FT GENSCAN 599 644 INTERNAL EXON; p-value: 0.000. FT GENSCAN 645 675 INTERNAL EXON; p-value: 0.000. FT GENSCAN 676 676 AA on splice site: ag/g -> R. FT GENSCAN 677 704 INTERNAL EXON; p-value: 0.000. FT GENSCAN 705 757 INTERNAL EXON; p-value: 0.000. FT GENSCAN 758 758 AA on splice site: ca/g -> Q. FT GENSCAN 759 799 INTERNAL EXON; p-value: 0.000. FT GENSCAN 800 831 INTERNAL EXON; p-value: 0.000. FT GENSCAN 832 832 AA on splice site: ag/c -> S. FT GENSCAN 833 854 INTERNAL EXON; p-value: 0.000. FT GENSCAN 855 904 INTERNAL EXON; p-value: 0.000. FT GENSCAN 905 905 AA on splice site: tc/c -> S. FT GENSCAN 906 956 INTERNAL EXON; p-value: 0.000. FT GENSCAN 957 957 AA on splice site: g/tg -> V. FT GENSCAN 958 979 INTERNAL EXON; p-value: 0.000. FT GENSCAN 980 980 AA on splice site: g/aa -> E. FT GENSCAN 981 1042 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1043 1043 AA on splice site: ac/g -> T. FT GENSCAN 1044 1096 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1097 1136 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1137 1175 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1176 1176 AA on splice site: ag/a -> R. FT GENSCAN 1177 1195 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1196 1196 AA on splice site: g/gt -> G. FT GENSCAN 1197 1272 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1273 1539 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1540 1556 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1557 1618 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1619 1700 LAST EXON; p-value: 0.000. SQ SEQUENCE 1700 AA; 196831 MW; 8D80C27DD49DE09B CRC64; MTGDLVKREK QNVSIKKTAF RVYCAYSDVC FSVDESEIRF SVKRRILRTQ TVGSLGEAML DSEVVPSSLV EIAPILRVAN EVEASNPRVA YLCRFYAFEK AHRLDPTSSG RGVRQFKTAL LQRLERENET TLAGRQKSDA REMQSFYQHY YKKYIQALLN AADKADRAQL TKAYQTAAVL FEVLKAVNQT EDVEVADEIL ETHNKVEEKT QIYVPYNILP LDPDSQNQAI MRLPEIQAAV AALRNTRGLP WTAGHKKKLD EDILDWLQSM FGFQKDNVLN QREHLILLLA NVHIRQFPKP DQQPKLDDRA LTIVMKKLFR NYKKWCKYLG RKSSLWLPTI QQEMAFELYG MLAGSVSPMT GEHVKPAYGG EDEAFLQKVV TPIYQTISKE AKRSRGGKSK HSVWRNYDDL NEYFWSIRCF RLGWPMRADA DFFCQTAEEL RLERSEAMIV IAWNGSGELS AIFQGDVFLK VLSVFITAAI LKLAQAVLDI ALSWKARHSM SLYVKLRYVM KVVAILIYLS PNMLSALLFL FPFIRRYLER SDYKIMMLMM WWSQIKPLVG PTKDIMRIHI SVYSWHEFFP HAKNNLGVVI ALWSPVILIR TLGMLRSRFQ SIPGAFNDCL VPQDNSDDTK KKRFRATFSR KFDQLPSSKD KEAARFAQMW NKIISSFREE DLISDREMEL LLVPYWSDPD LDLIRWPPFL LASKIPIALD MAKDSNGKDR ELKKRLAVDS YMTCAVRECY ASFKNLINYL VVGEREGQVI NDIFSKIDEH IEKETLITEL NLSALPDLYG QFVRLIEYLL ENREEDKDQI VIVLLNMLEL VTRDIMEEEV PSANISVNFD SQFILKRKLG KKKQIKRLHL LLTVKESAMD VPSNLEARRR LTFFSNSLFM DMPPAPKIRN MLSFSYQLSP WLTSDEWTNF LERVKCGNEE ELRAREDLEE ELRLWASYRG QTLTKTVRGM MYYRKALELQ AFLDMAKDEE LLKGYKALEL TSEEASKSGG SLWAQCQALA DMKFTFVVSC QQYSIHKRSG DQRAKDILRL MTTYPSIRVA YIDEVEQTHK ESYKGTEEKI YYSALVKAAP QTKPMDSSES VQTLDQLIYR IKLPGPAILG EGKPENQNHA IIFTRGEGLQ TIDMNQDNYM EEAFKMRNLL QEFLEKHGGV RCPTILGLRE HIFTGRVRFH YGHPDIFDRL FHLTRGFNST LREGNVTHHE YIQVGKGRDV GLNQISMFEA KIANGNGEQT LSRDLYRLGH RFDFFRMLSC YFTTIGFYFS TMLTVLTVYV FLYGRLYLVL SGLEEGLSSQ RAFRNNKPLE AALASQSFVQ IGFLMALPMM MEIGLERGFH NALIEFVLMQ LQLASVFFTF QLGTKTHYYG RTLFHGGAEY RGTGRGFVVF HAKFAENYRF YSRSHFVKGI ELMILLLVYQ IFGQSYRGVV TYILITVSIW FMVVTWLFAP FLFNPSGFEW QKIVDDWTDW NKWIYNRGGI GVPPEKSWES WWEKELEHLR HSGVRGITLE IFLALRFFIF QYGLVYHLST FKGKNQSFWV YGASWFVILF ILLIVKGLGV GRRRFSTNFQ LLFRIIKGLV FLTFVAILIT FLALPLITIK DLFICMLAFM PTGWGMLLIA QACKPLIQQL GIWSSVRTLA RGYEIVMGLL LFTPVAFLAW FPFVSEFQTR MLFNQAFSRG LQISRILGGQ RKDRSSKNKE // ID NC003070_338 HYPOTHETICAL; PRT; 1193 AA. AC NC003070_338; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1668793...1668539, 1668454...1667998, DE 1667777...1667661, 1667624...1667236, 1666942...1665597, DE 1665155...1665124, 1664068...1663999, 1663921...1663806, DE 1663733...1663235, 1663017...1662913, 1662837...1662642]; Length: 3582. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 85 FIRST EXON; p-value: 0.000. FT GENSCAN 86 237 INTERNAL EXON; p-value: 0.000. FT GENSCAN 238 238 AA on splice site: g/gg -> G. FT GENSCAN 239 276 INTERNAL EXON; p-value: 0.000. FT GENSCAN 277 277 AA on splice site: g/ga -> G. FT GENSCAN 278 406 INTERNAL EXON; p-value: 0.000. FT GENSCAN 407 854 INTERNAL EXON; p-value: 0.000. FT GENSCAN 855 855 AA on splice site: ag/a -> R. FT GENSCAN 856 865 INTERNAL EXON; p-value: 0.000. FT GENSCAN 866 866 AA on splice site: a/at -> N. FT GENSCAN 867 888 INTERNAL EXON; p-value: 0.000. FT GENSCAN 889 889 AA on splice site: ag/g -> R. FT GENSCAN 890 927 INTERNAL EXON; p-value: 0.000. FT GENSCAN 928 928 AA on splice site: g/ag -> E. FT GENSCAN 929 1093 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1094 1094 AA on splice site: cg/a -> R. FT GENSCAN 1095 1128 INTERNAL EXON; p-value: 0.000. FT GENSCAN 1129 1129 AA on splice site: tc/g -> S. FT GENSCAN 1130 1193 LAST EXON; p-value: 0.000. SQ SEQUENCE 1193 AA; 132240 MW; FDEB4456020A1DD7 CRC64; MSSGAPLNVT NPNYDIEESR FGKIVCYDQS LLFEKREQKG WESGSTLASS LPFFITQLFV ANLSYRVLYY LTRPLYLPPF VAQILCGLLF SPSVLGNTRF IIAHVFPYRF TMVLETFANL ALVYNIFLLG LGMDLRMVRI TELKPVIIAF TGLLVALPVG AFLYYLPGNG HPDKIISGCV FWSVALACTN FPDLARILAD LKLLRSDMGR TAMCAAIVTD LCTWVLLVFG FASFSKSGTW NKMMPFVIIT TAIFVLLCIF VIRPGIAWIF AKTVKAGGVV LCGLITDACG VHSITGAFLF GLSIPHDHII RNMIEEKLHD FLSGILMPLF YIICGLRADI GFMLQFTDKF MMVVVICSSF LVKIVTTVIT SLFMHIPMRD AFAIGALMNT KGTLSLVVLN AGRDTKALDS PMYTHMTIAL LVMSLVVEPL LAFAYKPKKK LAHYKHRTVQ KIKGETELRV LACVHVLPNV SGITNLLQVS NATKQSPLSV FAIHLVELTG RTTASLLIMN DECKPKANFS DRVRAESDQI AETFEAMEVN NDAMTVQTIT AVSPYATMHE DICVLAEDKR VCFIILPYHK HLTPDGRMGE GNSSHAEINQ NVLSHAPCSV GILVDRGMAM VRSESFRGES MKREVAMLFV GGPDDREALS YAWRMVGQHV IKLTVVRFVP GREALISSGK VAAEYEREKQ VDDECIYEFN FKTMNDSSVK YIEKVVNDGQ DTIATIREME DNNSYDLYVV GRGYNSDSPV TAGLNDWSSS PELGTIGDTL ASSNFTMHAS VLVIQQYSAT KRQAAVTAAA ATTVMGAVAG VTGNNLESAG GDAKMTRDAH EPFMKSMYED EDEDDEEDHQ YGIHRLCNIP LIDYGNVKKW LADARGDAMP DAFSWSCKRR YKNGYVWQDL LDDDLITPIS DNEYVLKGSE ILLSSPKEDY PNVEKKAWVT RNGGIDAEEK LQKLKLTSEK IQKESPVFCS QRSTATTSTV TEESTTNEEG FVLKKQDPKT VSGQRDGSTE NGSGNDVESG RPSVSSTTSS SSYIKNKSYS SVRASHVLRN LMKCGGLDTN DAVLVPLNKS RSGAFGPAWE DERRYQYHQQ HNARKSFEGA WSGIKMKETI EFCKPKVAPS KPSMAPLCSQ CGKLFKPEKM HSHMKLCRGM KNSSANNDLM TSNNTVKPRQ QRCRNIPGNP LGHQRVLTTT LKE // ID NC003070_339 HYPOTHETICAL; PRT; 1032 AA. AC NC003070_339; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1669870...1670659, 1670733...1671586, DE 1672220...1673674]; Length: 3099. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 263 FIRST EXON; p-value: 0.000. FT GENSCAN 264 264 AA on splice site: g/gt -> G. FT GENSCAN 265 548 INTERNAL EXON; p-value: 0.000. FT GENSCAN 549 1032 LAST EXON; p-value: 0.000. SQ SEQUENCE 1032 AA; 116481 MW; 3EA1C1ABCD10DFC6 CRC64; MLTLSKFHVI LIPILFFITL LSPLFSIALP INIWPKPRFL SWPQHKAIAL SPNFTILAPE HQYLSASVTR YHNLIRSENY SPLISYPVKL MKRYTLRNLV VTVTDFSLPL HHGVDESYKL SIPIGSFSAH LLAHSAWGAM RGLETFSQMI WGTSPDLCLP VGIYIQDSPL FGHRGVLLDT SRNYYGVDDI MRTIKAMSAN KLNVFHWHIT DSQSFPLVLP SEPSLAAKGS LGPDMVYTPE DVSKIVQYGF EHGVRVLPEI DTPGHTGSWG EAYPEIVTCA NMFWWPAGKS WEERLASEPG TGQLNPLSPK TYEVVKNVIQ DIVNQFPESF FHGGGDEVIP GCWKTDPAIN SFLSSGGTLS QLLEKYINST LPYIVSQNRT VVYWEDVLLD AQIKADPSVL PKEHTILQTW NNGPENTKRI VAAGYRVIVS SSEFYYLDCG HGGFLGNDSI YDQKESGGGS WCAPFKTWQS IYNYDIADGL LNEEERKLVL GGEVALWSEQ ADSTVLDSRL WPRASALAES LWSGNRDERG VKRCGEAVDR LNLWRYRMKQ KNPVTALKLF EEAKERFPSY GHNGSVYATM IDILGKSNRV LEMKYVIERM KEDSCECKDS VFASVIRTFS RAGRLEDAIS LFKSLHEFNC VNWSLSFDTL LQEMVKESEL EAACHIFRKY CYGWEVNSRI TALNLLMKVL CQVNRSDLAS QVFQEMNYQG CYPDRDSYRI LMKGFCLEGK LEEATHLLYS MFWRISQKGS GEDIVVYRIL LDALCDAGEV DDAIEILGKI LRKGLKAPKR CYHHIEAGHW ESSSEGIERV KRLLTETLIR GAIPCLDSYS AMATDLFEEG KLVEGEEVLL AMRSKGFEPT PFIYGAKVKA LCRAGKLKEA VSVINKEMMQ GHCLPTVGVY NVLIKGLCDD GKSMEAVGYL KKMSKQVSCV ANEETYQTLV DGLCRDGQFL EASQVMEEML IKSHFPGVET YHMMIKGLCD MDRRYEAVMW LEEMVSQDMV PESSVWKALA ESVCFCAIDV VEILEHLISS KR // ID NC003070_340 HYPOTHETICAL; PRT; 472 AA. AC NC003070_340; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1675933...1675883, 1675774...1675484, DE 1675391...1675131, 1675062...1674889, 1674816...1674713, DE 1674627...1674432, 1674421...1674323, 1674228...1674103, DE 1674010...1673894]; Length: 1419. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 17 FIRST EXON; p-value: NaN. FT GENSCAN 18 114 INTERNAL EXON; p-value: NaN. FT GENSCAN 115 201 INTERNAL EXON; p-value: NaN. FT GENSCAN 202 259 INTERNAL EXON; p-value: NaN. FT GENSCAN 260 293 INTERNAL EXON; p-value: NaN. FT GENSCAN 294 294 AA on splice site: aa/a -> K. FT GENSCAN 295 359 INTERNAL EXON; p-value: NaN. FT GENSCAN 360 392 INTERNAL EXON; p-value: NaN. FT GENSCAN 393 434 INTERNAL EXON; p-value: NaN. FT GENSCAN 435 472 LAST EXON; p-value: NaN. SQ SEQUENCE 472 AA; 52775 MW; ADC3000F0323EFD3 CRC64; MQISSSSFIT KFTNLHMSVA AIVFGGGSDS ELYPLTKTRS KGAIPIAANY RLIDAVISNC INSGITKIYA ITQFNSTSLN SHLSKAYSGF GLGKDRFVEV IAAYQSLEDQ GWFQGTADAI RRCLWVFEEF PVTEFLVLPG HHLYKMDYKM LIEDHRRSRA DITIVGLSSV TDHDFGFGFM EVDSTNAVTR FTIKGQQDLI SVANRTATRS DGTSSCSVPS AGIYVIGREQ MVKLLRECLI KSKDLASEII PGAISEGMKV KAHMFDGYWE DVRSIGAYYR ANMESIKRCR LDLKFYDRQC PLYTMPRCLP PSSMSVAVIT NSIIGDGCIL DVICKTLFQI LETLAAVVNQ KLEEFRHLLK CVIRGSVVGM RTRIADEVIV EDSIIVGSDI YEMEEDVRRK GKEKKIEIRI GIGEKSRIRR AIVDKNARIG KNVMIINRDN VEEGNREAQG YVIREGIIII LRNAVIPNDS IL // ID NC003070_341 HYPOTHETICAL; PRT; 264 AA. AC NC003070_341; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1678301...1678132, 1678130...1677506]; DE Length: 795. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 56 FIRST EXON; p-value: NaN. FT GENSCAN 57 57 AA on splice site: ta/t -> Y. FT GENSCAN 58 264 LAST EXON; p-value: NaN. SQ SEQUENCE 264 AA; 30556 MW; 2874FE18853D08CE CRC64; MVSREEEEGA PWVLFMMSHS LPVETKQSVL EAIEREARVF IQVPLRKRTL RIPISTYAPE EYVTKEDPPK ANIIDGPQTK STSKRKRCPL LLPPTEEKPK IATRKANCRF DAGASSSGTR EPTPEWLVRL MRVKYGENPI NVINKELTAT NVKPHHRRLS MPFSQIIDFE FLNPDEKRII EEHANKEREE GVDVILVNFD RREYMLNLRR WNMGTSPLYI LVSGRYNVVK GCRLKEGNEI RIWSFHFDDQ LNLAMVPLTP TESG // ID NC003070_342 HYPOTHETICAL; PRT; 358 AA. AC NC003070_342; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1679285...1679336, 1679555...1679715, DE 1679823...1679882, 1680120...1680305, 1680400...1680504, DE 1680622...1680718, 1680850...1681020, 1681161...1681296, DE 1681418...1681526]; Length: 1077. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 17 FIRST EXON; p-value: NaN. FT GENSCAN 18 18 AA on splice site: g/ag -> E. FT GENSCAN 19 71 INTERNAL EXON; p-value: NaN. FT GENSCAN 72 91 INTERNAL EXON; p-value: NaN. FT GENSCAN 92 153 INTERNAL EXON; p-value: NaN. FT GENSCAN 154 188 INTERNAL EXON; p-value: NaN. FT GENSCAN 189 220 INTERNAL EXON; p-value: NaN. FT GENSCAN 221 221 AA on splice site: g/aa -> E. FT GENSCAN 222 277 INTERNAL EXON; p-value: NaN. FT GENSCAN 278 278 AA on splice site: g/gt -> G. FT GENSCAN 279 322 INTERNAL EXON; p-value: NaN. FT GENSCAN 323 323 AA on splice site: ag/g -> R. FT GENSCAN 324 358 LAST EXON; p-value: NaN. SQ SEQUENCE 358 AA; 38913 MW; 405FDBB9616E5B21 CRC64; MAIGDRKKII IDTDPGIESY HEFILTGFDD VVDDAMAIFV ALNSPEVDVI GLTTIFGNVY TTLATRNALH LLEVAGRTDI PVAEGTHKTF LNDTKLRIAD FVHGKDGLGN QNFPPPKGKP IEKSGPEFLV EQAKLCPGEI TVVALGPLTN LALAVQLDPE FSKNVGQIVL LGGAFAVNGN VNPASEANIF GDPEAADIVF TCGADIIAVG INVTHQVIMT EHNLVKQSFL MAFFPVFLLD LADDKDKLAS SKGKLAQYLC KILDVYYDYH LTAYEIKGVY LHDPATILAA FLPSLFTYTE GVARVQTSGI TRGLTLLYNN LKRFEEANEW SDKPTVKVAV TVDAPAVVKL IMDRLMES // ID NC003070_343 HYPOTHETICAL; PRT; 1136 AA. AC NC003070_343; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1682482...1683298, 1683388...1683640, DE 1683733...1684540, 1684644...1684768, 1684902...1684986, DE 1685318...1685483, 1685586...1685626, 1685713...1685980, DE 1686126...1686249, 1686347...1686707, 1686790...1687152]; Length: 3411. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 272 FIRST EXON; p-value: NaN. FT GENSCAN 273 273 AA on splice site: g/gt -> G. FT GENSCAN 274 356 INTERNAL EXON; p-value: NaN. FT GENSCAN 357 357 AA on splice site: tg/g -> W. FT GENSCAN 358 626 INTERNAL EXON; p-value: NaN. FT GENSCAN 627 667 INTERNAL EXON; p-value: NaN. FT GENSCAN 668 668 AA on splice site: tg/g -> W. FT GENSCAN 669 696 INTERNAL EXON; p-value: NaN. FT GENSCAN 697 751 INTERNAL EXON; p-value: NaN. FT GENSCAN 752 752 AA on splice site: g/ct -> A. FT GENSCAN 753 765 INTERNAL EXON; p-value: NaN. FT GENSCAN 766 854 INTERNAL EXON; p-value: NaN. FT GENSCAN 855 855 AA on splice site: g/ga -> G. FT GENSCAN 856 895 INTERNAL EXON; p-value: NaN. FT GENSCAN 896 896 AA on splice site: at/g -> M. FT GENSCAN 897 1016 INTERNAL EXON; p-value: NaN. FT GENSCAN 1017 1136 LAST EXON; p-value: NaN. SQ SEQUENCE 1136 AA; 125699 MW; 4E90EE8E43900687 CRC64; MDSLIIEEED EEALATLVPV PPRRKTHSYS LQFDHKPHHQ IRKHSLDEVP RSATLASEAV YFDSSDDEFS TGGNITENAA DETNAGAEEY TIVNPPPNVG LGDDDTEPLP EFIGAGGGSG IFKVPVRAAV HPGRPPCLEL RPHPLRETQT GRFLRNIACT ETQLWAGQEN GIRFWNLEDA YEAGCGIGGQ VPRGDEDTAP FHESVTTSPT MCLVADQSNK LLWSGHKDGK IRAWKMDQSS VSHDDDDSDP FKERVSWLAH RGPVNSIVIS SYGDMWSCSE GGVIKIWPWD TLEKSLLLKP EEKHMAALLV ERSAIDLRSQ VTVNGTCSIS SSEVKFLLAD SVRAKVWAVQ SLSFSIWDAR SKDLLKVLNV DGQVENRGDL PPIQDQQVDD EMKLKFFSAS KREKPQGFLQ RSRNAIMGAA GAVRRVATRS AGAFSEDTRK TEAIVLAVDG TIWTGSISGL IVQWDGNGNR LRDVNHHHRP VLCFCTFGDR IYVGYASGYI QVLDLDGKLI SSWVSHNEPV IKLAAGGGFI FSLATHGGVR GWYVTSPGPL DNIIRTELSQ KETLYARQDN VRILIGTWNV GQGRASHDAL MSWLGSVTSD VGIVAVGLQE VEMGAGFLAM SAAKETVGLE GSAVGQWWID AIGKALDEKN TFERMGSRQL AGLLISLWAR KDIRTHVGDL DVAAVPCGFG RAIGNKGGVG LRIRVYDRIM CFVNCHLAAH LEAVNRRNAD FNHIFRLMVF SRGQNLSNAA AAGVSTSAYT TKSNTIPSTG AEEIKSDLAA ADMVAFFGDF NYRLFGITYD EARDFISQRS FDWLRERDQL RAEMKVGKVF QGMREALITF PPTYKFERNR SGLGGYDSGE KKRIPAWCDR VIYRDTQSSP FSESNLQCPV VSSVIMYEAC MDVTESDHKP VRCKFHATIA HVDKSVRRQE LGKIIRSNEK ILSIFEDLRF VPETSVSTNN IVLQSQDTVI LTITNNSPTS QAIFNILCGG QAVVKDDGED ADYNPRGSFG LPRWLEVSPA AGIINPEGSV DVKVHHEDFY SMEEYVDGIP QNWWCEDTRD KEAILMVNIR GSCSTTLRSH SVKVRHCFSA RVCLLENRPT NLTKNLGGSR RYPTDITRNG STRPRTEDSV RRGKSR // ID NC003070_344 HYPOTHETICAL; PRT; 627 AA. AC NC003070_344; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1689500...1688897, 1688797...1688388, DE 1688304...1687435]; Length: 1884. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 201 FIRST EXON; p-value: NaN. FT GENSCAN 202 202 AA on splice site: g/ag -> E. FT GENSCAN 203 338 INTERNAL EXON; p-value: NaN. FT GENSCAN 339 627 LAST EXON; p-value: NaN. SQ SEQUENCE 627 AA; 68699 MW; 3F19A7BF4471A02A CRC64; MSSLVPVMEK QESVRGNSVE EQQSLRCVGA ENQVRKGRAL DKQVSFLAVN VVEKHGIKGR AMEKQERAME RQRSFRGFVE KQKSFRVVME RQLSFMNVGG ERKKKTDSPG KRGDSPLHLA ARTGNLGKVM ELIRACNGIE ELKELSSKQN LEGETPLYSA AENGHSLVVE EMLKHMDLDT ASVKARNGFD PFHVAAKQGH IEALKKLLET FPNLAMTVDL SCTTALHTAA SQGHTDVVNL LLKTDSHLAK IAKNNGKTAL HSAARMGHRE VVKSLIGNDA SIGFRTDKKG QTALHMAVKG QNEGIVLELV KPDPAILSVE DSKGNTPLHT ATNKGRIKIV RCLVSFDGIN LNAMNKAGDT ALDIAEKIGN PELVSVLKEA GAATAKDLGK PRNPAKQLNQ TVSDIKHEVQ SQLQQSRQTG VRVRRIAKRL KKLHINGLNN AINSATVVAV LIATVAFAAI FTIPGQYEED RTKGLLLLGE ARIAGKAPFL VFFIFDSLAL FISLAVVVVQ TSVVVIEQKA KKNLVFVINK LMWLACLFIS VAFVSLSFIV VGKEDIWLAI CATIIGGTIM LTTIGAMCYC VVMHRIEESK LKSLRKERSK SKSFSLSHMP SESEILNGEF NKRMYAL // ID NC003070_345 HYPOTHETICAL; PRT; 284 AA. AC NC003070_345; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1692125...1691688, 1691267...1690983, DE 1690829...1690698]; Length: 855. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 146 FIRST EXON; p-value: NaN. FT GENSCAN 147 241 INTERNAL EXON; p-value: NaN. FT GENSCAN 242 284 LAST EXON; p-value: NaN. SQ SEQUENCE 284 AA; 30399 MW; A5D70FD9FFC19EC3 CRC64; MTKSAITFPL IFTLLTFIDV SSSASIVFNV VSFGAKPDGV TDSTAAFLKA WQGACGSAAS ATVVVPTGTF LLKVITFGGP CKSKITFQVT GTVVAPEDYR TFGNSGSWIL FNKVNRFSLV GGTFDARGSG FWSCRKSGQN CPPGVRSISF NSAKDVIISG VKSMNSQVSH MTLNGCTNVA VRNIRLVAPG DSPNTDGFTV QFSTGVTLTG STVQTGDDCV AIGQGTRNFL ISKLACGPGH GHREFGKAVK RRRSRERDSI EFGIHRITKR REDKVMGEAE YRIR // ID NC003070_346 HYPOTHETICAL; PRT; 394 AA. AC NC003070_346; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1696057...1695620, 1695284...1694995, DE 1694847...1694619, 1694512...1694285]; Length: 1185. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 146 FIRST EXON; p-value: NaN. FT GENSCAN 147 242 INTERNAL EXON; p-value: NaN. FT GENSCAN 243 243 AA on splice site: ag/c -> S. FT GENSCAN 244 319 INTERNAL EXON; p-value: NaN. FT GENSCAN 320 394 LAST EXON; p-value: NaN. SQ SEQUENCE 394 AA; 41855 MW; 0222B002BB23691D CRC64; MTKSVIRFSL LFTLLTFIDV SISASNVFNV VSFGAKPDGV TDSTGAFLKA WQGACVSASS ATVVVPKGTF LLKVITFGGP CKSKITFQVA GTVIAPEDYR TFGNSGFWIL FNKVNRFSLV GGTFDARANG FWSCRKSGQN CPPGVRSISF NSAKDVIISG VKSMNSQVTH MTLNGCTNVV VRNVKLVAPG NSPNTDGFHV QHSTGVTFTG STVQTGDDCV AIGPGTRNLL ITKLACGPGH GVSIGSLAKE LKEDGVENVT VSSSVFTGSQ NGVRIKSWAR PSNGFVRTVF FQDLVMKNVE NPIIIDQNYC PTHEGCPNEY SGVKISQVTY KNIQGTSATQ EAMKLVCSKS SPCTGITLQD IKLTYNKGTP ATSFCFNAVG KSLGVIQPTS CLNR // ID NC003070_347 HYPOTHETICAL; PRT; 1609 AA. AC NC003070_347; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1704638...1703994, 1703911...1703242, DE 1702674...1702070, 1701928...1701325, 1701152...1700989, DE 1700714...1698573]; Length: 4830. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 215 FIRST EXON; p-value: NaN. FT GENSCAN 216 438 INTERNAL EXON; p-value: NaN. FT GENSCAN 439 439 AA on splice site: g/ca -> A. FT GENSCAN 440 640 INTERNAL EXON; p-value: NaN. FT GENSCAN 641 841 INTERNAL EXON; p-value: NaN. FT GENSCAN 842 842 AA on splice site: g/tg -> V. FT GENSCAN 843 896 INTERNAL EXON; p-value: NaN. FT GENSCAN 897 1609 LAST EXON; p-value: NaN. SQ SEQUENCE 1609 AA; 182758 MW; E1F8A73B16990C98 CRC64; MREGSHLIVL PFPGQGHITP MSQFCKRLAS KGLKLTLVLV SDKPSPPYKT EHDSITVFPI SNGFQEGEEP LQDLDDYMER VETSIKNTLP KLVEDMKLSG NPPRAIVYDS TMPWLLDVAH SYGLSGAVFF TQPWLVTAIY YHVFKGSFSV PSTKYGHSTL ASFPSFPMLT ANDLPSFLCE SSSYPNILRI VVDQLSNIDR VDIVLCNTFD KLEEKLLKWV QSLWPVLNIG PTVPSMYLDK RLSEDKNYGF SLFNAKVAEC MEWLNSKEPN SVVYLSFGSL VILKEDQMLE LAAGLKQSGR FFLWVVRETE THKLPRNYVE EIGEKGLIVS WSPQLDVLAH KSIGCFLTHC GWNSTLEGLS LGVPMIGMPH WTDQPTNAKF MQDVWKVGVR VKAEGDGFVR REEIMRSVEE VMEGEKGKEI RKNAEKWKVL AQEAVSEGAQ GHITPMSQFC KRLASKSLKI TLVLVSDKPS PPYKTEHDTI TVVPISNGFQ EGQERSEDLD EYMERVESSI KNRLPKLIED MKLSGNPPRA LVYDSTMPWL LDVAHSYGLS GAVFFTQPWL VSAIYYHVFK GSFSVPSTKY GHSTLASFPS LPILNANDLP SFLCESSSYP YILRTVIDQL SNIDRVDIVL CNTFDKLEEK LLKWIKSVWP VLNIGPTVPS MYLDKRLAED KNYGFSLFGA KIAECMEWLN SKQPSSVVYV SFGSLVVLKK DQLIELAAGL KQSGHFFLWV VRETERRKLP ENYIEEIGEK GLTVSWSPQL EVLTHKSIGC FVTHCGWNST LEGLSLGVPM IGMPHWADQP TNAKFMEDVW KVGVRVKADS DGFVRREEFV RRVEEVMEAE QVVLAQKRIV LFVKMTVRIY DAVSTKIPKS IVVFNRTPCP SFSEFLFRDR EREKQKSRGL SFSTLTDTRP FPDYSPKKAS VRDTEFVHQI TNVIKLRRAE PLRRSLKPYE CKFKTDHLIW VLMKIKCDYR LVLDFFDWAR SRRDSNLESL CIVIHLAVAS KDLKVAQSLI SSFWERPKLN VTDSFVQFFD LLVYTYKDWG SDPRVFDVFF QVLVDFGLLR EARRVFEKML NYGLVLSVDS CNVYLTRLSK DCYKTATAII VFREFPEVGV CWNVASYNIV IHFVCQLGRI KEAHHLLLLM ELKGYTPDVI SYSTVVNGYC RFGELDKVWK LIEVMKRKGL KPNSYIYGSI IGLLCRICKL AEAEEAFSEM IRQGILPDTV VYTTLIDGFC KRGDIRAASK FFYEMHSRDI TPDVLTYTAI ISGFCQIGDM VEAGKLFHEM FCKGLEPDSV TFTELINGYC KAGHMKDAFR VHNHMIQAGC SPNVVTYTTL IDGLCKEGDL DSANELLHEM WKIGLQPNIF TYNSIVNGLC KSGNIEEAVK LVGEFEAAGL NADTVTYTTL MDAYCKSGEM DKAQEILKEM LGKGLQPTIV TFNVLMNGFC LHGMLEDGEK LLNWMLAKGI APNATTFNSL VKQYCIRNNL KAATAIYKDM CSRGVGPDGK TYENLVKGHC KARNMKEAWF LFQEMKGKGF SVSVSTYSVL IKGFLKRKKF LEAREVFDQM RREGLAADKE IFDFFSDTKY KGKRPDTIVD PIDEIIENYL VDEQLRGAN // ID NC003070_348 HYPOTHETICAL; PRT; 1154 AA. AC NC003070_348; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1707808...1708026, 1708104...1708234, DE 1708311...1708795, 1708806...1708925, 1709982...1710519, DE 1710646...1711121, 1711201...1711339, 1711595...1711666, DE 1711911...1711976, 1712092...1712284, 1712361...1712550, DE 1712676...1712826, 1712905...1713022, 1716098...1716377, DE 1716558...1716623, 1716802...1717022]; Length: 3465. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 73 FIRST EXON; p-value: NaN. FT GENSCAN 74 116 INTERNAL EXON; p-value: NaN. FT GENSCAN 117 117 AA on splice site: tg/c -> C. FT GENSCAN 118 278 INTERNAL EXON; p-value: NaN. FT GENSCAN 279 279 AA on splice site: g/ac -> D. FT GENSCAN 280 318 INTERNAL EXON; p-value: NaN. FT GENSCAN 319 319 AA on splice site: g/gt -> G. FT GENSCAN 320 497 INTERNAL EXON; p-value: NaN. FT GENSCAN 498 498 AA on splice site: ag/a -> R. FT GENSCAN 499 656 INTERNAL EXON; p-value: NaN. FT GENSCAN 657 657 AA on splice site: g/gt -> G. FT GENSCAN 658 702 INTERNAL EXON; p-value: NaN. FT GENSCAN 703 703 AA on splice site: ct/a -> L. FT GENSCAN 704 726 INTERNAL EXON; p-value: NaN. FT GENSCAN 727 727 AA on splice site: tt/c -> F. FT GENSCAN 728 748 INTERNAL EXON; p-value: NaN. FT GENSCAN 749 749 AA on splice site: cg/t -> R. FT GENSCAN 750 813 INTERNAL EXON; p-value: NaN. FT GENSCAN 814 876 INTERNAL EXON; p-value: NaN. FT GENSCAN 877 877 AA on splice site: g/tc -> V. FT GENSCAN 878 926 INTERNAL EXON; p-value: NaN. FT GENSCAN 927 927 AA on splice site: ct/a -> L. FT GENSCAN 928 966 INTERNAL EXON; p-value: NaN. FT GENSCAN 967 1059 INTERNAL EXON; p-value: NaN. FT GENSCAN 1060 1060 AA on splice site: g/ac -> D. FT GENSCAN 1061 1081 INTERNAL EXON; p-value: NaN. FT GENSCAN 1082 1082 AA on splice site: g/at -> D. FT GENSCAN 1083 1154 LAST EXON; p-value: NaN. SQ SEQUENCE 1154 AA; 129534 MW; F266D6FDB8A6508B CRC64; MSSSTKNIPK PPPLPCITYQ RFQSSTRKPS SLMRLVPKEA LETWDKLFKE GSGADTYVET DNKSHFPAHS SVLAAASPVI ATLLNQSRDK NGNTYLKIHG VPCEAVYMFI RFLYSSCYEE EEMKKFVLHL LVLSHCYSVP SLKRLCVEIL DQGWINKENV IDVLQLARNC DVTRICFVCL SMVIKDFKSV SSTEGWKVMK RSNPLLEQEL IEAVIESDSR KQERRRKLEE REVYLQLYEA MEALVHICRE GCGTIGPRDK ALKGSHTVCK FPACKGLEDI SWDASLGLHV LIAKGCGSFF SFTLVSVMIL ILAKFLSAGF ISIDCGIPSG SSYKDDTTGI NYVSDSSFVE TGVSKSIPFT AQRQLQNLRS FPEGSRNCYT LIPIQGKGKK YLIRASFMYG NYDGENGSPE FDLFLGGNIW DTVLLSNGSS IVSKEVVYLS QSENIFVCLG NKGKGTPFIS TLELRFLGND NTTYDSPNGA LFFSRRWDLR SLMGSPVRYD DDVYDRIWIP RNFGYCREIN TSLPVTSDNN SYSLSSLVMS TAMTPINTTR PITMTLENSD PNVRYFVYMH FAEVEDLSLK PNQTREFDIS INGVTVAAGF SPKYLQTNTF FLNPESQSKI AFSLVRTPKS TLPPIVNALE IYVANSFSQS LTNQEDGDAV TSLKTSYKVK KNWHGDPCLP NDYIWEGLNC SYDSLTPPRI TSLDLSNNGL TGDIPEFLSK LKFLRVFIVH PLVIVWCVLE NQKPEKQVRP MAKSENKLLF TFADVIKMTN NFGQVLGKGG FGTVYHGFYD NLQVAVKLLS ETSAQGFKEF RSEVEVLVRV HHVNLTALIG YFHEGDQMGL IYEFMANGNM ADHLAGKYQH TLSWRQRLQI ALDAAQVHRD VKTSNILLNE KNRAKLADFG LSRSFHTESR SHVSTLVAGT PGYLDPLCFE TNGLNEKSDI YSFGVVLLEM ITGKTVIKES QTKRVHDGNL VPYVTCLVSI NFGSCCVDDS LAHFLLTKGM EFSRDAGMMM ENKRNVCSLG ESSIKRHKSD LSFSSKVLLI LASFHGFLGR NSLRYNKLHD RHCISSSRRD ALHRVSSRTS EDLHLELEYL ARTDFPVFCN QEELEQYSLR NRGLCLVPME NTVGVAQSNG ADIWAPVKTP LSPAFSVTSQ SPFR // ID NC003070_349 HYPOTHETICAL; PRT; 158 AA. AC NC003070_349; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1718833...1718671, 1718487...1718429, DE 1718300...1718119, 1717748...1717676]; Length: 477. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 54 FIRST EXON; p-value: NaN. FT GENSCAN 55 55 AA on splice site: g/ag -> E. FT GENSCAN 56 74 INTERNAL EXON; p-value: NaN. FT GENSCAN 75 134 INTERNAL EXON; p-value: NaN. FT GENSCAN 135 135 AA on splice site: ag/a -> R. FT GENSCAN 136 158 LAST EXON; p-value: NaN. SQ SEQUENCE 158 AA; 17901 MW; A181CB37D4A33B8B CRC64; MAKMVVLSAM MILILASTIS AKEQLSTKEC EDLGFSGLAL CSDCHSLSEY VKDQELVSDC LKCCADDSED SMSKVTYSGA ILEVCMRKLV FYPEIVGFIE EEKEKFPSVK VQYIFNSPPK LIMLDEDGEH KESIRIDNWK REHLLQYMRE KVKPTAAS // ID NC003070_350 HYPOTHETICAL; PRT; 161 AA. AC NC003070_350; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1719106...1719234, 1719723...1719896, DE 1719976...1720158]; Length: 486. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 43 FIRST EXON; p-value: NaN. FT GENSCAN 44 101 INTERNAL EXON; p-value: NaN. FT GENSCAN 102 161 LAST EXON; p-value: NaN. SQ SEQUENCE 161 AA; 18593 MW; 3B77961BE51B0CDE CRC64; MDHIAAAEEQ IVTERIRRKL EEVNATAQSQ LSPIQDHINF TLQLMNQTRK IDQLQQAYFK CAYECFDRNR KQEEIANCVE HCSVPVVNAQ QHFEGEMSQF QERMNRSLMV CQDKFEAAKL HKNRGDAAKA MESCVNTSIE DSLDTLPHIV QRMKTSFSIA D // ID NC003070_351 HYPOTHETICAL; PRT; 620 AA. AC NC003070_351; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1720673...1720801, 1720922...1721092, DE 1721462...1723024]; Length: 1863. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 43 FIRST EXON; p-value: NaN. FT GENSCAN 44 100 INTERNAL EXON; p-value: NaN. FT GENSCAN 101 620 LAST EXON; p-value: NaN. SQ SEQUENCE 620 AA; 69756 MW; 840E40D248562BAD CRC64; MDRMAVAEEQ ILLERVRRKI EEVNASGQSQ LSPIQEHISF TLLIYAPLID DELQQAYFKC SNECFEKRRK PEVTTNCVEL CRVPVAKSQQ QFDSDMAKFQ SGLVRDKRKK SILKSHHLNR MGLLPVVGIT SPALITHKNH ANPKIQRHNQ STSETTVSWT SRINLLTRNG RLAEAAKEFS DMTLAGVEPN HITFIALLSG CGDFTSGSEA LGDLLHGYAC KLGLDRNHVM VGTAIIGMYS KRGRFKKARL VFDYMEDKNS VTWNTMIDGY MRSGQVDNAA KMFDKMPERD LISWTAMING FVKKGYQEEA LLWFREMQIS GVKPDYVAII AALNACTNLG ALSFGLWVHR YVLSQDFKNN VRVSNSLIDL YCRCGCVEFA RQVFYNMEKR TVVSWNSVIV GFAANGNAHE SLVYFRKMQE KGFKPDAVTF TGALTACSHV GLVEEGLRYF QIMKCDYRIS PRIEHYGCLV DLYSRAGRLE DALKLVQSMP MKPNEVVIGS LLAACSNHGN NIVLAERLMK HLTDLNVKSH SNYVILSNMY AADGKWEGAS KMRRKMKGLG LKKQPGFSSI EIDDCMHVFM AGDNAHVETT YIREVLELIS SDLRLQGCVV ETLAGDLLNA // ID NC003070_352 HYPOTHETICAL; PRT; 174 AA. AC NC003070_352; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1724439...1724233, 1724113...1723796]; DE Length: 525. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 69 FIRST EXON; p-value: NaN. FT GENSCAN 70 174 LAST EXON; p-value: NaN. SQ SEQUENCE 174 AA; 19200 MW; F38A420E5EDF0224 CRC64; MKIGPVGKHD ARSTTIVNWD EGSHDGFISQ IFLSHGVAGI MSIQFQFVMD GKLVLSDRHG PFSGNMFDVI ELNYPHEYIT GISGEYYKYE ANNPHMRSLK FNTNTSEYGP FGTSGSSNDK FAFKLGKSPQ FGGFHGTYDA SGLQYIGVYL RPKTVLPKID TGNAEETESK IVLG // ID NC003070_353 HYPOTHETICAL; PRT; 381 AA. AC NC003070_353; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1729574...1729357, 1727446...1727315, DE 1726763...1726557, 1726283...1726031, 1725881...1725546]; Length: 1146. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 72 FIRST EXON; p-value: NaN. FT GENSCAN 73 73 AA on splice site: ag/t -> S. FT GENSCAN 74 116 INTERNAL EXON; p-value: NaN. FT GENSCAN 117 117 AA on splice site: ag/c -> S. FT GENSCAN 118 185 INTERNAL EXON; p-value: NaN. FT GENSCAN 186 186 AA on splice site: cg/t -> R. FT GENSCAN 187 270 INTERNAL EXON; p-value: NaN. FT GENSCAN 271 381 LAST EXON; p-value: NaN. SQ SEQUENCE 381 AA; 43351 MW; 6ADB39C2FEBCE8B1 CRC64; MASSEIICDD VTQRLFWMMS SYKGYFWNLV EYGEQQRRTT VKIREMMTQI RRLREVSNKE KSYALHQFKN DFSCPSASSF NFSNLPRQII TTWAKRYRKP NLYFSLSLLE RERLKKSHGK TTELKFTEIE SPLILDEIRN RETQILLRVK GWIESGVGSS ECTLGLRLGG LNKIACKSGI SRFVGRKREQ KKHIKKKMEG KIKIGPVGTD YSGKKTMVDW DEGSHNGIIS QIFLSHGPTG VFSIQFQFML DDTFFLSSCH GQNTGSMFDV ILLNCPHEYI TGISGEYLKS DGASGPQIRS LAFATNLNQY GPFGGSSSQS SIWNHEQQFR FKLGKFRQFS GFYGTYNASG LQNIGVYLQP TIVKPTGTRN AEETESNIVL G // ID NC003070_354 HYPOTHETICAL; PRT; 2932 AA. AC NC003070_354; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1730076...1730126, 1731630...1731727, DE 1732878...1733298, 1733750...1733817, 1734113...1734213, DE 1734670...1734832, 1735111...1735216, 1735495...1735649, DE 1735937...1736043, 1736151...1736272, 1736354...1736451, DE 1736531...1736730, 1736834...1737200, 1738798...1738819, DE 1741401...1742514, 1744792...1745316, 1746209...1746501, DE 1746760...1746888, 1747035...1747100, 1747192...1747288, DE 1747767...1747793, 1748265...1748310, 1748405...1748775, DE 1749027...1749277, 1750361...1750519, 1750627...1750738, DE 1750855...1750974, 1751114...1751271, 1751526...1751705, DE 1753097...1753239, 1753678...1753742, 1754479...1755076, DE 1755261...1755312, 1755407...1755481, 1755562...1755823, DE 1756067...1756193, 1756561...1756766, 1756864...1756956, DE 1757046...1757143, 1757226...1757387, 1757733...1757876, DE 1757973...1758020, 1758263...1758283, 1758321...1758405, DE 1758460...1758578, 1759114...1759309, 1759617...1759862, DE 1760190...1760251, 1760334...1760411, 1760856...1761032, DE 1761083...1761097]; Length: 8799. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 17 FIRST EXON; p-value: NaN. FT GENSCAN 18 49 INTERNAL EXON; p-value: NaN. FT GENSCAN 50 50 AA on splice site: gg/g -> G. FT GENSCAN 51 190 INTERNAL EXON; p-value: NaN. FT GENSCAN 191 212 INTERNAL EXON; p-value: NaN. FT GENSCAN 213 213 AA on splice site: ag/g -> R. FT GENSCAN 214 246 INTERNAL EXON; p-value: NaN. FT GENSCAN 247 247 AA on splice site: g/gc -> G. FT GENSCAN 248 300 INTERNAL EXON; p-value: NaN. FT GENSCAN 301 301 AA on splice site: tg/g -> W. FT GENSCAN 302 336 INTERNAL EXON; p-value: NaN. FT GENSCAN 337 387 INTERNAL EXON; p-value: NaN. FT GENSCAN 388 388 AA on splice site: aa/t -> N. FT GENSCAN 389 423 INTERNAL EXON; p-value: NaN. FT GENSCAN 424 424 AA on splice site: g/ag -> E. FT GENSCAN 425 464 INTERNAL EXON; p-value: NaN. FT GENSCAN 465 496 INTERNAL EXON; p-value: NaN. FT GENSCAN 497 497 AA on splice site: ag/c -> S. FT GENSCAN 498 563 INTERNAL EXON; p-value: NaN. FT GENSCAN 564 564 AA on splice site: g/tt -> V. FT GENSCAN 565 685 INTERNAL EXON; p-value: NaN. FT GENSCAN 686 686 AA on splice site: ag/a -> R. FT GENSCAN 687 693 INTERNAL EXON; p-value: NaN. FT GENSCAN 694 1064 INTERNAL EXON; p-value: NaN. FT GENSCAN 1065 1065 AA on splice site: a/aa -> K. FT GENSCAN 1066 1239 INTERNAL EXON; p-value: NaN. FT GENSCAN 1240 1240 AA on splice site: a/at -> N. FT GENSCAN 1241 1337 INTERNAL EXON; p-value: NaN. FT GENSCAN 1338 1380 INTERNAL EXON; p-value: NaN. FT GENSCAN 1381 1402 INTERNAL EXON; p-value: NaN. FT GENSCAN 1403 1434 INTERNAL EXON; p-value: NaN. FT GENSCAN 1435 1435 AA on splice site: g/tt -> V. FT GENSCAN 1436 1443 INTERNAL EXON; p-value: NaN. FT GENSCAN 1444 1444 AA on splice site: g/aa -> E. FT GENSCAN 1445 1458 INTERNAL EXON; p-value: NaN. FT GENSCAN 1459 1459 AA on splice site: ca/a -> Q. FT GENSCAN 1460 1582 INTERNAL EXON; p-value: NaN. FT GENSCAN 1583 1583 AA on splice site: a/tc -> I. FT GENSCAN 1584 1666 INTERNAL EXON; p-value: NaN. FT GENSCAN 1667 1719 INTERNAL EXON; p-value: NaN. FT GENSCAN 1720 1756 INTERNAL EXON; p-value: NaN. FT GENSCAN 1757 1757 AA on splice site: g/ag -> E. FT GENSCAN 1758 1796 INTERNAL EXON; p-value: NaN. FT GENSCAN 1797 1797 AA on splice site: g/tg -> V. FT GENSCAN 1798 1849 INTERNAL EXON; p-value: NaN. FT GENSCAN 1850 1909 INTERNAL EXON; p-value: NaN. FT GENSCAN 1910 1956 INTERNAL EXON; p-value: NaN. FT GENSCAN 1957 1957 AA on splice site: ag/g -> R. FT GENSCAN 1958 1978 INTERNAL EXON; p-value: NaN. FT GENSCAN 1979 1979 AA on splice site: c/aa -> Q. FT GENSCAN 1980 2177 INTERNAL EXON; p-value: NaN. FT GENSCAN 2178 2178 AA on splice site: ag/g -> R. FT GENSCAN 2179 2195 INTERNAL EXON; p-value: NaN. FT GENSCAN 2196 2220 INTERNAL EXON; p-value: NaN. FT GENSCAN 2221 2307 INTERNAL EXON; p-value: NaN. FT GENSCAN 2308 2308 AA on splice site: c/gt -> R. FT GENSCAN 2309 2349 INTERNAL EXON; p-value: NaN. FT GENSCAN 2350 2350 AA on splice site: ag/g -> R. FT GENSCAN 2351 2418 INTERNAL EXON; p-value: NaN. FT GENSCAN 2419 2419 AA on splice site: g/ga -> G. FT GENSCAN 2420 2449 INTERNAL EXON; p-value: NaN. FT GENSCAN 2450 2450 AA on splice site: g/at -> D. FT GENSCAN 2451 2482 INTERNAL EXON; p-value: NaN. FT GENSCAN 2483 2536 INTERNAL EXON; p-value: NaN. FT GENSCAN 2537 2584 INTERNAL EXON; p-value: NaN. FT GENSCAN 2585 2600 INTERNAL EXON; p-value: NaN. FT GENSCAN 2601 2607 INTERNAL EXON; p-value: NaN. FT GENSCAN 2608 2635 INTERNAL EXON; p-value: NaN. FT GENSCAN 2636 2636 AA on splice site: g/ga -> G. FT GENSCAN 2637 2675 INTERNAL EXON; p-value: NaN. FT GENSCAN 2676 2740 INTERNAL EXON; p-value: NaN. FT GENSCAN 2741 2741 AA on splice site: g/ag -> E. FT GENSCAN 2742 2822 INTERNAL EXON; p-value: NaN. FT GENSCAN 2823 2823 AA on splice site: g/gg -> G. FT GENSCAN 2824 2843 INTERNAL EXON; p-value: NaN. FT GENSCAN 2844 2869 INTERNAL EXON; p-value: NaN. FT GENSCAN 2870 2928 INTERNAL EXON; p-value: NaN. FT GENSCAN 2929 2932 LAST EXON; p-value: NaN. SQ SEQUENCE 2932 AA; 328944 MW; B40CA6459D97A87B CRC64; MSGVNKERLS SEKKEIKGTI CFVLGLFLIF VRWPIIGIIL EIYGVIVLFG IRQRSSSADF SRRDVSSCFL ARRGIDLNPL NNLRRRSRTH SALRGERTKN QTIMWISRLK RVRRTIMVLG VANFVVIVSG CVLTLVSDAD CDSPGQLFPL FAVCFAAGVK LAAMVKVGTT QELMAMTIMD SPTQNNHQRK NLFSNFFAQG YLRTLVGGSK HFRGVIEEDE VCSVARLLGD LVSYRASGTG HLEFLAGLAL LQSNSQFPES YEDCMEAPAF HLQEAAMLHK FAEAAYTVCL TTQAPKLFLK WRPKLDGDNW WRGHAAAFLK FINFPAHVLR RGRICREKCK ATYFVVVLHY LRCVVIAVRG TETAEDLITD GLGRACSLTV EDLDGLTNHV HGMDTSRKHY GHSGIVEAAR DLFMQIEGDP KSGESESSGF LSSLIGDGCE CDGYSIRIVG HSLGGAIASL LGIRLRCRFP NLYVYAYGPL PCVDSDVAEA CSEFVTSIVL DNEFSSRLSY GSIRRLQVAA IKVLSQDPKA DTALIFRLAR RFLSASKRQR ENVEEKTSEE AIDVNNSPES QHDQIYPIWE EAEAEMQQDS EEFINPFHGM ASEDNPVSQF METGPTKEDD DEAPEMFMPG LVIHIVPEGN NMSVPIWRGW PICDVTDGYK AYVANRESFK EIMVSPSMFL DHLPWRHIHT TVKAIPPSRA PAVTLPLSRV WREIQGSNNW ENLIEPLSPI LQQEITRYGN LLSASYKGFD LNPNSKRYLS CKYGKKNLLK ESGIHDPDGY QVTKYIYATP DINLNPIKNE PNRARWIGYV AVSSDESVKR LGRRDILVTF RGTVTNHEWL ANLKSSLTPA RLDPHNPRPD VKVESGFLGL YTSGESESKF GLESCREQLL SEISRLMNKH KGEEISITLA GHSMGSSLAQ LLAYDIAELG MNQRRDEKPV PVTVFSFAGP RVGNLGFKKR CEELGVKVLR ITNVNDPITK LPGFLFNENF RSLGGVYELP WSCSCYTHVG VELTLDFFDV QNISCVHDLE TYITLVNRPR CSKLAVNEDN FGGEFLNRTS ELMFKTKLFR FLSLHLAIEE VMYQSSSSTS SSSQRSSLPG GGGLIRYGSA PGSFLNSVVD EVIGGGSSNA RDFTGYQPSS DNFIGNFFTG AADSSSLRSD STTCGVNNSS DGQKQLGNNN NNNSNKDIFL DRSYGGFNEI SQQHKSNDIG GGNSSGSYSL ARQRSSPADF FTYLASDKNN FSLNQPTSDY SPQGGSNGGR GHSRLKSQLS FTNHDSLARI NEVNETPVHD GSGHSFSAAS FGAATTDSWD DGSGSIGFTV TRPSKRSKDM DSGLFSQYSL PSDTSMNYMD NFMQLPEDSV PCKIRAKRGC ATHPRSIAER ERRTRISGKL KKLQDLVPNM DKQTSYSDML DLAVQHIKGL QHQLQVPLSY FFCLVPELNI EIKEKEKGND IKEGALNAQN LGKKKKTVKR AMSSDDEGRE EYLFKIVVIG DSAVGKSNLL SRYARNEFSA NSKATIGVEF QTQSMEIEGK EVKAQIWDTA GQERFRAVTS AYYRGAVGAL VVYDITRRTT FESVGRWLDE LKIHSDTTVA RMLVGNKCDL ENIRAVSVEE GKALAEEEGL FFVETSALDS TNVKTAFEMV ILDIYNNVSR KQLNSDTYKD ELTVNRVKVE NWVNGENGET FTAMTAQFGT MLPSDKDKAV KLPVALTTPL DSCSNLTSKL SWSIALSVRG ECAFTVKAQV AQAGGAAALV LINDKEELDE MVCGEKDTSL NVSIPILMIT TSSGDALKKS IMQNKKVELL LYAPKSPIVD YAVVFLWLMS VGTVFVASVW SHVTSPKKND EQYDELSPKK SSNVDATKGG AEEETLDISA MGAVIFVISA STFLVLLFFF MSSWFILILT IFFVIGGMQV ARGSKDTGES IPMLLRIPRL SDPWGGYNMI GFGDILFPGL LICFIFRVGE ERTQRLVELR DSTAFSCRQE EGEDTQIKTE LHDHAADNPV RYASLESVYS VSSSSSSLCC KTAAGSHKKV NALKLPMSDS FELQPHRRPE IVHVYCRRKR RRRRRRESFL ELAILQNEGV ERDDRIVKIE SAELDDEKEE ENKKKKQKKR RIGNGELMKL GVDSTTLSVS ATPPLRGCRI KAVCSGNKQD GSSRSKRNTV KNQEKVVTAS ATAKKWVRLS YDGVDPKHFI GLQCKVFWPL DAVWYPGSIV GYNVETKHHI VKYGDGDGEE LALRREKIKF LISRDDMELL NMKFGTNDVV VDGQDYDELV ILAASFEECQ DFEPRDIIWA KLTGECFKPN RFLYSCTRHA MWPAIIVDES VIVKRKGLNN KISGGRSVLV QFFGTHDFAR YLKEYKLPGR MDQLQKVADT DCSERINSGE EDSSNSGDDY TKDGEVWLRP TELGDCLHRI GDLQIINLGR IVTDSEFFKD SKHTWPEGYT ATRKFISLKD PNASAMYKME VLRDAESKTR PVFRVTTNSG EQFKGDTPSA CWNKIYNRIK KIQIASDNPD VLGEGLHESG TDMFGFSNPE VDKLIQGLLQ SRPPSKVSQR KYSSGKYQDH PTGYRPVRVE WKDLDKCNVC HMDEEYENNL FLQCDKCRMM VINPTLSPTK YLPGGAMKPT TDGRWAHLAC AIWIPGFWTL YLYYPSCMSN DCDIIQKHAY WMSRRWNRSM GLRKSLADED RLFLLSMDDD EADQCIRLLS FCKRHRQTSN YHLETEYMIK PAHNIAEYLP PPNPSGCART EPYNYLGRRG RKEPEALAGA SSKRLFVENQ PYIVGGYSRH EFSTYERIYG SKMSQITTPS NILSMAEKYT FMKETYRKRL AFGKSGIHGF GIFAKLPHRA GDMVIEYTGE LVRPPIADKR EHLIYNSMVY LQYHHITYGY ASVQPNCYSR VISVNGDEHI IIFAKRDVAK WEELTYDYRF IFVPFSSRVL FN // ID NC003070_355 HYPOTHETICAL; PRT; 859 AA. AC NC003070_355; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1768116...1767729, 1767510...1767351, DE 1767249...1766836, 1766171...1765917, 1765759...1765630, DE 1765547...1765300, 1764593...1764366, 1764114...1763952, DE 1763876...1763803, 1763570...1763540, 1763418...1763343, DE 1763161...1763090, 1762987...1762938, 1762611...1762321]; Length: 2580. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 129 FIRST EXON; p-value: NaN. FT GENSCAN 130 130 AA on splice site: t/gt -> C. FT GENSCAN 131 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 183 AA on splice site: tg/g -> W. FT GENSCAN 184 320 INTERNAL EXON; p-value: NaN. FT GENSCAN 321 321 AA on splice site: tc/g -> S. FT GENSCAN 322 405 INTERNAL EXON; p-value: NaN. FT GENSCAN 406 406 AA on splice site: gg/g -> G. FT GENSCAN 407 449 INTERNAL EXON; p-value: NaN. FT GENSCAN 450 531 INTERNAL EXON; p-value: NaN. FT GENSCAN 532 532 AA on splice site: gg/g -> G. FT GENSCAN 533 607 INTERNAL EXON; p-value: NaN. FT GENSCAN 608 608 AA on splice site: ca/a -> Q. FT GENSCAN 609 662 INTERNAL EXON; p-value: NaN. FT GENSCAN 663 686 INTERNAL EXON; p-value: NaN. FT GENSCAN 687 687 AA on splice site: ag/g -> R. FT GENSCAN 688 697 INTERNAL EXON; p-value: NaN. FT GENSCAN 698 722 INTERNAL EXON; p-value: NaN. FT GENSCAN 723 723 AA on splice site: g/at -> D. FT GENSCAN 724 746 INTERNAL EXON; p-value: NaN. FT GENSCAN 747 747 AA on splice site: t/gc -> C. FT GENSCAN 748 763 INTERNAL EXON; p-value: NaN. FT GENSCAN 764 859 LAST EXON; p-value: NaN. SQ SEQUENCE 859 AA; 95287 MW; D840BFBFDC8FB098 CRC64; MVTIRSGSIV ILVLLAVSFL ALVANGEDKT IKVKKVRGNK VCTQGWECSW WSKYCCNQTI SDYFQVYQFE QLFSKRNTPI AHAVGFWDYQ SFITAAALFE PLGFGTTGGK LMGQKEMAAF LGHVASKTSC GYGVATGGPL AWGLCYNREM SPMQSYCDES WKFKYPCSPG AEYYGRGALP IYWNFNYGAA GEALKADLLN HPEYIEQNAT LAFQAAIWRW MTPIKRAQPS AHDIFVGNWK PTKNDTLSKR GPTFGSTMNV LYGEYTCGQG SIDPMNNIIS HYLYFLDLMG IGREDAGPND ELSCAEQKPF NPSTVPSSSS SHVGRRKKMT LSIVSFPICG RFTLIWFLTA LVSVSCNPGV FNVKYRYPRL QGSLTALKEH DDRRQLTILA GIDLPLGGTG RPDIPGLYYA KIGIGTPAKS YYVQVDTGSD IMWVNCIQCK QCPRRSTLGI ELTLYNIDES DSGKLVSCDD DFCYQISGGP LSGCKANMSC PYLEIYGDGS STAGYFVKDV VQYDSVAGDL KTQTANGSVI FGCGARQSGD LDSSNEEALD GILGFGKANS SMISQLASSG RVKKIFAHCL DGRNGGGIFA IGRVVQPKVN MTPLVPNQPH YNVNMTAVQV GQEFLTIPAD LFQPGDRKGA IIDSGTTLAY LPEIIYEPLV KKITSQEPAL KVHIVDKDYK CFQYSGRVYP HDYLFPHEGM WCIGWQNSAM QSRDRRNMTL LGDLVLSNKL VLYDLENQLI GWTEYNCSSS IKVKDEGTGT VHLWQAIVNH TVSGGRGENT EECGVHKRLR KSLTFQTKVD DWTHGTFVHY LDSELLSFSP LHLRLPHRRF AAFALTYCPR TYTYNPNHMS ICKITMKSF // ID NC003070_356 HYPOTHETICAL; PRT; 280 AA. AC NC003070_356; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1769060...1769637, 1770084...1770348]; DE Length: 843. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 192 FIRST EXON; p-value: NaN. FT GENSCAN 193 193 AA on splice site: ag/a -> R. FT GENSCAN 194 280 LAST EXON; p-value: NaN. SQ SEQUENCE 280 AA; 31177 MW; 557AD6CC5830D524 CRC64; MASASKQNPS SSKPPRHPSP IIASTPSKSG VLEESQFRNP NNPSTSSNSP ISMAVEDQIL GNSNHLTRPE LLRRRSHNLK QLSRCYRDHY WALMEDLKAQ HRYYSWNYGV SPFKDENYHQ NKRRKVEGQT GDEIEGSGDN DNNNNDGVKA GNCVACGSGC KSKAMALTNY CQLHILMDKK QKLYTSCTYV NKRAQSKAIT CPKPTLASTV PALCNVHFQK AQKDVARALK DAGHNVSSAS RPPPKLHDIV AAFVHHIQAK RKDPRKEGKL KSLVKEELTS // ID NC003070_357 HYPOTHETICAL; PRT; 176 AA. AC NC003070_357; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1773227...1772697]; Length: 531. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 176 SINGLE EXON; p-value: NaN. SQ SEQUENCE 176 AA; 19406 MW; D260E6FB0FBD1345 CRC64; MGCVRCKSSD PWQTSANAFE SVDESGINEA WVEISSRRSF VAGEGSRKKL ERKKSQVLLE GYVETASSSS VDDQKDDLTR SKSLTDDDLE DLRGCLDLGF GFSYDEIPEL CNTLPALELC YSMSQKFLDD KQNKSPETSS VEDCPSPPLV TATPIANWKI SSPGKYLIFP SIDSVI // ID NC003070_358 HYPOTHETICAL; PRT; 2626 AA. AC NC003070_358; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1775654...1775886, 1776686...1776769, DE 1777194...1777272, 1777357...1777437, 1777652...1777723, DE 1777823...1777922, 1777999...1778125, 1779614...1779785, DE 1780173...1780297, 1780525...1780680, 1780719...1780809, DE 1780838...1780916, 1781086...1781258, 1781716...1781798, DE 1782190...1782297, 1782453...1782543, 1782649...1782738, DE 1783161...1783232, 1783318...1783411, 1783501...1783627, DE 1783982...1784112, 1784852...1786129, 1787147...1787265, DE 1787450...1787516, 1787785...1787909, 1788221...1788287, DE 1788414...1788505, 1788696...1788833, 1789161...1789244, DE 1790938...1791022, 1791093...1791115, 1791279...1791410, DE 1791517...1793340, 1793422...1793610, 1793758...1793894, DE 1794409...1794651, 1795070...1795256, 1795527...1795592, DE 1795705...1795954, 1796096...1796502]; Length: 7881. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 77 FIRST EXON; p-value: NaN. FT GENSCAN 78 78 AA on splice site: tg/g -> W. FT GENSCAN 79 105 INTERNAL EXON; p-value: NaN. FT GENSCAN 106 106 AA on splice site: at/g -> M. FT GENSCAN 107 132 INTERNAL EXON; p-value: NaN. FT GENSCAN 133 159 INTERNAL EXON; p-value: NaN. FT GENSCAN 160 183 INTERNAL EXON; p-value: NaN. FT GENSCAN 184 216 INTERNAL EXON; p-value: NaN. FT GENSCAN 217 217 AA on splice site: g/ag -> E. FT GENSCAN 218 258 INTERNAL EXON; p-value: NaN. FT GENSCAN 259 259 AA on splice site: ag/t -> S. FT GENSCAN 260 316 INTERNAL EXON; p-value: NaN. FT GENSCAN 317 357 INTERNAL EXON; p-value: NaN. FT GENSCAN 358 358 AA on splice site: tg/g -> W. FT GENSCAN 359 409 INTERNAL EXON; p-value: NaN. FT GENSCAN 410 410 AA on splice site: tg/t -> C. FT GENSCAN 411 440 INTERNAL EXON; p-value: NaN. FT GENSCAN 441 466 INTERNAL EXON; p-value: NaN. FT GENSCAN 467 467 AA on splice site: a/gt -> S. FT GENSCAN 468 524 INTERNAL EXON; p-value: NaN. FT GENSCAN 525 551 INTERNAL EXON; p-value: NaN. FT GENSCAN 552 552 AA on splice site: at/g -> M. FT GENSCAN 553 587 INTERNAL EXON; p-value: NaN. FT GENSCAN 588 588 AA on splice site: tg/g -> W. FT GENSCAN 589 618 INTERNAL EXON; p-value: NaN. FT GENSCAN 619 648 INTERNAL EXON; p-value: NaN. FT GENSCAN 649 672 INTERNAL EXON; p-value: NaN. FT GENSCAN 673 703 INTERNAL EXON; p-value: NaN. FT GENSCAN 704 704 AA on splice site: g/gg -> G. FT GENSCAN 705 745 INTERNAL EXON; p-value: NaN. FT GENSCAN 746 746 AA on splice site: ag/c -> S. FT GENSCAN 747 789 INTERNAL EXON; p-value: NaN. FT GENSCAN 790 790 AA on splice site: g/at -> D. FT GENSCAN 791 1215 INTERNAL EXON; p-value: NaN. FT GENSCAN 1216 1216 AA on splice site: g/ag -> E. FT GENSCAN 1217 1255 INTERNAL EXON; p-value: NaN. FT GENSCAN 1256 1277 INTERNAL EXON; p-value: NaN. FT GENSCAN 1278 1278 AA on splice site: a/gg -> R. FT GENSCAN 1279 1319 INTERNAL EXON; p-value: NaN. FT GENSCAN 1320 1341 INTERNAL EXON; p-value: NaN. FT GENSCAN 1342 1342 AA on splice site: g/cg -> A. FT GENSCAN 1343 1372 INTERNAL EXON; p-value: NaN. FT GENSCAN 1373 1418 INTERNAL EXON; p-value: NaN. FT GENSCAN 1419 1446 INTERNAL EXON; p-value: NaN. FT GENSCAN 1447 1474 INTERNAL EXON; p-value: NaN. FT GENSCAN 1475 1475 AA on splice site: g/tt -> V. FT GENSCAN 1476 1482 INTERNAL EXON; p-value: NaN. FT GENSCAN 1483 1526 INTERNAL EXON; p-value: NaN. FT GENSCAN 1527 2134 INTERNAL EXON; p-value: NaN. FT GENSCAN 2135 2197 INTERNAL EXON; p-value: NaN. FT GENSCAN 2198 2242 INTERNAL EXON; p-value: NaN. FT GENSCAN 2243 2243 AA on splice site: gt/c -> V. FT GENSCAN 2244 2323 INTERNAL EXON; p-value: NaN. FT GENSCAN 2324 2324 AA on splice site: ag/a -> R. FT GENSCAN 2325 2386 INTERNAL EXON; p-value: NaN. FT GENSCAN 2387 2408 INTERNAL EXON; p-value: NaN. FT GENSCAN 2409 2491 INTERNAL EXON; p-value: NaN. FT GENSCAN 2492 2492 AA on splice site: g/at -> D. FT GENSCAN 2493 2626 LAST EXON; p-value: NaN. SQ SEQUENCE 2626 AA; 293420 MW; 9E87B93639887A17 CRC64; MDNNSVIGSE VDAEADESYV NAALEDGQTG KKSVQRNYAT VLTEEDIRAL MEIDVQSVSD FTSLSKAEAT LLLSHLRWGL DAPLILVQVL EMPVSPVTVW LDSVGMFCHV DWIEDMEGTG GDLHFCTFDA VLSDQRGKMS ESDSNRYEDC YENWDSNELI QELSNTQLEN VSQLKFILEA GLQIIECRRV LEWTYVYGYY LREDEVGKQN LLKDTQERLK KFVENLKHCL ETNLQPFRYE EEPSKDFNAF RIKLTELTSL YLKIMDSDDD MHDMDSVDYD YYSGGTYDDN DSDETDFGFG EADTDDAAII ASYRSKSNYV VLKEEDIRRH QNDDVGRVSA VLSITDVEAS TLLLHYHWSV SKVNDEWFAD EERVRRTVGI LEGPVVTTPD GREVIPLILD SELFSIILAC LHVEYALIPT LLRKLYRFLV VILSALHVGQ MLTSVASSET AIVDLLLSLD LSECLFSYIS TTINDGPGCL MLKCPDPSCP AAIGRDMIDK LASKEDKEKY YRYFLRSYVE VNREDVNMQL ILLVGPKVMM FRACVRIAFA GMILANSKPC PKCKRPIEKN HGCMHMTCTP PCKFEFCWLC LNAWTEHGER TGGFYACNRY EAAKQEGLYD EAERRREMAK NSLERYTHYY ERWASNQVLG KLSDIQCTPE SQLKFIAEAW LQIIECRRVL KWTYAYGYYL QDHAKKPFFE YLQGEAESGL ERLHKCVEKD IEVFELAEGP SEEFNHFRTK LTGLTSITKT FFENLVKALE NGLADVDSQA ASSKPANSKP SSKTKGGGKD GSKGFVKRVA SSFSMRKKKN ATSEPKLLPR SKSTGSANFE SMRLPATKKI SDVTNKTRIK PLGGVAPAQP RREKIDDRGT NNKFGKWRSF DDSDSIWLSS DCASPTSLLE ERRLSVSFRF SVDESVVSWL SNLAKTSLSL NHQEVSSIKD RPRIPRNTKE NAENIQKKDS SRSVPNLTVV DSSTQSSQGK KVSFSKSSGT QLESGNHASS LIISSDVPSD LNNHTATSLV REICLDEKSA EIVDSKSSGS NVDEPLFWPY EQRFDWTPED ILKHFSMSPR RKKLLNAKVS AGSSPRSMRA QLLQARKLDL KDGSKRKLVF NGPLTNAAKI PELKRNNSNK KNDSIKNEPI RNCVKRNKSL PSRLRNSSKT CSKVVPFEVA EEVIAAERAK VEITARKLIN RRSKTMLEDD FSLMNDFSIE NAVGLEAAHG ASESETRVSL RKKRIKQDDL EPVKKCSARE TKARKDMCGL PDIEDSPYKK TNGTASSRLG IPPENWEKVL EGIRKMKPSE EAPVNAVECD RTGSFLPPKE RRFYVLIGTL LSSQTKEHIT GAAVERLHQN GLLTPEAIDK ADESTIKELI YPVGFYTRKA TNVKKVAKIC LMEYDGDIPR TLEELLSLPG VGPKIAHLKT SSPEETRVAL QQWLPKGEWV AINFLLKHKD KNGSLSDRKD AAQRKPSSEG IKCRVLSFST SLNVYTFAND VQPIASDLRR STRKRRISVN LEDYTDSSGA EDEDMMSPAY RTLRRRVHKN FSTSKSRKDM DAELAPRREG LRPRRSTTIA NKRLKTESGA DQDTSEEKDG QDETENGNEL DDADDGENEV EAEDEGNGED EGDGEDEGEE DGDDDEEGDE EQEGRKRYDL RNRAEVRRMP TGEINKQQQP RSPRRVLHQG MGTRVGRDGR RGGSRPHKRH RFTRTDDSDD SLLVDELDQG PAIPWARGGN RSGAPWLFGG LDTYGSSSLG LNVGASGWGH QSDGLAALTS GVQTAGPSSK GGADIQPLQI NEDINFDDIG GLSEYINDLK EMVFFPLLYP EFFASYSITP PRGVLLCGPP GTGKTLIARA LACAASKAGQ KVSFYMRKGA DVLSKWVGEA ERQLKLLFEE AQRNQPSIIF FDEIDGLAPV RSSKQEQIHN SIVSTLLALM DGLDSRGQVV LIGATNRVDA IDGALRRPGR FDREFNFSLP GCEARAEILD IHTRKWKHPP TRELKEELAA TCVGYCGADL KALCTEAAIR AFREKYPQVY TSDDKYAIDV GLVNVEKSHF VEAMSAITPA AHRGSVVQSR PLSPVVLPCL HRHLLESMSL ISDIFPSSAT SSELTKLSIL TFGSAIPLVY RPRLLLLGGE GVGLDHLGPA ILHELEKFPI HSLGLPSLLS DPGAKTPEEA LVHIFSEARR TTPSILYIPM FNNWWENAHE QLRAVFLTLL EELPSNLPIL LLATSYGELS DMEEQSVFDN RSVYTVDKPS SEDRSLFFDR LIEAALSVIS GLNGKPDGPQ PLPELPKVPK EPTGPKPAEV KAKVEAEQHA LRRLRMCLRD VCNRILYDKR FSAFHFPVTD EDAPNYRSII QIPMDTATLL QRVDTGQYLT CTPFLQDVDL IVRNAKAYNG DDYAGARIVS RAYELRDVVH GMLSQMDPAL LTYCDKIAAE GGPSLIPDDL SGSILGLAPV VQMGTVTRTS ARLRNVQPEV NLDRDYEGLK KPKKTTDAVS IDSAADKSQN QDSGQEMPSP DAANPQSAAP SPTDGDREDQ SEPPSKEASA EDMSGDSCKG PAAKSDKEIS SRTESVKGVF MERTDNYSIP QMERLYTRIM KGVLETLDKG LRDDDNNPKH SILRFLSEFA QHQANF // ID NC003070_359 HYPOTHETICAL; PRT; 633 AA. AC NC003070_359; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1797043...1797920, 1799031...1799116, DE 1799376...1800313]; Length: 1902. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 292 FIRST EXON; p-value: NaN. FT GENSCAN 293 293 AA on splice site: ag/c -> S. FT GENSCAN 294 321 INTERNAL EXON; p-value: NaN. FT GENSCAN 322 322 AA on splice site: g/tc -> V. FT GENSCAN 323 633 LAST EXON; p-value: NaN. SQ SEQUENCE 633 AA; 74201 MW; D534ED7FAB977202 CRC64; MSSRDKPKTF AEVREEICKR REEMISRDNQ KKTKTVAQVR EEKGKRREEM ISRYNQKKAK TVAQVKEGKG KRREEMISRD NRTKPKTVAQ VRDAKRKRTF DHVPRGTREP HAYLRNDPAP QVASVPKSVP EEKDVILDRI LSNVPRRKKT TSYEFVPPKH PQEPQWLLQV MSRMNGAGDP KLIIEKNLDS NDVDPRQNRL SIPINTVIQN DFLTLDESRL IDEDEITNEG NMGVAAFLVD QRTKKWNMGF KQWFMTTDSG SSYWSFVLRG EWSNVVETNG LKEGDKISLW SFSNRLTHES KVHSLGKFTA LIPDFRLKLY KVLLPQASRV RVQITMTTND DDHAAERNTM SMNYELAATL SDEEQRARKG KAKIVCKEED HVFIKKKEKY EEESEKREFF SHVPRKIRPA LRYPQPNFEN PNGASSSLNL PFEEDYYMAE YYKKTETINP PNPYHQWSPS SFLTEYTHPR MLEVLHRCGF NRPVVTCYSR TAREMRWWLR QVMKDMRAED LTLILEKTLS TTDVITTTHG RFSMHFNRLI SNDFLKPEER SILEEDTYND ETMGVGAILV DQRSQKWSVI LKRWGQNYFL SCGWNDVVKA NKLKAGDDIC LWAFRCDGVL CFAMRQYSSY FRH // ID NC003070_360 HYPOTHETICAL; PRT; 991 AA. AC NC003070_360; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1807470...1807306, 1807197...1806762, DE 1806684...1806537, 1806116...1805963, 1805878...1805666, DE 1805583...1805368, 1805252...1805186, 1805105...1804951, DE 1803830...1803692, 1803606...1803197, 1803117...1802941, DE 1802869...1802645, 1802259...1802023, 1801673...1801578, DE 1801501...1801364]; Length: 2976. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 55 FIRST EXON; p-value: NaN. FT GENSCAN 56 200 INTERNAL EXON; p-value: NaN. FT GENSCAN 201 201 AA on splice site: g/gt -> G. FT GENSCAN 202 249 INTERNAL EXON; p-value: NaN. FT GENSCAN 250 250 AA on splice site: tg/g -> W. FT GENSCAN 251 301 INTERNAL EXON; p-value: NaN. FT GENSCAN 302 372 INTERNAL EXON; p-value: NaN. FT GENSCAN 373 444 INTERNAL EXON; p-value: NaN. FT GENSCAN 445 466 INTERNAL EXON; p-value: NaN. FT GENSCAN 467 467 AA on splice site: g/gt -> G. FT GENSCAN 468 518 INTERNAL EXON; p-value: NaN. FT GENSCAN 519 564 INTERNAL EXON; p-value: NaN. FT GENSCAN 565 565 AA on splice site: g/ga -> G. FT GENSCAN 566 701 INTERNAL EXON; p-value: NaN. FT GENSCAN 702 760 INTERNAL EXON; p-value: NaN. FT GENSCAN 761 835 INTERNAL EXON; p-value: NaN. FT GENSCAN 836 914 INTERNAL EXON; p-value: NaN. FT GENSCAN 915 946 INTERNAL EXON; p-value: NaN. FT GENSCAN 947 991 LAST EXON; p-value: NaN. SQ SEQUENCE 991 AA; 107709 MW; 353EAA76BCFAD075 CRC64; MEEVDCSLPV TKTTDSCPTE DAIRALLESL VDPLLPSKPT DDLPSTSIRE SVAKQVHAVV LLYNYYHRKD NPHLECLSFE SFRSLATVMK PALLQHLKED GGVSGQTVLL EKVIVDACSL SMSLDASSDL FILNKCPIRR VAVLLVDSEK KSCYLQHSSI TQGVWSLLEK PIEKEKAARE NQKEEGVFQK VAFAVVKEAT GVNHKDIVIL ERHLVCSLSE EKTAVRFYIM KCTSQDKFSG ENPVEEVLSW RGDTEFVIEK EPEAVCDDIE SNKVDATKES EVSDIFERRE KAALKRRYEI KAKKVAALLS HPGARGKATT RLQNRYLKGS MSGAKEPNVH SETVVALKAK NVGNEMSPCK DNYSNGEKGG FEVASDPKEL KERGLQRKKA VPDRLNSIHK LNSTPASAHN SNPNLEELQT SLLSKATSLS ETALKVLLCK RDKLTRQQRN IEDEIAKCDK CIQNIKGDWE LQLETVLECC NETYPRRNLQ ESLDKSACQS NKRLKLSETL PSTKSLCQTA VRSTSGDSLV RRLGLFDLIL LGVGASIGAG VFVVTGTVAR DAGPGVTISF LLAGASCVLN ALCYAELSSR FPAVVGGAYM YSYSAFNEIT AFLVFVQLML DYHIGAASIS RSLASYAVAL LELFPALKGS IPLWMGSGKE LLGGLLSLNI LAPILLALLT LVLCQGVRES SAVNSVMTAT KVVIVLVVIC AGAFEIDVAN WSPFAPNGFK AVLTGATVVF FSYVGFDAVA NSAEESKNPQ RDLPIGIMGS LLVCISLYIG VCLVLTGMVP FSLLSEDAPL AEAFSSKGMK FVSILISIGA VAGLTTTLLV GLYVQTGYSV VAACVVALRL NDKKDRESSN RWTSSWQEGV ICLVIIACSG FGAGVFYRFS ASVIFILLSV GVAVVASAVL HYRQAYALPL GSGFSCPGVP IVPSVCIFFN IFLFAQLHYE AWIRFVVVSV LATAVYALYG QYHADPSMLD YQRAPETESD A // ID NC003070_361 HYPOTHETICAL; PRT; 1109 AA. AC NC003070_361; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1816990...1816544, 1816407...1816256, DE 1815971...1815929, 1814732...1814658, 1814561...1814446, DE 1814257...1814143, 1814054...1813926, 1813287...1813133, DE 1813028...1812832, 1812639...1812449, 1811756...1811607, DE 1811515...1811162, 1811020...1810871, 1810580...1810505, DE 1809960...1809812, 1809699...1809505, 1809361...1809221, DE 1809152...1809018, 1808935...1808735, 1808635...1808477]; Length: 3330. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 149 FIRST EXON; p-value: NaN. FT GENSCAN 150 199 INTERNAL EXON; p-value: NaN. FT GENSCAN 200 200 AA on splice site: cg/a -> R. FT GENSCAN 201 214 INTERNAL EXON; p-value: NaN. FT GENSCAN 215 239 INTERNAL EXON; p-value: NaN. FT GENSCAN 240 277 INTERNAL EXON; p-value: NaN. FT GENSCAN 278 278 AA on splice site: at/g -> M. FT GENSCAN 279 316 INTERNAL EXON; p-value: NaN. FT GENSCAN 317 359 INTERNAL EXON; p-value: NaN. FT GENSCAN 360 410 INTERNAL EXON; p-value: NaN. FT GENSCAN 411 411 AA on splice site: at/g -> M. FT GENSCAN 412 476 INTERNAL EXON; p-value: NaN. FT GENSCAN 477 477 AA on splice site: g/cc -> A. FT GENSCAN 478 540 INTERNAL EXON; p-value: NaN. FT GENSCAN 541 590 INTERNAL EXON; p-value: NaN. FT GENSCAN 591 708 INTERNAL EXON; p-value: NaN. FT GENSCAN 709 758 INTERNAL EXON; p-value: NaN. FT GENSCAN 759 783 INTERNAL EXON; p-value: NaN. FT GENSCAN 784 784 AA on splice site: g/at -> D. FT GENSCAN 785 833 INTERNAL EXON; p-value: NaN. FT GENSCAN 834 898 INTERNAL EXON; p-value: NaN. FT GENSCAN 899 945 INTERNAL EXON; p-value: NaN. FT GENSCAN 946 990 INTERNAL EXON; p-value: NaN. FT GENSCAN 991 1057 INTERNAL EXON; p-value: NaN. FT GENSCAN 1058 1109 LAST EXON; p-value: NaN. SQ SEQUENCE 1109 AA; 123700 MW; 1C87B853D019AD94 CRC64; MATPEEVAYE KFLERVRRTV YVDELTPLAT APVISSAFNQ FGTVKKVSFI PNYLGPKELP MGVLVEMENE EMTQAVISTV SQLPFMVAGM PRPVRACAAE PNMFVDKPKK PGRTVRFRWI KPNDPDFDKA RRVKRLARKH SAENSFMLKK QLEEAEKLSK QQAETAVTHH KKFEMMDKLL YDGVAQKLAG RYDLKGFPYR FLAFGFLHSR FLSLEAEPND RKIGKLCEYA SRNPLRIPKI TEYLEQKCYK ELRNGNIGSV KVVLCIYKKL LSSCKEQMPL FSCSLLSIVR TLLEQTKEEE VQILGCNTLV DFISLQTVNS HMFNLEGLIP KLCQLAQEMG DDERSLQLRS AGMQALAFMI ISVILENYMD LEKGQEDTKE VDQISDTKIP NMTKKVSFKP NPVTDYKLEN MDISKSPSYW SMVCLCNIAK LAKETTTVRR VLEPLLTAFD SGDYWSPQKG VASSVLLFLQ SRLEESATCL ALHAKQQASG AMTAVIADLI KHLRKCLQNA AESDVSVDKT KQNSDLQHAL ENCIAELSNK VGDAGPILDM FAVVLETIST NVVLSRTTAS AILRAAHIVS VVPNVSYHKK VFPDALFHQL LLAMSHADCT TRVEAHNIFS VVLLGTLRLP WSDQHKETSE AVSGSLSVDG ICTVRNQEEE KEKVEKSLNS ELCKDVNHIS RPSVSGQTSQ QLSCQSLDSL KDLDDGIKSL CSLRLSSHQV NMLLSSLWIQ ATSTDNTPEN FEAMASTYQI TLLFSLAKRS NHMALVQCFQ LAFSLRNLSL NQDDFWYNVE GGMQHSRRRS IFTFASYMLI FGAKISNILE LVPIIKESLT AQMVDPYLVL EGDIRLRAVC SGFPQEETYG SDKDDSAALN SSVIVTDDRR LKEIVITHFT SKLQTLSEEE QLNLRKEIQS DFSLDDAHSL GGQLFTDTPG PSSPLNQTEL PAFEEVELSD IAAFEGISPG ASGSQSGHRT SLSTNTNPVD VLSVNELLES VSETARQVAS LPVSSIPVPY DQMMNQCEAL VTGKQQKMSV LRSFKPQATK AITSEDNEKD EQYLLKETEE AGEDDEKAII VADVQPQGQL GFFSQEVPQN SFRLPPSSPY DKFLKAAGC // ID NC003070_362 HYPOTHETICAL; PRT; 150 AA. AC NC003070_362; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1818587...1819039]; Length: 453. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 150 SINGLE EXON; p-value: NaN. SQ SEQUENCE 150 AA; 16957 MW; 6AC33ADB5451D267 CRC64; MDPTELKRVF QMFDKNGDGT ITGKELSETL RSLGIYIPDK ELTQMIEKID VNGDGCVDID EFGELYKTIM DEEDEEEEDM KEAFNVFDQN GDGFITVDEL KAVLSSLGLK QGKTLDDCKK MIKKVDVDGD GRVNYKEFRQ MMKGGGFNSL // ID NC003070_363 HYPOTHETICAL; PRT; 435 AA. AC NC003070_363; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1821801...1820494]; Length: 1308. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 435 SINGLE EXON; p-value: NaN. SQ SEQUENCE 435 AA; 48123 MW; 52718C8CDD0F448C CRC64; MTTTTTKKPH VLVIPFPQSG HMVPHLDLTH QILLRGATVT VLVTPKNSSY LDALRSLHSP EHFKTLILPF PSHPCIPSGV ESLQQLPLEA IVHMFDALSR LHDPLVDFLS RQPPSDLPDA ILGSSFLSPW INKVADAFSI KSISFLPINA HSISVMWAQE DRSFFNDLET ATTESYGLVI NSFYDLEPEF VETVKTRFLN HHRIWTVGPL LPFKAGVDRG GQSSIPPAKV SAWLDSCPED NSVVYVGFGS QIRLTAEQTA ALAAALEKSS VRFIWAVRDA AKKVNSSDNS VEEDVIPAGF EERVKEKGLV IRGWAPQTMI LEHRAVGSYL THLGWGSVLE GMVGGVMLLA WPMQADHFFN TTLIVDKLRA AVRVGENRDS VPDSDKLARI LAESAREDLP ERVTLMKLRE KAMEAIKEGG SSYKNLDELV AEMCL // ID NC003070_364 HYPOTHETICAL; PRT; 94 AA. AC NC003070_364; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1823346...1823457, 1823497...1823669]; DE Length: 285. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 37 FIRST EXON; p-value: NaN. FT GENSCAN 38 38 AA on splice site: t/ta -> L. FT GENSCAN 39 94 LAST EXON; p-value: NaN. SQ SEQUENCE 94 AA; 10672 MW; 6CC4510A4C607B3E CRC64; MGREKSPGLK ILWVWTIGTA ASMILFLTFT EFRHEIILLV TSVVRTRMQD MQTMMNQNQE QAPKENQNGS AGDSSVLMDE TVLPESDREI AEKL // ID NC003070_365 HYPOTHETICAL; PRT; 345 AA. AC NC003070_365; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1824547...1824948, 1825313...1825645, DE 1825724...1825840, 1825915...1826100]; Length: 1038. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 134 FIRST EXON; p-value: NaN. FT GENSCAN 135 245 INTERNAL EXON; p-value: NaN. FT GENSCAN 246 284 INTERNAL EXON; p-value: NaN. FT GENSCAN 285 345 LAST EXON; p-value: NaN. SQ SEQUENCE 345 AA; 37629 MW; 7D2F453E14B813C9 CRC64; MASSTGEKGL IVSFGEMLID FVPTVSGVSL SESPGFLKAP GGAPANVAIA VSRLGGRAAF VGKLGDDDFG HMLAGILRKN GVDDQGINFD EGARTALAFV TLRSDGEREF MFYRNPSADM LLRPDELNLE LIRSAKVFHY GSISLITEPC RSAHMKAMEV AKEAGALLSY DPNLREPLWP SPEEARTQIM SIWDKADIIK VSDVELEFLT ENKTMDDKTA MSLWHPNLKL LLVTLGEKGC TYFTKKFHGS VETFHVDAVD TTGAGDSFVG ALLQQIVDDQ SVLEDEARLR KVLRFANACG AITTTKKGAI PALPTDIEAL SFLKDQKKRQ TNLKFSKWCC TASPC // ID NC003070_366 HYPOTHETICAL; PRT; 335 AA. AC NC003070_366; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1826741...1827283, 1827373...1827705, DE 1827797...1827928]; Length: 1008. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 181 FIRST EXON; p-value: NaN. FT GENSCAN 182 292 INTERNAL EXON; p-value: NaN. FT GENSCAN 293 335 LAST EXON; p-value: NaN. SQ SEQUENCE 335 AA; 36936 MW; 9D4925282EBA6D61 CRC64; MTSKQDLNPV FTLLEPYINY SKLRSLFILS IVFILRSLEH KIDLVQMTSS NGDNKGLVVS FGEMLIDFVP TESGVSLSES SGFLKAPGGA PANVAIAVSR LGGRAAFVGK LGDDEFGHML AGILRKNDVD DQGINFDKGA RTALAFVTLR SDGEREFMFY RNPSADMLLR PDELNLELIR SAKVFHYGSI SLITEPCRSA HMKAMEVAKE AGALLSYDPN LREPLWPSPE EARKQIMSIW DKADIIKVSD VELEFLTGNK TIDDETAMSL WHPNLKLLLV TLGENGCRYY TKDFHGSVET FHVDAVDTTG AGDSFVGALL NQIVDDQSVL EVNSH // ID NC003070_367 HYPOTHETICAL; PRT; 569 AA. AC NC003070_367; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1831666...1830729, 1829683...1829476, DE 1829306...1828986, 1828903...1828661]; Length: 1710. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 312 FIRST EXON; p-value: NaN. FT GENSCAN 313 313 AA on splice site: ca/a -> Q. FT GENSCAN 314 382 INTERNAL EXON; p-value: NaN. FT GENSCAN 383 489 INTERNAL EXON; p-value: NaN. FT GENSCAN 490 569 LAST EXON; p-value: NaN. SQ SEQUENCE 569 AA; 63462 MW; FFBD7D6BD9DF1E32 CRC64; MAGSVGEETE PEWIKRVKLE GAVPCLKPDD NCKNGWTTPS PDTFMVRGPK YFSDKVKIPA GDFLLKPLGF DWIKGPKKLS EILSYPSSRI RKVIDEEFQK DGTKPFVWAF NLQLPHKDNY SAVAYFVTTE PILEGSLMDR FLKGDDGFKK SRLKLIANIV KGPWIVRKAV GEQAICVIGR ALSCKYVSGE NFVEIDVDIG SSMVASAIVH LAFGYVTTLT VDLAFLIESQ TEAELPEKLL GAVRFSELQT ESATSIELSS STSNDQWDQT TSERSSWWKS IGNGFSNLLN QDTANMNNTS HGDIQKDEHV QKQYFGFLYP VMKIQCDVCE KAPATVICCA DEAALCPQCD IEIHAANKLA SKHQRLHLNS LSTKFPRCDI CQEKAAFIFC VEDRALLCRD CDESIHVANS RSANHQRFLA TGIKVALTST ICSKEIEKNQ PEPSNNQQKA NQIPAKSTSQ QQQQPSSATP LPWAVDDFFH FSDIESTDKK GQLDLGAGEL DWFSDMGFFG DQINDKALPA AEVPELSVSH LGHVHSYKPM KSNVSHKKPR FETRYDDDDE EHFIVPDLG // ID NC003070_368 HYPOTHETICAL; PRT; 202 AA. AC NC003070_368; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1833070...1833096, 1833419...1833560, DE 1833654...1833801, 1833885...1833960, 1834068...1834179, DE 1834658...1834761]; Length: 609. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 9 FIRST EXON; p-value: NaN. FT GENSCAN 10 56 INTERNAL EXON; p-value: NaN. FT GENSCAN 57 57 AA on splice site: c/aa -> Q. FT GENSCAN 58 105 INTERNAL EXON; p-value: NaN. FT GENSCAN 106 106 AA on splice site: tg/c -> C. FT GENSCAN 107 131 INTERNAL EXON; p-value: NaN. FT GENSCAN 132 168 INTERNAL EXON; p-value: NaN. FT GENSCAN 169 169 AA on splice site: g/ga -> G. FT GENSCAN 170 202 LAST EXON; p-value: NaN. SQ SEQUENCE 202 AA; 23272 MW; D799D26EEAAEAAB8 CRC64; MDPRQFEHTV VADNDIHSIV MSYLLHNCFN ETADSLASST GVKQPAIDRD NMERRKQIIH FILERKALKA FELTEQLAQD LLEKNKDLQF DLLCLHFVEL ICAGNCTEAL KFGKTRLAPF GKVKKYVEKL EDVMALLAYE DPEKSPMFHL LSSEYRQQVA DNLNRTILGD QYCEGQFRLG LVWFGSVLYQ KIVSSTLCEF SR // ID NC003070_369 HYPOTHETICAL; PRT; 423 AA. AC NC003070_369; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1837115...1836469, 1836245...1836113, DE 1836028...1835944, 1835606...1835200]; Length: 1272. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 215 FIRST EXON; p-value: NaN. FT GENSCAN 216 216 AA on splice site: ag/g -> R. FT GENSCAN 217 260 INTERNAL EXON; p-value: NaN. FT GENSCAN 261 288 INTERNAL EXON; p-value: NaN. FT GENSCAN 289 289 AA on splice site: g/ct -> A. FT GENSCAN 290 423 LAST EXON; p-value: NaN. SQ SEQUENCE 423 AA; 47132 MW; D4057B895A78A07F CRC64; MDKEKSPAPP PSGGLPPPSG RYSAFSPNGS SFAMKAESSF PPLTPSGSNS SDANRFSHDI SRMPDNPPKN LGHRRAHSEI LTLPDDLSFD SDLGVVGAAD GPSFSDDTDE DLLYMYLDME KFNSSATSTS QMGEPSEPTW RNELASTSNL QSTPGSSSER PRIRHQHSQS MDGSTTIKPE MLMSGNEDVS GVDSKKAISA AKLSELALID PKRAKRIWAN RQSAARSKER KMRYIAELER KVQTLQTEAT SLSAQLTLLQ RDTNGLGVEN NELKLRVQTM EQQVHLQDAL NDALKEEVQH LKVLTGQGPS NGTSMNYGSF GSNQQFYPNN QSMHTILAAQ QLQQLQIQSQ KQQQQQQQHQ QQQQQQQQQF HFQQQQLYQL QQQQRLQQQE QQSGASELRR PMPSPGQKES VTSPDRETPL TKD // ID NC003070_370 HYPOTHETICAL; PRT; 209 AA. AC NC003070_370; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1844104...1844101, 1844069...1843924, DE 1841466...1841350, 1839048...1838989, 1838924...1838816, DE 1837723...1837530]; Length: 630. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 1 FIRST EXON; p-value: NaN. FT GENSCAN 2 2 AA on splice site: a/aa -> K. FT GENSCAN 3 50 INTERNAL EXON; p-value: NaN. FT GENSCAN 51 89 INTERNAL EXON; p-value: NaN. FT GENSCAN 90 109 INTERNAL EXON; p-value: NaN. FT GENSCAN 110 145 INTERNAL EXON; p-value: NaN. FT GENSCAN 146 146 AA on splice site: a/ct -> T. FT GENSCAN 147 209 LAST EXON; p-value: NaN. SQ SEQUENCE 209 AA; 24797 MW; A31B7CCF53FEEA02 CRC64; MKLRKYGDVY NYEVGNFVGE KNERASSNLN GKKAAISEEI LEPFRDFEAP AYGEMENRCM SRSNKKLRIT EQSKTHAVEK ILQSANAYQL IILISTKKWT RCSMYIMEQK HMKQLKSNMN TNKPDMHEQT LECRLKFYIP VNQKNTDYTL IPTKVPLTRT KLRLNDIVYA VNLLMHTSSK DPLMGERMFY FFFFFFFFFF LLSRRISLI // ID NC003070_371 HYPOTHETICAL; PRT; 374 AA. AC NC003070_371; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1847920...1848231, 1848315...1848455, DE 1848588...1848728, 1851523...1851840, 1851956...1852096, DE 1852947...1853018]; Length: 1125. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 104 FIRST EXON; p-value: NaN. FT GENSCAN 105 151 INTERNAL EXON; p-value: NaN. FT GENSCAN 152 198 INTERNAL EXON; p-value: NaN. FT GENSCAN 199 304 INTERNAL EXON; p-value: NaN. FT GENSCAN 305 351 INTERNAL EXON; p-value: NaN. FT GENSCAN 352 374 LAST EXON; p-value: NaN. SQ SEQUENCE 374 AA; 43789 MW; F569B278339BC3D1 CRC64; MGDTTKDDGS SQSKAVRGEK RAFFFRKWTR IDIARASAVG AVHLLCLLAP FNYKWEALRF GVILAIVTSL SITFSYHRNL THKSFKLPKW LEYPFAYSAL FALQGHPIDW VSTHRFHHQF TDSDRDPHSP IEGFWFSHVF WIFDTSYIRE KCGGRDNVMD LKQQWFYRFL RNTIGLHILT FWTLVYLWGG LPYLTCGVGP MSETTKDDGS SQKKSVRKEK RAYVLRKWTQ FDVGRASTVG TVHLLCLLAP FNYKWEAFRF GIILAILTNL CITFSYHRNL THRSFKLPKW LEYPFAYSAL LALQGDPLDW VSIHRFHHQF TDSDRDPHSP IEGFWFSHVL WIFDTDYIRE KALGLATNVK LPTDAQKRKM AIRR // ID NC003070_372 HYPOTHETICAL; PRT; 396 AA. AC NC003070_372; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1854937...1854734, 1854495...1853914, DE 1853832...1853614, 1853510...1853404, 1853314...1853236]; Length: 1191. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 68 FIRST EXON; p-value: NaN. FT GENSCAN 69 262 INTERNAL EXON; p-value: NaN. FT GENSCAN 263 335 INTERNAL EXON; p-value: NaN. FT GENSCAN 336 370 INTERNAL EXON; p-value: NaN. FT GENSCAN 371 371 AA on splice site: ag/t -> S. FT GENSCAN 372 396 LAST EXON; p-value: NaN. SQ SEQUENCE 396 AA; 43838 MW; 7E83434406B40A08 CRC64; MGLEDAGDLV LHIVLSKIGP ENTARVACVS KRLKVSASEE SLWSIFCSND LNISTPLDPH GDPAPSFKAT LRKGVTEDDL QEFETSLKVK LPLPTRLLYR FVDGQELSSP NGLDGSLGLI GGYSAYSHDV NVYLLPLKEV MRETKESFMR DLGFSSRLDL IVMAASVVAS LKIFLLDCTT GQLFTGTSNR QLLPCVPDAL VRSVHDTNGD QQQDAMLLWL EEHGRRLQTG TINVRQQNNV KSISLFPEIP PLCSVSVTNG VQVRASSVFI PEISNLRDQP PAYWYAYSIR MSLMPEGCIL NGTHHSSCQL YWRHWVIRAD NEVIDNVNGE AVIGKYPLLQ AGEEEFVYES CSSFPTTAGS IDGSFTFVPG SLRDPKGSQF EVKVVEFPLE LPDYIF // ID NC003070_373 HYPOTHETICAL; PRT; 174 AA. AC NC003070_373; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1855962...1856273, 1856354...1856494, DE 1857444...1857515]; Length: 525. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 104 FIRST EXON; p-value: NaN. FT GENSCAN 105 151 INTERNAL EXON; p-value: NaN. FT GENSCAN 152 174 LAST EXON; p-value: NaN. SQ SEQUENCE 174 AA; 20359 MW; 6DFE64482195A81D CRC64; MGDKNKDDSS SQSKAVRKEK RAFLFRKWTR VDVMRVSAVG AVHLLCLLAP FNYTWEAFRF AAMVGISTNL SITFSYHRNL THRSFKLPKW LEYPFAYSAL FALQGHPIDW VSTHRFHHQF TDSDRDPHSP IEGFWFSHVF WIFDTSYIRE KVLGLATDVK LPTDAQKRKM SLAR // ID NC003070_374 HYPOTHETICAL; PRT; 250 AA. AC NC003070_374; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1861125...1861080, 1860063...1859975, DE 1859763...1859560, 1859451...1859328, 1858792...1858692, DE 1858221...1858033]; Length: 753. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 15 FIRST EXON; p-value: NaN. FT GENSCAN 16 16 AA on splice site: c/ga -> R. FT GENSCAN 17 45 INTERNAL EXON; p-value: NaN. FT GENSCAN 46 113 INTERNAL EXON; p-value: NaN. FT GENSCAN 114 154 INTERNAL EXON; p-value: NaN. FT GENSCAN 155 155 AA on splice site: g/gc -> G. FT GENSCAN 156 188 INTERNAL EXON; p-value: NaN. FT GENSCAN 189 250 LAST EXON; p-value: NaN. SQ SEQUENCE 250 AA; 27918 MW; 52186CE1D3CBCDFA CRC64; MRTRLPTPDV YFCVPRLCVM LVNPFAFQDS AVSPMSLPHC KLNWFVPCLT DNYAYILHDE DTGTVGVVDP SEAVPVMDAL QKNSRNLTYI LNTHHHYDHT GGNLELKDRY GAKVIGSAAD RDRIPGIDVA LKDADKWMFA GHEVHIMETP GHTRGHISFY FPGARAIFTG DTLFSLSCGK LFEGTPEQIF LINFVSGKSQ IPTTMKMEKA CNPFLRTENT DIRRALGIPE TADEAEALGI IRRAKDNFKA // ID NC003070_375 HYPOTHETICAL; PRT; 92 AA. AC NC003070_375; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1862306...1862462, 1863127...1863161, DE 1863624...1863710]; Length: 279. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 52 FIRST EXON; p-value: NaN. FT GENSCAN 53 53 AA on splice site: g/aa -> E. FT GENSCAN 54 64 INTERNAL EXON; p-value: NaN. FT GENSCAN 65 92 LAST EXON; p-value: NaN. SQ SEQUENCE 92 AA; 10557 MW; 213887908502A3CF CRC64; MKRQVMIFVM LVAFFVVFLD VKQVEAMRPF PTAADEIRFV FQALQRGPVS GSEYFLFEKA SSRKQLEHES DLTVGLKRPS GFDLCGVQEN HE // ID NC003070_376 HYPOTHETICAL; PRT; 558 AA. AC NC003070_376; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1864795...1866471]; Length: 1677. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 558 SINGLE EXON; p-value: NaN. SQ SEQUENCE 558 AA; 62380 MW; 72C625DB6C9897F5 CRC64; MLPVNRARAL LTILSQAKTL NHTQQVHAKV IIHGFEDEVV LGSSLTNAYI QSNRLDFATS SFNRIPCWKR NRHSWNTILS GYSKSKTCCY SDVLLLYNRM RRHCDGVDSF NLVFAIKACV GLGLLENGIL IHGLAMKNGL DKDDYVAPSL VEMYAQLGTM ESAQKVFDEI PVRNSVLWGV LMKGYLKYSK DPEVFRLFCL MRDTGLALDA LTLICLVKAC GNVFAGKVGK CVHGVSIRRS FIDQSDYLQA SIIDMYVKCR LLDNARKLFE TSVDRNVVMW TTLISGFAKC ERAVEAFDLF RQMLRESILP NQCTLAAILV SCSSLGSLRH GKSVHGYMIR NGIEMDAVNF TSFIDMYARC GNIQMARTVF DMMPERNVIS WSSMINAFGI NGLFEEALDC FHKMKSQNVV PNSVTFVSLL SACSHSGNVK EGWKQFESMT RDYGVVPEEE HYACMVDLLG RAGEIGEAKS FIDNMPVKPM ASAWGALLSA CRIHKEVDLA GEIAEKLLSM EPEKSSVYVL LSNIYADAGM WEMVNCVRRK MGIKGYRKHV GQSATEVG // ID NC003070_377 HYPOTHETICAL; PRT; 1337 AA. AC NC003070_377; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1877149...1877042, 1876954...1876839, DE 1874684...1874559, 1872887...1872744, 1872430...1872349, DE 1872266...1872225, 1871972...1871913, 1871785...1871666, DE 1871574...1870366, 1870257...1870156, 1870002...1869862, DE 1869723...1869598, 1868765...1867128]; Length: 4014. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 36 FIRST EXON; p-value: NaN. FT GENSCAN 37 74 INTERNAL EXON; p-value: NaN. FT GENSCAN 75 75 AA on splice site: ag/t -> S. FT GENSCAN 76 116 INTERNAL EXON; p-value: NaN. FT GENSCAN 117 117 AA on splice site: ag/g -> R. FT GENSCAN 118 164 INTERNAL EXON; p-value: NaN. FT GENSCAN 165 165 AA on splice site: gg/g -> G. FT GENSCAN 166 192 INTERNAL EXON; p-value: NaN. FT GENSCAN 193 206 INTERNAL EXON; p-value: NaN. FT GENSCAN 207 226 INTERNAL EXON; p-value: NaN. FT GENSCAN 227 266 INTERNAL EXON; p-value: NaN. FT GENSCAN 267 669 INTERNAL EXON; p-value: NaN. FT GENSCAN 670 703 INTERNAL EXON; p-value: NaN. FT GENSCAN 704 750 INTERNAL EXON; p-value: NaN. FT GENSCAN 751 792 INTERNAL EXON; p-value: NaN. FT GENSCAN 793 1337 LAST EXON; p-value: NaN. SQ SEQUENCE 1337 AA; 149194 MW; 7FE119E31F423B20 CRC64; MSTTPPPSHP NPRVLQGPTK NRGNFQPIKP MKIIGQHREQ RTTVKICEMT ERRRLGEVSD KEKSLALHRF QYDFSISTQK IVCYYTYRKN KRKNEKENGM CSSPKGQTKK CEIESERVLT LEDVYCVNHE RGLMPESLHG GRHAHDPLGL AVAKMSYHVH SLGEGIVGQV AISGQHQWIF SEYLNDSHST LQVHNGWESQ ISAGIKTILI VAVGSCGVVQ LGSLCKVEED PALVTHIRHL FLALTDPLAD HASNLMQCDI NSPSDRPKIP SKCLHEASPD FSGEFDKAMD MEGLNIVSQN TSNRSNDLPY NFTPTYFHME RTAQVIGGLE AVQPSMFGSN DCVTSGFSVG VVDTKHKNQV DISDMSKVIY DEETGGYRYS RELDPNFQHY SRNHVRNSGG TSALAMESDR LKAGSSYPQL DSTVLTALKT DKDYSRRNEV FQPSESQGSI FVKDTEHRQE EKSESSQLDA LTASLCSFSG SELLEALGPA FSKTSTDYGE LAKFESAAAI RRTNDMSHSH LTFESSSENL LDAVVASMSN GDGNVRREIS SSRSTQSLLT TAEMAQAEPF GHNKQNIVST VDSVISQPPL ADGLIQQNPS NICGAFSSIG FSSTCLSSSS DQFPTSLEIP KKNKKRAKPG ESSRPRPRDR QLIQDRIKEL RELVPNGSKC SIDSLLECTI KHMLFLQSVS QHADKLTKSA SSKMQHKDTG TLGISSTEQG SSWAVEIGGH LQVCSIMVEN LDKEGVMLIE MLCEECSHFL EIANVIRSLE LIILRGTTEK QGEKTWICFV VEKIIKQCST PKLLESALAA MIKTSLNQDC RLMNQFITAC TSFKRLDLAV STMTQMQEPN VFVYNALFKG FVTCSHPIRS LELYVRMLRD SVSPSSYTYS SLVKASSFAS RFGESLQAHI WKFGFGFHVK IQTTLIDFYS ATGRIREARK VFDEMPERDD IAWTTMVSAY RRVLDMDSAN SLANQMSEKN EATSNCLING YMGLGNLEQA ESLFNQMPVK DIISWTTMIK GYSQNKRYRE AIAVFYKMME EGIIPDEVTM STVISACAHL GVLEIGKEVH MYTLQNGFVL DVYIGSALVD MYSKCGSLER ALLVFFNLPK KNLFCWNSII EGLAAHGFAQ EALKMFAKME MESVKPNAVT FVSVFTACTH AGLVDEGRRI YRSMIDDYSI VSNVEHYGGM VHLFSKAGLI YEALELIGNM EFEPNAVIWG ALLDGCRIHK NLVIAEIAFN KLMVLEPMNS GYYFLLVSMY AEQNRWRDVA EIRGRMRELG IEKICPGTSS IRIDKRDHLF AAADKSHSAS DEVCLLLDEI YDQMGLAGYV QETENVY // ID NC003070_378 HYPOTHETICAL; PRT; 244 AA. AC NC003070_378; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1883045...1883779]; Length: 735. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 244 SINGLE EXON; p-value: NaN. SQ SEQUENCE 244 AA; 27090 MW; 1077E478A2104AEB CRC64; MEYQTNFLSG EFSPENSSSS SWSSQESFLW EESFLHQSFD QSFLLSSPTD NYCDDFFAFE SSIIKEEGKE ATVAAEEEEK SYRGVRKRPW GKFAAEIRDS TRKGIRVWLG TFDTAEAAAL AYDQAAFALK GSLAVLNFPA DVVEESLRKM ENVNLNDGES PVIALKRKHS MRNRPRGKKK SSSSSTLTSS PSSSSSYSSS SSSSSLSSRS RKQSVVMTQE SNTTLVVLED LGAEYLEELM RSCS // ID NC003070_379 HYPOTHETICAL; PRT; 415 AA. AC NC003070_379; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1886548...1885832, 1885747...1885340, DE 1885267...1885145]; Length: 1248. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 239 FIRST EXON; p-value: NaN. FT GENSCAN 240 375 INTERNAL EXON; p-value: NaN. FT GENSCAN 376 415 LAST EXON; p-value: NaN. SQ SEQUENCE 415 AA; 46674 MW; A39C0435974A0B76 CRC64; MFEEIGCFDP NAPAEMTAES SFSPSEPPPT ITVIGSNSNS NCSLEDLSAF HLSPQDSSLP ASASAYAHQL HINATPNCDH QFQSSMHQTL QDPSYAQQSN HWDNGYQDFV NLGPNHTTPD LLSLLQLPRS SLPPFANPSI QDIIMTTSSS VAAYDPLFHL NFPLQPPNGS FMGVDQDQTE TNQGVNLMYD EENNNLDDGL NRKGRGSKKR KIFPTERERR VHFKDRFGDL KNLIPNPTKN DRASIVGEAI DYIKELLRTI DEFKLLVEKK RVKQRNREGD DVVDENFKAQ SEVVEQCLIN KKNNALRCSW LKRKSKFTDV DVRIIDDEVT IKIVQKKKIN CLLFVSKVVD QLELDLHHVA GAQIGEHHSF LFNAKISEGS SVYASAIADR VMEVLKKQYM EALSANNGYH CYSSD // ID NC003070_380 HYPOTHETICAL; PRT; 198 AA. AC NC003070_380; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1889509...1889641, 1890625...1891088]; DE Length: 597. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 44 FIRST EXON; p-value: NaN. FT GENSCAN 45 45 AA on splice site: g/ct -> A. FT GENSCAN 46 198 LAST EXON; p-value: NaN. SQ SEQUENCE 198 AA; 22321 MW; 34325C473DCB28DF CRC64; MGRRPCCEKI GLKKGPWSAE EDRILINYIS LHGHPNWRAL PKLAAAKLPG RTDNEIKNVW HTHLKKRLHH SQDQNNKEDF VSTTAAEMPT SPQQQSSSSA DISAITTLGN NNDISNSNKD SATSSEDVLA IIDESFWSEV VLMDCDISGN EKNEKKIENW EGSLDRNDKG YNHDMEFWFD HLTSSSCIIG EMSDISEF // ID NC003070_381 HYPOTHETICAL; PRT; 585 AA. AC NC003070_381; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1896962...1896780, 1895933...1895761, DE 1895173...1894990, 1894892...1894786, 1893576...1892466]; Length: 1758. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 61 FIRST EXON; p-value: NaN. FT GENSCAN 62 118 INTERNAL EXON; p-value: NaN. FT GENSCAN 119 119 AA on splice site: ag/g -> R. FT GENSCAN 120 180 INTERNAL EXON; p-value: NaN. FT GENSCAN 181 215 INTERNAL EXON; p-value: NaN. FT GENSCAN 216 216 AA on splice site: ag/a -> R. FT GENSCAN 217 585 LAST EXON; p-value: NaN. SQ SEQUENCE 585 AA; 65510 MW; 82C440DB21B31168 CRC64; MTDEPIKDHQ FDNGGDEAEI TDAAKRSLPI KSRKHSTMAS ITNYVRYMAH KLEYSLTLSL KKHTREKLSD RELFGVVMKN LFYGRISYLH SDKGKEMAPT MGTNESTLLV RKLPVVDTRY IFVGDAVVLK DPNETNKYIV RRLAALEGSE MVSSDEKDEP FVLEKDQCWV VAENQEMKSK EAYDSRTFGP ISMADIVGRA IYCLRTAVDH GPVSNRRTLA ILPCSSCLDH KNGRLKSVPN RSSFVCRASS GGYRRNPDFS RLNKHGYRGN NRQSGGREDF DIENSDMLSS RNGPLFNLSS SPKFQATSSP GPREKEIVEL FRKVQAQLRA RAAAKKEEKK IEEASKGQGK ESETVDSLLK LLRKHSGEQS KRQVSKFSSQ GEVQGDTVDK QDRTGNLVTS GNKDNNASSF TRPTSSFRRK SPVPRSQSPP AYSSEATFDQ SSSYSVTWTQ KKDTVELHDE PEHEPAYEHE HEPENESEPG PVTTMLEPDS ELKPESSSFY QEEEDDDVTF DVLSQDDGIL DVLSDDDESL DDADEDSDEA EEEAVKDLSE LKLVELRGIA KSRGLKGLSK MKKAELVELL GSDSS // ID NC003070_382 HYPOTHETICAL; PRT; 383 AA. AC NC003070_382; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1897566...1898051, 1898140...1898421, DE 1898602...1898985]; Length: 1152. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 162 FIRST EXON; p-value: NaN. FT GENSCAN 163 256 INTERNAL EXON; p-value: NaN. FT GENSCAN 257 383 LAST EXON; p-value: NaN. SQ SEQUENCE 383 AA; 42798 MW; 26BE6EB7B897F887 CRC64; MDKLKIAEWG EKLKTGGAQM SRMVSEKVKD MLQAPTLESK MVDEATLETL EEPNWGMNMR ICAQINNDEF NGTEIVRAIK RKISGKSPVS QRLSLELLEA CAMNCEKVFS EVASEKVLDE MVWLIKNGEA DSENRKRAFQ LIRAWGQSQD LTYLPVFHQT YMSLEGENGL HARGEENSMP GQSSLESLMQ RPVPVPPPGS YPVPNQEQAL GDDDGLDYNF GNLSIKDKKE QIEITRNSLE LLSSMLNTEG KPNHTEDDLT VSLMEKCKQS QPLIQMIIES TTDDEGVLFE ALHLNDELQQ VLSSYKKPDE TEKKASIVEQ ESSGSKDTGP KPTEQEEQEP VKKTGADDDK KHSEASGSSN KTVKEEKQAV KIELGLSSDE DEK // ID NC003070_383 HYPOTHETICAL; PRT; 987 AA. AC NC003070_383; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1900523...1901263, 1901350...1901418, DE 1901523...1902716, 1902828...1902926, 1903040...1903198, DE 1903315...1903400, 1903485...1903683, 1903762...1903872, DE 1903964...1904170, 1904484...1904582]; Length: 2964. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 247 FIRST EXON; p-value: NaN. FT GENSCAN 248 270 INTERNAL EXON; p-value: NaN. FT GENSCAN 271 668 INTERNAL EXON; p-value: NaN. FT GENSCAN 669 701 INTERNAL EXON; p-value: NaN. FT GENSCAN 702 754 INTERNAL EXON; p-value: NaN. FT GENSCAN 755 782 INTERNAL EXON; p-value: NaN. FT GENSCAN 783 783 AA on splice site: gg/g -> G. FT GENSCAN 784 849 INTERNAL EXON; p-value: NaN. FT GENSCAN 850 886 INTERNAL EXON; p-value: NaN. FT GENSCAN 887 955 INTERNAL EXON; p-value: NaN. FT GENSCAN 956 987 LAST EXON; p-value: NaN. SQ SEQUENCE 987 AA; 110443 MW; 868DD5F6597FBB79 CRC64; MESSLYDEFG NYVGPEIESD RDSDDEVEDE DLQDKHLEEN GSDGEQGPGG SNGWITTIND VEMENQIVLP EDKKYYPTAE EVYGEDVETL VMDEDEQPLE QPIIKPVRDI RFEVGVKDQA TYVSTQFLIG LMSNPALVRN VALVGHLQHG KTVFMDMLVE QTHHMSTFNA KNEKHMKYTD TRVDEQERNI SIKAVPMSLV LEDSRSKSYL CNIMDTPGHV NFSDEMTASL RLADGAVLIV DAAEGVMVNT ERAIRHAIQD HLPIVVVINK VDRLITELKL PPRDAYYKLR HTIEVINNHI SAASTTAGDL PLIDPAAGNV CFASGTAGWS FTLQSFAKMY AKLHGVAMDV DKFASRLWGD VYYHSDTRVF KRSPPVGGGE RAFVQFILEP LYKIYSQVIG EHKKSVETTL AELGVTLSNS AYKLNVRPLL RLACSSVFGS ASGFTDMLVK HIPSPREAAA RKVDHSYTGT KDSPIYESMV ECDPSGPLMV NVTKLYPKSD TSVFDVFGRV YSGRLQTGQS VRVLGEGYSP EDEEDMTIKE VTKLWIYQAR YRIPVSSAPP GSWVLIEGVD ASIMKTATLC NASYDEDVYI FRALQFNTLP VVKTATEPLN PSELPKMVEG LRKISKSYPL AITKVEESGE HTILGTGELY LDSIMKDLRE LYSEVEVKVA DPVVSFCETV VESSSMKCFA ETPNKKNKIT MIAEPLDRGL AEDIENGVVS IDWNRKQLGD FFRTKYDWDL LAARSIWAFG PDKQGPNILL DDTLPTEVDR NLMMAVKDSI VQGFQWGARE GPLCDEPIRN VKFKIVDARI APEPLHRGSG QMIPTARRVA YSAFLMATPR LMEPVYYVEI QTPIDCVTAI YTVLSRRRGH VTSDVPQPGT PAYIVKAFLP VIESFGFETD LRYHTQGQAF CLSVFDHWAI VPGDPLDKAI QLRPLEPAPI QHLAREFMVK TRRRKGMSED VSGNKFFDEA MMVELAQQTG DLHLQMI // ID NC003070_384 HYPOTHETICAL; PRT; 71 AA. AC NC003070_384; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1905524...1905530, 1906017...1906225]; DE Length: 216. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 2 FIRST EXON; p-value: NaN. FT GENSCAN 3 3 AA on splice site: t/ta -> L. FT GENSCAN 4 71 LAST EXON; p-value: NaN. SQ SEQUENCE 71 AA; 8157 MW; 62D6D48D2C047CC1 CRC64; MNLLELTSVH ECRPLVAEER FSGSSRLKKI RRELFERLKE MKGRSEGEET ILGNTLDSKR LSPGGPDPRH H // ID NC003070_385 HYPOTHETICAL; PRT; 1006 AA. AC NC003070_385; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1907625...1909721, 1909806...1909854, DE 1911397...1911566, 1911641...1911882, 1911968...1912285, DE 1912628...1912700, 1912803...1912874]; Length: 3021. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 699 FIRST EXON; p-value: NaN. FT GENSCAN 700 715 INTERNAL EXON; p-value: NaN. FT GENSCAN 716 716 AA on splice site: g/ct -> A. FT GENSCAN 717 772 INTERNAL EXON; p-value: NaN. FT GENSCAN 773 852 INTERNAL EXON; p-value: NaN. FT GENSCAN 853 853 AA on splice site: aa/g -> K. FT GENSCAN 854 958 INTERNAL EXON; p-value: NaN. FT GENSCAN 959 959 AA on splice site: tg/g -> W. FT GENSCAN 960 983 INTERNAL EXON; p-value: NaN. FT GENSCAN 984 1006 LAST EXON; p-value: NaN. SQ SEQUENCE 1006 AA; 111010 MW; 8AEF7D86DCAF0E37 CRC64; MASEPVNGGE GIDGAREKQI IKVYKRKGKG QRKQSSFFAL EAAIEKPEGL LENENDNNDV SPAETLAPEF EDPIVVVKNS IEEAALGTNS HGDKNLTEAP SENLPGDDSD KVIDKPLVEA FSQAQPQDDA SLAAMDKSEE VPSQIPKAQD DVNTVVVDEN SIKEPPKSLA QEDVTTVIVD KNPIEAPSQT LSLEDGDTLV VDKNPIEVSS EEDVHVIDAD NLIKEAHPEN FVERDTTDAQ QPAGLTSDSA HATAAGSMPM EEDADGRIRI HVASTTKQQK EEIRKKLEDQ LNVVRGMVKK IEDKEGEIGA YNDSRVLINT GINNGGGRIL SGFASAGLPR EVIRAPRPVN QLSISVLENT QGVNEHVEKE KRTPKANQFY RNSEFLLGDK LPPAESNKKS KSSSKKQGGD VGHGFGAGTK VFKNCSALLE RLMKHKHGWV FNAPVDVKGL GLLDYYTIIE HPMDLGTIKS ALMKNLYKSP REFAEDVRLT FHNAMTYNPE GQDVHLMAVT LLQIFEERWA VIEADYNREM RFVTGYEMNL PTPTMRSRLG PTMPPPPINV RNTIDRADWS NRQPTTTPGR TPTSATPSGR TPALKKPKAN EPNKRDMTYE EKQKLSGHLQ NLPPDKLDAI VQIVNKRNTA VKLRDEEIEV DIDSVDPETL WELDRFVTNY KKGLSKKKRK AELAIQARAE AERNSQQQMA PAPAAHEFSR EGGNTAASSL ADLGALVLST SDPLSKSHIS HLAFSRWRRE NLPVGSISHL PSSPARPPKP LLVATNQVPN PKDSNLPLNA HMLHNLAHVE LNAIDLAWDT VARFSPFFDL LGHNFFDDFA HVADDESRHF LWCSQRLAEL GFKYGDIPAN NLLMRECEKT SNNVAARLAC IPLVQVLPHL SDMDDTTNET PVCFNSIIHT NLVIQEARGL DAGPRLVKRL TGFGDNRTSK IVAKIAEEEV AHVAVGVDWY DPSCGTEVDK GDNEQGDKEQ LSAVYDRLTH IISMESENSS LEKPAK // ID NC003070_386 HYPOTHETICAL; PRT; 423 AA. AC NC003070_386; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1914849...1914614, 1914546...1914189, DE 1914015...1913338]; Length: 1272. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 78 FIRST EXON; p-value: NaN. FT GENSCAN 79 79 AA on splice site: ag/g -> R. FT GENSCAN 80 198 INTERNAL EXON; p-value: NaN. FT GENSCAN 199 423 LAST EXON; p-value: NaN. SQ SEQUENCE 423 AA; 48124 MW; E410984DC9F6DC74 CRC64; MVEGIPKRWK VLSGQNKWKG LLDPLDPDLR RYIIHYGEMS QVGYDAFNWD RKSRYAGDCY YSKNRLLART GFLKANPFRY KVTKYIYATA SIKLPISFIV KSLSKDASRV QTNWMGYIAV ATDQGKAMLG RRDIVVAWRG TLQPYEWAND FDFPLEPAIS VFPVTDPKDN PRIGSGWLDI YTASDSRSPY DTTSAQEQVQ GELKRLLELY KDEEISITFT GHSLGAVMSV LSAADLVYGK KNNININLQK KQVPITVFAF GSPRIGDHNF KNVVDSLQPL NILRIVNVPD VAPHYPLLLY SEIGEVLEIN TLNSTYLKRS LNFRNYHNLE IYLHGMAGMQ DTDGVFKLEI GRDISLVNKG LDALKDEYLV PSTWRCLANK GMLQMDDGTW KLDVHRRDHD DDVDADDNDD SSTSNQLQEL NTD // ID NC003070_387 HYPOTHETICAL; PRT; 33 AA. AC NC003070_387; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1916300...1916291, 1916153...1916062]; DE Length: 102. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 3 FIRST EXON; p-value: NaN. FT GENSCAN 4 4 AA on splice site: c/gt -> R. FT GENSCAN 5 33 LAST EXON; p-value: NaN. SQ SEQUENCE 33 AA; 3666 MW; AFAAFA2867CDE015 CRC64; MQKRKGTRCK TGADWLSLKI VDPSIVNFGV DNS // ID NC003070_388 HYPOTHETICAL; PRT; 343 AA. AC NC003070_388; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1916448...1916892, 1916998...1917584]; DE Length: 1032. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 148 FIRST EXON; p-value: NaN. FT GENSCAN 149 149 AA on splice site: g/ga -> G. FT GENSCAN 150 343 LAST EXON; p-value: NaN. SQ SEQUENCE 343 AA; 37771 MW; 692C03925EA988DC CRC64; MLNVLRNSNL TLAVLICFVL IASKLCSVDS SVYDPHKTLK QRFEKWLKTH SKLYGGRDEW MLRFGIYQSN VQLIDYINSL HLPFKLTDNR FADMTNSEFK AHFLGLNTSS LRLHKKQRPV CDPAGNVPDA VDWRTQGAVT PIRNQGKCGG CWAFSAVAAI EGINKIKTGN LVSLSEQQLI DCDVGTYNKG CSGGLMETAF EFIKTNGGLA TETDYPYTGI EGTCDQEKSK NKVVTIQGYQ KVAQNEASLQ IAAAQQPVSV GIDAGGFIFQ LYSSGVFTNY CGTNLNHGVT VVGYGVEGDQ KYWIVKNSWG TGWGEEGYIR MERGVSEDTG KCGIAMMASY PLQ // ID NC003070_389 HYPOTHETICAL; PRT; 343 AA. AC NC003070_389; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1919272...1918241]; Length: 1032. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 343 SINGLE EXON; p-value: NaN. SQ SEQUENCE 343 AA; 38935 MW; 7BD02BBE7C0347F0 CRC64; MAIRLTHLRK PLAYSRSFDV CIPFFRSISS FEAVEKAIKC AVETKEYLRI PELVVSLKEP YQNSTLFSFL SAFQRHHRIR VIDEILQSFV PVRPRSLPKI VYSSLLTYCL QSSDPLPLSF AILQRTLRSG CLPNPQTHLL LSDAWLERRR GSQSVADIIN EMKLIGYSPD TGTCNYLVSS LCAVDKLDEA IKVVEEMSAA GCIPDVESYG AVINSLCLAR KTTDVVKIVK EMVSKAGISP RKGMLTKVAA ALRANREIWK AIEMIEFVES RDYPVEFESY EVVVEGCLEV REYILAGKVV MRMTDRGFIP YIKVRQKVVE RLINIGEWKL ACTVRQRVSE LRS // ID NC003070_390 HYPOTHETICAL; PRT; 205 AA. AC NC003070_390; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1920943...1920326]; Length: 618. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 205 SINGLE EXON; p-value: NaN. SQ SEQUENCE 205 AA; 23286 MW; 404FD0C4FC32A342 CRC64; MQRNSNNTSI TSNISNNSSS HQACASCKHQ RKKCNNECIL SPYFPARKTK EFQAVHKVFG VSNVQKMVRT VREEDRTKLS DSLTWEALWR QKDPVLGSYG EYRRICEELK LYKSLVHNQP LIGWDNNQRV FNNNSNNKNG LAMTNSSGSG GFSVNNNGVG VNREIVNGGY ASRNVQGGWE NLKHDQRQQC YAVINNGFKQ HYLPL // ID NC003070_391 HYPOTHETICAL; PRT; 665 AA. AC NC003070_391; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1922422...1922891, 1923140...1923257, DE 1923352...1923459, 1923599...1923741, 1923821...1923920, DE 1924008...1924082, 1924192...1924322, 1924464...1924599, DE 1924685...1924804, 1924896...1925042, 1925137...1925241, DE 1925340...1925441, 1925759...1926001]; Length: 1998. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 156 FIRST EXON; p-value: NaN. FT GENSCAN 157 157 AA on splice site: tg/g -> W. FT GENSCAN 158 196 INTERNAL EXON; p-value: NaN. FT GENSCAN 197 232 INTERNAL EXON; p-value: NaN. FT GENSCAN 233 279 INTERNAL EXON; p-value: NaN. FT GENSCAN 280 280 AA on splice site: tg/g -> W. FT GENSCAN 281 313 INTERNAL EXON; p-value: NaN. FT GENSCAN 314 338 INTERNAL EXON; p-value: NaN. FT GENSCAN 339 381 INTERNAL EXON; p-value: NaN. FT GENSCAN 382 382 AA on splice site: ac/a -> T. FT GENSCAN 383 427 INTERNAL EXON; p-value: NaN. FT GENSCAN 428 467 INTERNAL EXON; p-value: NaN. FT GENSCAN 468 516 INTERNAL EXON; p-value: NaN. FT GENSCAN 517 551 INTERNAL EXON; p-value: NaN. FT GENSCAN 552 585 INTERNAL EXON; p-value: NaN. FT GENSCAN 586 665 LAST EXON; p-value: NaN. SQ SEQUENCE 665 AA; 74555 MW; 1A6202F80099E51D CRC64; MSDNRALRRA HVLANHILQS NPPSSNPSLS RELCLQYSPP ELNESYGFDV KEMRKLLDGH NVVDRDWIYG LMMQSNLFNR KERGGKIFVS PDYNQTMEQQ REITMKRIWY LLENGVFKGW LTETGPEAEL RKLALLEVCG IYDHSVSIKV GVHFFLWGNA VKFFGTKRHH EKWLKNTEDY VVKGCFAMTE LGHGSNVRGI ETVTTYDPKT EEFVINTPCE SAQKYWIGGA ANLHINGTNQ GVHAFIAQIR DQDGSICPNI RIADCGHKIG LNGVDNGRIW FDNLRIPREN LLNAVADVSS DGKYVSSIKD PDQRFGAFMA PLTSGRVTIA SSAIYSAKVG LSIAIRYSLS RRAFSVTANG PEVLLLDYPS HQRRLLPLLA KTYAMSFAAN ELKMIYVKRT PETNKAIHVV SSGFKAVLTW HNMHTLQECR EAVGGQGVKT ENLVGQLKGE FDVQTTFEGD NNVLMQQVSK ALFAEYVSCK KRNKPFKGLG LEHMNSPRPV LPTQLTSSTL RCSQFQTNVF CLRERDLLEQ FTSEVAQLQG RGESREFSFL LSHQLAEDLG KAFTEKAILQ TILDAEAKLP TGSVKDVLGL VRSMYALISL EEDPSLLRYG YLSQDNVGDV RREVSKLCGE LRPHALALVT SFGIPDSFLS PIAFNWVEAN AWSSV // ID NC003070_392 HYPOTHETICAL; PRT; 865 AA. AC NC003070_392; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1926801...1927270, 1927359...1927476, DE 1927554...1927661, 1927736...1927908, 1927955...1928093, DE 1928177...1928251, 1928362...1928492, 1928734...1928869, DE 1928958...1929077, 1929161...1929307, 1929580...1929681, DE 1929984...1930196, 1930523...1930699, 1930790...1930834, DE 1931005...1931448]; Length: 2598. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 156 FIRST EXON; p-value: NaN. FT GENSCAN 157 157 AA on splice site: tg/g -> W. FT GENSCAN 158 196 INTERNAL EXON; p-value: NaN. FT GENSCAN 197 232 INTERNAL EXON; p-value: NaN. FT GENSCAN 233 289 INTERNAL EXON; p-value: NaN. FT GENSCAN 290 290 AA on splice site: tg/g -> W. FT GENSCAN 291 336 INTERNAL EXON; p-value: NaN. FT GENSCAN 337 361 INTERNAL EXON; p-value: NaN. FT GENSCAN 362 404 INTERNAL EXON; p-value: NaN. FT GENSCAN 405 405 AA on splice site: ac/a -> T. FT GENSCAN 406 450 INTERNAL EXON; p-value: NaN. FT GENSCAN 451 490 INTERNAL EXON; p-value: NaN. FT GENSCAN 491 539 INTERNAL EXON; p-value: NaN. FT GENSCAN 540 573 INTERNAL EXON; p-value: NaN. FT GENSCAN 574 644 INTERNAL EXON; p-value: NaN. FT GENSCAN 645 703 INTERNAL EXON; p-value: NaN. FT GENSCAN 704 718 INTERNAL EXON; p-value: NaN. FT GENSCAN 719 865 LAST EXON; p-value: NaN. SQ SEQUENCE 865 AA; 97498 MW; 2D8CF19C937668B1 CRC64; MSENVELRRA HILANHILRS PRPSSNPSLT PEVCFQYSPP ELNESYGFEV KEMRKLLDGH NLEERDWLYG LMMQSNLFNP KQRGGQIFVS PDYNQTMEQQ RQISMKRIFY LLEKGVFQGW LTETGPEAEL KKFALYEVCG IYDYSLSAKL GVHFLLWGNA VKFFGTKRHH EKWLKDTEDY VVKGCFAMTE LGHGTNVRGI ETVTTYDPTT EEFVINTPCE SAQKYWIGEA ANHANHAIVI SQLSMNGTNQ GIHVFIAQIR DHDGNTCPNV RIADCGHKIG LNGVDNGRIW VCTSYGRLYH VYRFDNLRIP RENLLNSVAD VLADGKYVSS IKDPDQRFGA FLAPLTSGRV TIASSAIYSA KLGLAVAIRY SLSRRAFSVA ANGPEVLLLD YPSHQRRLLP LLAKTYAMSF AVNDLKMIYV KRTPETNKAI HVVSSGFKAV LTWHNMRTLQ ECREAVGGQG LKTENRVGHL KGEYDVQTTF EGDNNVLMQL VSKALFAEYV SCKKRNKPFK GLGLEHMNSP RPVLPTQLTS STLRCSQFQN HQLSEDLSKA FTEKAILQTV LDAEAKLPPG SVKDVLGLVR SMYALISLEE DPSLLRYGHL SRDNVGDVRK EVSKLCGELR PHALALVASF GIPDAFLSPI AFNWYHSTAF SFVTLLTNKH LRERERDRGR MKNGGEKKIT VEEYVEFCNS GNSIHFTIAY LNQILHLHGF RKLHKLQKKI VEEAVDSLDL LDLSRSTLKQ VTDSSPSSSS LTLDEVISDI EALKWQECCF TSLQIINSQE TTPSEISKPK QKSNKRKKAT MKKSLNTNFG DENDNTMMMM LPTKKMRNKK TTKNLKSIST FVKDASSASK PLSANNLSSR FSSFP // ID NC003070_393 HYPOTHETICAL; PRT; 294 AA. AC NC003070_393; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1933986...1933594, 1932277...1932157, DE 1932040...1931670]; Length: 885. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 131 FIRST EXON; p-value: NaN. FT GENSCAN 132 171 INTERNAL EXON; p-value: NaN. FT GENSCAN 172 172 AA on splice site: g/ga -> G. FT GENSCAN 173 294 LAST EXON; p-value: NaN. SQ SEQUENCE 294 AA; 33464 MW; D9CB407435B4F6EF CRC64; MEFVKGDQVE VCSKEDGFLG SYFGATVVSK TPEGSYYKIK YKNLVSDTDQ SKRLVEVISA DELRPMPPKS LHVLIRCGDK VDAFDKDGWW VGEVTAVRRN IYSVYFSTTD EELEYPLYSL RKHHEWVNGS WLLLFMNLTL NLIQLQTIEM RVHMDCVGCE SRVKNALQKM RGVDAVEIDM VQQKVTVTGY ADQKKVLKKV RKTGRRAELW QLPYNPDHMG GSSSNGGYFY NPQGCNGPIN HAAPVPTSSY NYYKHGYDSN DYSSYRHHPV HASIFSHQTG SKFSDENPNA CSIM // ID NC003070_394 HYPOTHETICAL; PRT; 458 AA. AC NC003070_394; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1941145...1940921, 1940657...1940515, DE 1940438...1940105, 1940013...1939873, 1939797...1939657, DE 1936771...1936687, 1936335...1936227, 1935707...1935509]; Length: 1377. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 75 FIRST EXON; p-value: NaN. FT GENSCAN 76 122 INTERNAL EXON; p-value: NaN. FT GENSCAN 123 123 AA on splice site: ac/a -> T. FT GENSCAN 124 234 INTERNAL EXON; p-value: NaN. FT GENSCAN 235 281 INTERNAL EXON; p-value: NaN. FT GENSCAN 282 328 INTERNAL EXON; p-value: NaN. FT GENSCAN 329 356 INTERNAL EXON; p-value: NaN. FT GENSCAN 357 357 AA on splice site: c/gt -> R. FT GENSCAN 358 392 INTERNAL EXON; p-value: NaN. FT GENSCAN 393 393 AA on splice site: ag/g -> R. FT GENSCAN 394 458 LAST EXON; p-value: NaN. SQ SEQUENCE 458 AA; 53491 MW; DA66CBEBEE0C33FC CRC64; MNPSKATGGI VEDLTKQLPK SMFLSMDADK TLLHIIQKKI TSAKNVNCIL GYCRCPRSLK SLHETDGIKF RVFHDAGNKL ISTLRRVKTC AGARVLNMQM EEVDRAFTMS GQLHLTTWTR RRTKTTPQRS MCDPIREDGS NKRGAVSKEK RPYIHREWSW ADIIRALTVI NVHFLCLLAP FNYKWEALRF GFVLYALTSL SITFSYHRNL AHRSFKLPKW LEYPLAYFAV FALQGDPLDW VSIHRFHHQF TDSDRDPHSP IEGFWFSHVW WICDTRYIKY KCGGRNNVMD LKQQWFYWFL RMTIGFHVLM FWTVLYLYGG LPYLTCGGRR WLQPKQSGEY YAKEGVFSET MAVGRCRSIT SSQIRTATHI ALKKDYCSAI SCGYLTPNTS NTRWLSLFTM GESWHNNHHA FESSARQGLE WWQIDITWYL IRLFEVLGIA TDVKLPSELQ KQKMALVR // ID NC003070_395 HYPOTHETICAL; PRT; 237 AA. AC NC003070_395; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1943849...1944220, 1944411...1944579, DE 1944747...1944919]; Length: 714. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 124 FIRST EXON; p-value: NaN. FT GENSCAN 125 180 INTERNAL EXON; p-value: NaN. FT GENSCAN 181 181 AA on splice site: g/cg -> A. FT GENSCAN 182 237 LAST EXON; p-value: NaN. SQ SEQUENCE 237 AA; 27691 MW; AAB1D1747DDB73C2 CRC64; MKKSVKVKSM TGNSTSAMSE FRNSLHRMRG SFRKLRGTMT HEQHIIRSNR QVKPRTKDLT QQKLVSAMKC SIQNIKTNIK EYSCRVKMQK RIKRKQRDLG SLTHFRLHQP LNIVGTRNYQ HSVMISCKIM KKYTDHYVGS SIKARHMKNF IQWDAMSNDR TAHLSIMINL RNTQQHRIDT AFFRFLVAIE EDAAYNELKR YRGKLLINRK REAASIFAGS RGSAKSFAGG QRKRLSR // ID NC003070_396 HYPOTHETICAL; PRT; 239 AA. AC NC003070_396; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1945123...1945807, 1946913...1946947]; DE Length: 720. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 228 FIRST EXON; p-value: NaN. FT GENSCAN 229 229 AA on splice site: g/ct -> A. FT GENSCAN 230 239 LAST EXON; p-value: NaN. SQ SEQUENCE 239 AA; 27068 MW; FA9389B4BBFB6EDA CRC64; MSTTTTTAAL PQRHQSKVDP QNVNRAVKSL LKWWDSKSKT ENSESLENDG FVYLIVTLKR IPQLDRTNPL MIPLPHPLID LVAEDPPELC LIIDDKHKNK ITKEAALKKI EAEKIPITTV IKVSKLKSDL RKLEEEKRFE LYFAERRLMP MLPKLLGKEF VKKNKTPIAI NLRHGSWKEQ IEKACESALF FVGTGTCSVV KVAKLSMGRN EIAENVVAAM NGIGDLVPAS RRRRFETPS // ID NC003070_397 HYPOTHETICAL; PRT; 371 AA. AC NC003070_397; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1948125...1948214, 1948428...1948487, DE 1948573...1948845, 1948935...1949009, 1949084...1949140, DE 1949314...1949454, 1949566...1949616, 1949724...1949819, DE 1949937...1950020, 1950148...1950249, 1950330...1950416]; Length: 1116. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 30 FIRST EXON; p-value: NaN. FT GENSCAN 31 50 INTERNAL EXON; p-value: NaN. FT GENSCAN 51 141 INTERNAL EXON; p-value: NaN. FT GENSCAN 142 166 INTERNAL EXON; p-value: NaN. FT GENSCAN 167 185 INTERNAL EXON; p-value: NaN. FT GENSCAN 186 232 INTERNAL EXON; p-value: NaN. FT GENSCAN 233 249 INTERNAL EXON; p-value: NaN. FT GENSCAN 250 281 INTERNAL EXON; p-value: NaN. FT GENSCAN 282 309 INTERNAL EXON; p-value: NaN. FT GENSCAN 310 343 INTERNAL EXON; p-value: NaN. FT GENSCAN 344 371 LAST EXON; p-value: NaN. SQ SEQUENCE 371 AA; 42129 MW; A02C996652E7F5FD CRC64; MSAAVIEGND AVTGHIISTT IGGKNGEPKQ TISYMAERVV GTGSFGIVFQ AKCLETGESV AIKKVLQDRR YKNRELQLMR PMDHPNVISL KHCFFSTTSR DELFLNLVME YVPETLYRVL RHYTSSNQRM PIFYVKLYTY QIFRGLAYIH TVPGVCHRDV KPQNLLVDPL THQVKLCDFG SAKVLVKGEP NISYICSRYY RAPELIFGAT EYTASIDIWS AGCVLAELLL GQPLFPGENS VDQLVEIIKV LGTPTREEIR CMNPNYTDFR FPQIKAHPWH KVFHKRMPPE AIDLASRLLQ YSPSLRCTAL EACAHPFFNE LREPNARLPN GRPLPPLFNF KQELGGASME LINRLIPEHV RRQMSTGLQN S // ID NC003070_398 HYPOTHETICAL; PRT; 243 AA. AC NC003070_398; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1953924...1953830, 1952671...1952465, DE 1951517...1951088]; Length: 732. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 31 FIRST EXON; p-value: NaN. FT GENSCAN 32 32 AA on splice site: at/a -> I. FT GENSCAN 33 100 INTERNAL EXON; p-value: NaN. FT GENSCAN 101 101 AA on splice site: ag/a -> R. FT GENSCAN 102 243 LAST EXON; p-value: NaN. SQ SEQUENCE 243 AA; 27123 MW; B241A622FCA2A13B CRC64; MSPLRARRSA WRIGHVIWRK NVRTRDAVCD AIADEEYDYL FKLVLIGDSG VGKSNLLSRF TKNEFNLESK STIGVEFATK TTKVEGKVVK AQIWDTAGQE RYRAITSAYY RGAVGALLIY DVTRHATFEN AARWLRELRG HTDPNIVVML IGNKCDLRHL VAVKTEEAKA FAERESLYFM ETSALDATNV ENAFTEVLTQ IHKIVSKRSV DGGGESADLP GKGETINVKE DGSVLKRMGC CSN // ID NC003070_399 HYPOTHETICAL; PRT; 714 AA. AC NC003070_399; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1955412...1956970, 1957476...1957749, DE 1957841...1958152]; Length: 2145. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 519 FIRST EXON; p-value: NaN. FT GENSCAN 520 520 AA on splice site: cg/g -> R. FT GENSCAN 521 611 INTERNAL EXON; p-value: NaN. FT GENSCAN 612 714 LAST EXON; p-value: NaN. SQ SEQUENCE 714 AA; 80993 MW; 125880FC6AA456E0 CRC64; MISRSYTNLL DLASGNFPVM GRERRRLPRV MTVPGNVSEF DEDQAYSVSS DNPSSVSSDR MIIVANRLPL KAEKRNGSWS FSWDQDSLYL QLKDGLPEDM EILYVGSLSV DVDSNEQDDV AQILLDKFKC VPTFFPPDLQ SKFYDGFCKR QIWPLFHYML PFSADHGGRF DRSLWEAYVA TNKLFFQKVI EVINPDDDFV WIHDYHLMVL PTFLRRRFNR IRMGFFLHSP FPSSEIYRSL PVREEILKAL LNSDLIGFHT FDYARHFLTC CSRMLGLEYQ SKRGYIGLEY YGRTVGIKIM PVGINMGRIQ SVMRYSEEEG KVMELRNRFE GKTVLLGIDD MDIFKGINLK LLAMEQMLRQ HPNWRGRAVL VQIVNPARGK GIDVEEIRGE IEESCRRING EFGKPGYQPI IYIDTPVSIN EINAYYHIAE CVVVTAVRDG MNLTPYEYIV CRQGLLGSES DFSGPKKSML VASEFIGCSP SLSGAIRVNP WNVEATGEAL NEALSMSDAE KQLRHEKHFR WSGSEEWETC GQSSDFGWMQ IVEPVMKQYT ESTDGSSIEI KESALVWQYR DADPGFGSLQ AKEMLEHLES VLANEPVAVK SGHYIVEVKP QGVSKGSVSE KIFSSMAGKG KPVDFVLCIG DDRSDEDMFE AIGNAMSKRL LCDNALVFAC TVGQKPSKAK YYLDDTTEVT CMLESLAEAS EASNFSMREL DEAL // ID NC003070_400 HYPOTHETICAL; PRT; 221 AA. AC NC003070_400; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1958986...1959651]; Length: 666. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 221 SINGLE EXON; p-value: NaN. SQ SEQUENCE 221 AA; 25852 MW; C2D2278225F5FB20 CRC64; MEDHEKIDGK KKKKKSVALI PANYVSILQL QERWLKEKEK KQKEKDFVER GVKQQVDQRQ RRREEEENVV KAMETKVKLE EHSLSGGVRM HCSVNRWKRD QVCVKKEEIK VSGIVSNKDE DGVDSREKKK KNPVKENTRR VFKSKGENAA KEVTQCWIKK KVEEERETSE VKGTARLISK QGYYQNKRHD WSSTRVIRAT TSTMVWVKKG KKDGAVGENK V // ID NC003070_401 HYPOTHETICAL; PRT; 685 AA. AC NC003070_401; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1962524...1961373, 1961282...1960969, DE 1960894...1960654, 1960563...1960213]; Length: 2058. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 384 FIRST EXON; p-value: NaN. FT GENSCAN 385 488 INTERNAL EXON; p-value: NaN. FT GENSCAN 489 489 AA on splice site: gg/a -> G. FT GENSCAN 490 569 INTERNAL EXON; p-value: NaN. FT GENSCAN 570 685 LAST EXON; p-value: NaN. SQ SEQUENCE 685 AA; 73200 MW; FF935260BD3E0FBD CRC64; MAASSACLLG NGLSVYTTKQ RFQKLGLDRT SKVTVVKASL DEKKHEGRRG FFKLLLGNAA AGVGLLASGN ANADEQGQGV SSSRMSYSRF LEYLDKGRVE KVDLYENGTI AIVEAVSPEL GNRIQRVRVQ LPGLSQELLQ KLRAKNIDFA AHNAQEDQGS PILNLIGNLA FPVILIGGLF LLSRRSSGGM GGPGGPGFPL QIGQSKAKFQ MEPNTGVTFD DVAGVDEAKQ DFMEVVEFLK KPERFTAVGA RIPKGVLLVG PPGTGKTLLA KAIAGEAGVP FFSISGSEFV EMFVGVGASR VRDLFKKAKE NAPCIVFVDE IDAVGRQRGT GIGGGNDERE QTLNQLLTEM DGFEGNTGVI VVAATNRADI LDSALLRPGR FDRQVSVDVP DVKGRTDILK VHSGNKKFES GVSLEVIAMR TPGFSGADLA NLLNEAAILA GRRGKTAISS KEIDDSIDRI VAGMEGTVMT DGKSKSLVAY HEVGHAICGT LTPGHDAVQK VTLIPRGQAR GLTWFIPSDD PTLISKQQLF ARIVGGLGGR AAEEVIFGES EVTTGAVSDL QQITGLAKQM VTTFGMSEIG PWSLMDSSEQ SDVIMRMMAR NSMSEKLAND IDTAVKTLSD KAYEIALSQI RNNREAMDKI VEILLEKETM SGDEFRAILS EFTEIPPENR VASSTSTSTP TPASV // ID NC003070_402 HYPOTHETICAL; PRT; 390 AA. AC NC003070_402; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1964904...1963732]; Length: 1173. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 390 SINGLE EXON; p-value: NaN. SQ SEQUENCE 390 AA; 44693 MW; EC60036DBE732B7B CRC64; MRFLLMRSLL VHPCGGSIIN VRSTTTSAQY VASRSRDPVF EKLMDKYKNL LKVIAIQDLT LANPTADPPS LSIEFLSRLS QKLHLNRGAA SFLRKYPHIF HVLYDPVKAE PFCRLTDVAM EISRQEALAI TATLSLVVDR LVRLLSMSIS KSIPLRAVFK VWRELGLPDD FEDSVISKNP HLFKLSDGHE SNTHILELVQ EEEKRLEFEA AVEKWRVVEC SKEDCSVDRT EIQFSFKHSY PPGMRLSKTF KAKVKEWQRL PYVGPYEDMV GKKKSRSGVM GIEKRAVAIA HEFLNLTVEK MVEVEKISHF RKCFGIDLNI RDLFLDHPGM FYVSTKGKRH TVFLREAYER GRLIDPNPVY DARRKLLDLV LLGRHAALSE SGNTSMSEQE // ID NC003070_403 HYPOTHETICAL; PRT; 360 AA. AC NC003070_403; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1965923...1967005]; Length: 1083. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 360 SINGLE EXON; p-value: NaN. SQ SEQUENCE 360 AA; 40479 MW; 230C350A7894E54D CRC64; MNEIHQPLAR RVWRSNVDEE MARMAECLKR FPLIAFDTEY PGIIFRTYFD SSSDECYRAM KGNVENTKLI QCGFTLFNAK GEIGGVWEIN FSNFGDPSDT RNELSIEFLR RHGLDLQKIR DEGVDMFGYG FFPKLMTVFR SQKHVEFVTF QGAYDFAYFL SILNHGKLPE THGEFATEVV KVFGQVYDTK VMAGFCEGLG EHLGLSKLAQ LLQITRVGRA HHAGSDSLMT ALVFIKLKHV YEDSRFARGL IYGIGKSNLV AAPAPAPVPE PTLPLMCQQN VASYPVFHNG YVQNYEQPQL VSYDPSGAPW AFCNATGTYV QLTHLPASTF AYPSQTPSAT VDYLGPVPNY YNNNACYVVE // ID NC003070_404 HYPOTHETICAL; PRT; 494 AA. AC NC003070_404; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1967688...1967779, 1968274...1968410, DE 1969265...1969393, 1970675...1970996, 1971082...1971251, DE 1971477...1971626, 1972387...1972571, 1972710...1972796, DE 1973222...1973316, 1973422...1973539]; Length: 1485. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 30 FIRST EXON; p-value: NaN. FT GENSCAN 31 31 AA on splice site: cg/a -> R. FT GENSCAN 32 76 INTERNAL EXON; p-value: NaN. FT GENSCAN 77 77 AA on splice site: g/gc -> G. FT GENSCAN 78 119 INTERNAL EXON; p-value: NaN. FT GENSCAN 120 120 AA on splice site: g/tt -> V. FT GENSCAN 121 226 INTERNAL EXON; p-value: NaN. FT GENSCAN 227 227 AA on splice site: ct/g -> L. FT GENSCAN 228 283 INTERNAL EXON; p-value: NaN. FT GENSCAN 284 284 AA on splice site: g/tt -> V. FT GENSCAN 285 333 INTERNAL EXON; p-value: NaN. FT GENSCAN 334 334 AA on splice site: g/ga -> G. FT GENSCAN 335 395 INTERNAL EXON; p-value: NaN. FT GENSCAN 396 424 INTERNAL EXON; p-value: NaN. FT GENSCAN 425 455 INTERNAL EXON; p-value: NaN. FT GENSCAN 456 456 AA on splice site: aa/a -> K. FT GENSCAN 457 494 LAST EXON; p-value: NaN. SQ SEQUENCE 494 AA; 55005 MW; 384C7FB69C3FFC0B CRC64; MISKDRSFGA CINFLTDRCA STCHSQAAAE RFGDKEGKKM SSFAGLEKCG AAYAGEEESK PVVLVAENLV DAAASCGLQI RKRETKREGE YQGDERCGGG GIVTGCREVS GEGSEPASSV VCEWKPKRFA GEFRLLMEQR VQLRVGTMET ISNEGDVDRE QVLETFGIEN ETGKETNGSR SFDVGYSSGD TLETLPKASK VDISPADVLK TLFFILVWYT FSTFLTLYNK TLLGDDLGKF PAPLLMNTIH FSIQAVLSKM ITWYWSGRFQ PDVTISWRDY FVRVVPTALG TAMDINLSNE SLVFISVTFA TMVSYFIYYG RSMQLLWNLR VKQGLKNPFI FMSCVAPVMA IATGLLSLLL DPWSEFRDNK YFDSGAHFAR TCFLMLFGGA LAFCMVLTEY VLVSVTSAVT VTIAGVVKEA VTIVVAVFYF HDEFTWLKGV GLMIIMVGVS LFNWYKYDKL QKGHKTEEEK QLQAPSQTGK YVILDEMDDQ ENSP // ID NC003070_405 HYPOTHETICAL; PRT; 93 AA. AC NC003070_405; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1974272...1973991]; Length: 282. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 93 SINGLE EXON; p-value: NaN. SQ SEQUENCE 93 AA; 10722 MW; D50B4B323D3FBF08 CRC64; MGVMKKWASL KKKKLQQEEE QVTAKETQTW QRLRNLFSTS SSSSSSSAKW KRVEIIMLTE IVDGVVYKVM YVVEAFVLVS TLCFFYLCCG CHI // ID NC003070_406 HYPOTHETICAL; PRT; 1673 AA. AC NC003070_406; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1978761...1978995, 1979084...1979184, DE 1979321...1979433, 1979530...1979617, 1979707...1979817, DE 1979897...1980049, 1980145...1980234, 1980811...1980945, DE 1981453...1981575, 1981658...1981775, 1981857...1982241, DE 1982905...1982974, 1983256...1983420, 1983813...1983976, DE 1984078...1984201, 1984306...1984406, 1984558...1984669, DE 1984749...1984900, 1985073...1985182, 1985259...1985378, DE 1985688...1985877, 1985958...1986069, 1986230...1986352, DE 1986427...1986545, 1986643...1986726, 1986913...1987025, DE 1987104...1987333, 1987435...1988232, 1988319...1988369, DE 1988524...1988709, 1988800...1989045]; Length: 5022. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 78 FIRST EXON; p-value: NaN. FT GENSCAN 79 79 AA on splice site: t/gt -> C. FT GENSCAN 80 112 INTERNAL EXON; p-value: NaN. FT GENSCAN 113 149 INTERNAL EXON; p-value: NaN. FT GENSCAN 150 150 AA on splice site: cc/g -> P. FT GENSCAN 151 179 INTERNAL EXON; p-value: NaN. FT GENSCAN 180 216 INTERNAL EXON; p-value: NaN. FT GENSCAN 217 267 INTERNAL EXON; p-value: NaN. FT GENSCAN 268 297 INTERNAL EXON; p-value: NaN. FT GENSCAN 298 342 INTERNAL EXON; p-value: NaN. FT GENSCAN 343 383 INTERNAL EXON; p-value: NaN. FT GENSCAN 384 422 INTERNAL EXON; p-value: NaN. FT GENSCAN 423 423 AA on splice site: g/cg -> A. FT GENSCAN 424 550 INTERNAL EXON; p-value: NaN. FT GENSCAN 551 551 AA on splice site: aa/a -> K. FT GENSCAN 552 574 INTERNAL EXON; p-value: NaN. FT GENSCAN 575 629 INTERNAL EXON; p-value: NaN. FT GENSCAN 630 683 INTERNAL EXON; p-value: NaN. FT GENSCAN 684 684 AA on splice site: ag/g -> R. FT GENSCAN 685 725 INTERNAL EXON; p-value: NaN. FT GENSCAN 726 758 INTERNAL EXON; p-value: NaN. FT GENSCAN 759 759 AA on splice site: ga/a -> E. FT GENSCAN 760 796 INTERNAL EXON; p-value: NaN. FT GENSCAN 797 846 INTERNAL EXON; p-value: NaN. FT GENSCAN 847 847 AA on splice site: ag/t -> S. FT GENSCAN 848 883 INTERNAL EXON; p-value: NaN. FT GENSCAN 884 884 AA on splice site: g/ag -> E. FT GENSCAN 885 923 INTERNAL EXON; p-value: NaN. FT GENSCAN 924 924 AA on splice site: g/ct -> A. FT GENSCAN 925 986 INTERNAL EXON; p-value: NaN. FT GENSCAN 987 987 AA on splice site: aa/g -> K. FT GENSCAN 988 1024 INTERNAL EXON; p-value: NaN. FT GENSCAN 1025 1065 INTERNAL EXON; p-value: NaN. FT GENSCAN 1066 1104 INTERNAL EXON; p-value: NaN. FT GENSCAN 1105 1105 AA on splice site: ag/t -> S. FT GENSCAN 1106 1132 INTERNAL EXON; p-value: NaN. FT GENSCAN 1133 1133 AA on splice site: ag/g -> R. FT GENSCAN 1134 1170 INTERNAL EXON; p-value: NaN. FT GENSCAN 1171 1171 AA on splice site: g/gg -> G. FT GENSCAN 1172 1247 INTERNAL EXON; p-value: NaN. FT GENSCAN 1248 1513 INTERNAL EXON; p-value: NaN. FT GENSCAN 1514 1530 INTERNAL EXON; p-value: NaN. FT GENSCAN 1531 1592 INTERNAL EXON; p-value: NaN. FT GENSCAN 1593 1673 LAST EXON; p-value: NaN. SQ SEQUENCE 1673 AA; 193510 MW; 963A115C7FB26213 CRC64; MASTSSGGRG EDGRPPQMQP VRSMSRKMTR AGTMMIEHPN EDERPIDSEL VPSSLASIAP ILRVANDIDQ DNARVAYLCR FHAFEKAHRM DPTSSGRGVR QFKTYLLHKL EEEEEITEHM LAKSDPREIQ LYYQTFYENN IQDGEGKKTP EEMAKLYQIA TVLYDVLKTV VPQARIDDKT LRYAKEVERK KEQYEHYNIL PLYALGAKTA VMELPEIKAA ILAVCNVDNL PRPRFHSASA NLDEVDRERG RSFNDILEWL ALVFGFQRGN VANQREHLIL LLANIDVRKR DLENYVEMAN EVHGILFGNV YPVTGDTYEA GAPDEEAFLR NVITPIYQVL RKRHDQVSHG KRKPKTNFVE ARTFWNLYRS FDRMWMFLVL SLQTMIIVAW HPSGSILAIF TEDVFRNVLT IFITSAFLNL LQATLDLVLS FGAWKSLKFS QIMRYITKFL MAAMWAIMLP ITYSKSVQNP TGLIKFFSSW VGSWLHRSLY DYAIALYVLP NILAAVFFLL PPLRRIMERS NMRIVTLIMW WAQPKLYIGR GMHEEMFALF KIYDIHAATH NIGVIIAIWG PIVLIRTLGM LRSRFKVVPS AFCSKLTPLP LGHAKRKHLV LATHTNLIDK IVYANKFRFI PIALDMAKDF KGKEDVDLFK KIKSEYYMHY AVVEAYETVR DIIYGLLQDE SDKRIVREIC YEVDISIQQH RFLSEFRMTG MPLLSDKLEK FLKILLSDYE EDDYKSQIIN VLQDIIEIIT QDVMVNGHEI LERAHLQSGD IESDKKEQRF EKIDLSLTQN ISWREKVVRL LLLLTVKESA INIPQSLEAR RRMTFFANSL FMNMPDAPRV RDMLSFSVLT PYYKEDVLYS EEELNKENED GITILFYLQR IYPEEWSNYC ERVNDLKRNL SEKDKAEQLR QWVSYRGQTL SRTATNGGYL PSESNEDDRK AFSDRARALA DLKFTYVVSC QVYGNQKKSS ESRDRSCYNN ILQLMLKYPS LRVAYIDERE ETVNGKSQKV FYSVLLKGCD KLDEEIYRIK LPGPPTEIGE GKPENQNHAI IFTRGEALQT IDMNQDNYFE ECFKMRNVLQ EFDEGRRGKR NPTILGLREH IFTGSVSSLA WFMSNQETSF VTIGQRVLAN PLRVRFHYGH PDIFDRIFHI TRGGISKASK IINLSEDIFA GYNSTLRGGY VTHHEYIQAG KGRDVGMNQI SFFEAKVANG NGEQTLSRDV YRLGRRFDFY RMLSFYFTTV GFYFSSMITV LTVYVFLYGR LYLVLSGLEK NILQSASVHE SNALEQALAA QSVFQLGFLM VLPMVMEIGL EKGFRTALGD FIIMQLQLAS VFFTFQLGTK AHYFGRTILH GGSKYRATGR GFVVFHAKFA ENYRLYSRSH FVKGLELVIL LVVYQVYGTS YRSSSTYMYI TFSMWFLVTS WLFAPFIFNP SGFEWQKTVD DWTDWKRWMG NRGGIGIVLD KSWESWWDIE QEHLKHTNLR GRVLEILLAL RFLLYQYGIV YHLNIARRHT TFLVYGLSWA ILLSVLLVLK MVSMGRRKFG TDFQVMFRIL KALLFLGFLS VMTVLFVVCG LTISDLFASI LAFLPTGWAI LLIGQALRSV FKGLGFWDSV KELGRAYEYI MGLVIFTPIA VLSWFPFVSE FQTRLLFNQA FSRGLQISMI LAGKKDKETP STK // ID NC003070_407 HYPOTHETICAL; PRT; 135 AA. AC NC003070_407; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1990178...1990585]; Length: 408. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 135 SINGLE EXON; p-value: NaN. SQ SEQUENCE 135 AA; 14557 MW; 666BC06E9A84E374 CRC64; MHSTVQHGGN KSGKSNVWAN TNLAKTVAAV DEFKFGFPSG GLTTVSNKWW GRAEKGGRED GGGENTENGH VAACDETQNS LVAIRKRIAE EGREAVELGL HQGFGSKRPG KRDQALLFQI FNSAMPKDWV TPDSS // ID NC003070_408 HYPOTHETICAL; PRT; 265 AA. AC NC003070_408; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1992388...1992544, 1992706...1992905, DE 1993023...1993463]; Length: 798. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 52 FIRST EXON; p-value: NaN. FT GENSCAN 53 53 AA on splice site: g/ca -> A. FT GENSCAN 54 119 INTERNAL EXON; p-value: NaN. FT GENSCAN 120 265 LAST EXON; p-value: NaN. SQ SEQUENCE 265 AA; 29538 MW; 3B8BD7854713251C CRC64; MVTTTIGGVP VVIAQSRRIP TSLRCFSATA NSDLLRSQLD RLHAEAESTR AKANSNRLRL LRLSEAAENL REQAAVNVRT GKENDARDLL LQKKKVMQAL DKAKARIELL DTLSSKLNEA ISVKETQLIG NISLDLEEDG ENTSGGIHIV SPKPESTEDG VENDHTHLDS EGIQLIERNV EDYQELLDTN NNVLEDVSIG SILKEVSSYE SFLENLDQKL SRIEAELVTV VNVASLVLNH EDKPKNLKVQ QTAEILEEIR RVRER // ID NC003070_409 HYPOTHETICAL; PRT; 585 AA. AC NC003070_409; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1996066...1995179, 1995038...1994169]; DE Length: 1758. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 296 FIRST EXON; p-value: NaN. FT GENSCAN 297 585 LAST EXON; p-value: NaN. SQ SEQUENCE 585 AA; 66516 MW; 3F7FDCE161C28EAF CRC64; MVLPELLVIL AEWVLYRLLA KSCYRAARKL RGYGFQLKNL LSLSKTQSLH NNSQHHLHNH HQQNHPNQTL QDSLDPLFPS LTKYQELLLD KNRACSVSSD HYRDTFFCDI DGVLLRQHSS KHFHTFFPYF MLVAFEGGSI IRAILLLLSC SFLWTLQQET KLRVLSFITF SGLRVKDMDN VSRSVLPKFF LENLNIQVYD IWARTEYSKV VFTSLPQVLV ERFLREHLNA DDVIGTKLQE IKVMGRKFYT GLASGSGFVL KHKSAEDYFF DSKKKPALGI GSSSSPQDHI FISICKEAYF WNEEESMSKN NALPRERYPK PLIFHDGRLA FLPTPLATLA MFIWLPIGFL LAVFRISVGV FLPYHVANFL ASMSGVRITF KTHNLNNGRP EKGNSGVLYV CNHRTLLDPV FLTTSLGKPL TAVTYSLSKF SEFIAPLKTV SLKRDRKKDG EAMQRLLSKG DLVVCPEGTT CREPYLLRFS PLFAELTEDI VPVAVDARVS MFYGTTASGL KCLDPIFFLM NPRPVYCLEI LKKLPKEMTC AGGKSSFEVA NFIQGELARV LGFECTNLTR RDKYLVLAGN EGIVR // ID NC003070_410 HYPOTHETICAL; PRT; 60 AA. AC NC003070_410; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[1999010...1999064, 1999802...1999929]; DE Length: 183. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 18 FIRST EXON; p-value: NaN. FT GENSCAN 19 19 AA on splice site: c/tt -> L. FT GENSCAN 20 60 LAST EXON; p-value: NaN. SQ SEQUENCE 60 AA; 7158 MW; 8FA0C6E288CAB426 CRC64; MEMDNIVIKK ESLSFTRTLS AQRKLKTRNI LLPIEDWSSV DSQWNMTCDI NQYMELNTIY // ID NC003070_411 HYPOTHETICAL; PRT; 113 AA. AC NC003070_411; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[2003392...2003051]; Length: 342. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 113 SINGLE EXON; p-value: NaN. SQ SEQUENCE 113 AA; 11981 MW; 8185B1BA517C340A CRC64; MADSLLQKAT SALGEAKQTV MASAETAKTN VVKDAVDNVV SRGIDGAKTL LHGLEEKKDE VSSKIMGAVT HFTGSADSAA TTANRDLPVS TDNQVPIQTP KENFIFLYLL CQN // ID NC003070_412 HYPOTHETICAL; PRT; 358 AA. AC NC003070_412; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[2006563...2006531, 2006433...2006350, DE 2006115...2006052, 2005959...2005886, 2005633...2005549, DE 2005470...2005385, 2005169...2005048, 2004926...2004866, DE 2004785...2004687, 2004450...2004349, 2004299...2004192, DE 2004096...2003987, 2003881...2003833]; Length: 1077. OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; core OC eudicots; Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; CC -------------------------------------------------------------------------- CC This entry is NOT a real protein sequence. It is a hypothetical protein CC obtained by genscan from a DNA sequence. It is known to be error prone. CC -------------------------------------------------------------------------- DR RefSeq; NC_003070; -. FT GENSCAN 1 11 FIRST EXON; p-value: NaN. FT GENSCAN 12 39 INTERNAL EXON; p-value: NaN. FT GENSCAN 40 60 INTERNAL EXON; p-value: NaN. FT GENSCAN 61 61 AA on splice site: a/ag -> K. FT GENSCAN 62 85 INTERNAL EXON; p-value: NaN. FT GENSCAN 86 113 INTERNAL EXON; p-value: NaN. FT GENSCAN 114 114 AA on splice site: g/ga -> G. FT GENSCAN 115 142 INTERNAL EXON; p-value: NaN. FT GENSCAN 143 182 INTERNAL EXON; p-value: NaN. FT GENSCAN 183 183 AA on splice site: aa/g -> K. FT GENSCAN 184 203 INTERNAL EXON; p-value: NaN. FT GENSCAN 204 236 INTERNAL EXON; p-value: NaN. FT GENSCAN 237 270 INTERNAL EXON; p-value: NaN. FT GENSCAN 271 306 INTERNAL EXON; p-value: NaN. FT GENSCAN 307 342 INTERNAL EXON; p-value: NaN. FT GENSCAN 343 343 AA on splice site: ag/g -> R. FT GENSCAN 344 358 LAST EXON; p-value: NaN. SQ SEQUENCE 358 AA; 40332 MW; D61671FAAFE253DA CRC64; MAQEGQNIDE PVVIGEEKGS VRLTTLNRPR QLNVISPEVG TGRAFSAGGD LKVFYHGQES KDSCLEVVYR MYWLCYHIHT YKKTQVFATP EASFGFHTDC GFSYIHSRLP GHLGEFLALT GARLNGKELV AIGMATHFVP SGKLMDLEAR LVSLDSGDAD VVQSTIEEFS EKVNLDKDSI LNKQSVINEC FSKESVKQII QAFEAEASKD GNEWITPVIK GLKRSSPTGL KIVLQSIREG RKQTLSDCLK KEFRLTLNIL RKTISPDMYE ENRRIMLLSL YILFLILCYN SQGIRALTID KDNSPKWNPA TLDEVDDEKI NSVFKLFEDD DIELQIPETE ENRWGGKYET SGYASVRG // ID NC003070_413 HYPOTHETICAL; PRT; 1636 AA. AC NC003070_413; DT 30-Apr-2008 (Rel. 19.00, Created) DT 30-Apr-2008 (Rel. 19.00, Last sequence update) DT 30-Apr-2008 (Rel. 19.00, Last annotation update) DE Chromosome: 1; NC_003070[2024504...2024334, 2024021...2023862, DE 2023776...2023655, 2023283...2023049, 2022243...2022093, DE 2021905...2021776, 2021404...2021372, 2020996...2020748, DE 2019944...2019682, 2018929...2018870, 2018718...2018