>HEM1 GTTTCTTCGAATTCGCGGCCGCTTCTAGAGAATTTTATTATATAGTTTAAGGGATAATATTTTATTAATATTTTTTTTATTTATTTATTTAATTATATTATATATATAATATATATATAACAATAAATTTATGCAACGTTCTATATTCGCCAGATTTGGTAACAGTAGTGCTGCTGTGTCCACTTTAAACAGACTGTCTACGACGGCTGCGCCTCATGCTAAAAACGGCTACGCAACTGCTACTGGCGCCGGTGCAGCAGCAGCTACAGCAACCGCATCCTCTACACATGCAGCTGCCGCCGCAGCCGCCGCTGCCAACCATTCTACACAAGAATCTGGCTTCGATTATGAGGGACTAATTGATTCAGAATTGCAGAAAAAGCGTTTGGACAAATCTTATAGATACTTTAACAACATTAACAGGCTGGCCAAAGAGTTCCCATTAGCCCATAGACAGAGAGAAGCCGATAAGGTTACGGTCTGGTGCTCAAATGATTATCTTGCTTTATCCAAACATCCAGAGGTGCTTGATGCTATGCACAAAACAATCGATAAATACGGATGCGGTGCTGGTGGTACTCGTAACATTGCCGGACATAACATACCAACCTTGAACTTAGAAGCAGAATTGGCTACGTTACACAAAAAGGAGGGTGCTTTGGTATTTTCTTCATGTTATGTGGCCAATGATGCAGTTCTGAGCTTATTGGGCCAGAAGATGAAAGATCTAGTGATCTTCAGCGATGAACTAAACCACGCCTCCATGATTGTTGGGATCAAACATGCAAATGTTAAGAAACACATTTTTAAGCACAATGATCTGAATGAGCTTGAACAACTGTTACAATCATATCCCAAATCAGTACCTAAATTGATTGCTTTTGAATCTGTGTATTCGATGGCGGGCAGTGTTGCGGATATAGAAAAGATTTGCGACCTTGCCGATAAGTATGGTGCTCTAACCTTTTTAGATGAAGTTCATGCCGTTGGTTTGTACGGCCCTCACGGCGCTGGTGTTGCCGAACATTGTGACTTTGAATCACATAGAGCTTCCGGTATTGCGACGCCAAAAACTAATGATAAGGGTGGTGCTAAAACTGTCATGGATAGAGTGGATATGATTACTGGTACATTAGGGAAATCCTTTGGCTCTGTAGGAGGATATGTTGCGGCGTCAAGAAAACTTATTGACTGGTTTAGATCTTTCGCCCCAGGTTTCATTTTTACAACAACTTTGCCCCCATCAGTGATGGCAGGGGCCACAGCCGCCATAAGATATCAAAGGTGTCATATTGATCTACGTACATCTCAACAAAAGCACACTATGTACGTCAAAAAGGCATTTCATGAGCTGGGTATTCCTGTGATCCCGAATCCTTCTCATATTGTACCTGTGTTGATAGGTAACGCTGATCTAGCAAAACAAGCCTCCGATATTCTGATCAACAAGCATCAAATTTATGTCCAAGCAATCAACTTTCCAACAGTTGCTAGAGGAACTGAAAGGCTGAGAATAACGCCTACTCCAGGTCACACTAACGATTTGAGTGATATTTTGATTAATGCCGTGGATGATGTCTTTAACGAATTGCAATTACCCAGAGTTAGAGACTGGGAGAGTCAAGGTGGTCTTCTGGGGGTGGGAGAAAGCGGCTTTGTTGAAGAATCTAATTTATGGACGAGCAGTCAATTATCCTTGACTAACGATGACCTGAATCCTAATGTCAGAGATCCGATCGTAAAGCAATTAGAAGTTTCGTCTGGGATAAAACAGTATCCATACGATGTACCCGATTACGCGTGAAATTATATATCTAAATGATTAATATATATATTATTAATAATTAACAATAATTAATATATTATAATTTATATATATATATTTTATATTATTATTACTAGTAGCGGCCGCTGCAGGAAGAAAC Translation Map New AA Segment 1 ATGCAACGTTCTATATTCGCCAGATTTGGTAACAGTAGTGCTGCTGTGTCCACTTTAAAC 1 M Q R S I F A R F G N S S A A V S T L N 61 AGACTGTCTACGACGGCTGCGCCTCATGCTAAAAACGGCTACGCAACTGCTACTGGCGCC 21 R L S T T A A P H A K N G Y A T A T G A 121 GGTGCAGCAGCAGCTACAGCAACCGCATCCTCTACACATGCAGCTGCCGCCGCAGCCGCC 41 G A A A A T A T A S S T H A A A A A A A 181 GCTGCCAACCATTCTACACAAGAATCTGGCTTCGATTATGAGGGACTAATTGATTCAGAA 61 A A N H S T Q E S G F D Y E G L I D S E 241 TTGCAGAAAAAGCGTTTGGACAAATCTTATAGATACTTTAACAACATTAACAGGCTGGCC 81 L Q K K R L D K S Y R Y F N N I N R L A 301 AAAGAGTTCCCATTAGCCCATAGACAGAGAGAAGCCGATAAGGTTACGGTCTGGTGCTCA 101 K E F P L A H R Q R E A D K V T V W C S 361 AATGATTATCTTGCTTTATCCAAACATCCAGAGGTGCTTGATGCTATGCACAAAACAATC 121 N D Y L A L S K H P E V L D A M H K T I 421 GATAAATACGGATGCGGTGCTGGTGGTACTCGTAACATTGCCGGACATAACATACCAACC 141 D K Y G C G A G G T R N I A G H N I P T 481 TTGAACTTAGAAGCAGAATTGGCTACGTTACACAAAAAGGAGGGTGCTTTGGTATTTTCT 161 L N L E A E L A T L H K K E G A L V F S 541 TCATGTTATGTGGCCAATGATGCAGTTCTGAGCTTATTGGGCCAGAAGATGAAAGATCTA 181 S C Y V A N D A V L S L L G Q K M K D L 601 GTGATCTTCAGCGATGAACTAAACCACGCCTCCATGATTGTTGGGATCAAACATGCAAAT 201 V I F S D E L N H A S M I V G I K H A N 661 GTTAAGAAACACATTTTTAAGCACAATGATCTGAATGAGCTTGAACAACTGTTACAATCA 221 V K K H I F K H N D L N E L E Q L L Q S 721 TATCCCAAATCAGTACCTAAATTGATTGCTTTTGAATCTGTGTATTCGATGGCGGGCAGT 241 Y P K S V P K L I A F E S V Y S M A G S 781 GTTGCGGATATAGAAAAGATTTGCGACCTTGCCGATAAGTATGGTGCTCTAACCTTTTTA 261 V A D I E K I C D L A D K Y G A L T F L 841 GATGAAGTTCATGCCGTTGGTTTGTACGGCCCTCACGGCGCTGGTGTTGCCGAACATTGT 281 D E V H A V G L Y G P H G A G V A E H C 901 GACTTTGAATCACATAGAGCTTCCGGTATTGCGACGCCAAAAACTAATGATAAGGGTGGT 301 D F E S H R A S G I A T P K T N D K G G 961 GCTAAAACTGTCATGGATAGAGTGGATATGATTACTGGTACATTAGGGAAATCCTTTGGC 321 A K T V M D R V D M I T G T L G K S F G 1021 TCTGTAGGAGGATATGTTGCGGCGTCAAGAAAACTTATTGACTGGTTTAGATCTTTCGCC 341 S V G G Y V A A S R K L I D W F R S F A 1081 CCAGGTTTCATTTTTACAACAACTTTGCCCCCATCAGTGATGGCAGGGGCCACAGCCGCC 361 P G F I F T T T L P P S V M A G A T A A 1141 ATAAGATATCAAAGGTGTCATATTGATCTACGTACATCTCAACAAAAGCACACTATGTAC 381 I R Y Q R C H I D L R T S Q Q K H T M Y 1201 GTCAAAAAGGCATTTCATGAGCTGGGTATTCCTGTGATCCCGAATCCTTCTCATATTGTA 401 V K K A F H E L G I P V I P N P S H I V 1261 CCTGTGTTGATAGGTAACGCTGATCTAGCAAAACAAGCCTCCGATATTCTGATCAACAAG 421 P V L I G N A D L A K Q A S D I L I N K 1321 CATCAAATTTATGTCCAAGCAATCAACTTTCCAACAGTTGCTAGAGGAACTGAAAGGCTG 441 H Q I Y V Q A I N F P T V A R G T E R L 1381 AGAATAACGCCTACTCCAGGTCACACTAACGATTTGAGTGATATTTTGATTAATGCCGTG 461 R I T P T P G H T N D L S D I L I N A V 1441 GATGATGTCTTTAACGAATTGCAATTACCCAGAGTTAGAGACTGGGAGAGTCAAGGTGGT 481 D D V F N E L Q L P R V R D W E S Q G G 1501 CTTCTGGGGGTGGGAGAAAGCGGCTTTGTTGAAGAATCTAATTTATGGACGAGCAGTCAA 501 L L G V G E S G F V E E S N L W T S S Q 1561 TTATCCTTGACTAACGATGACCTGAATCCTAATGTCAGAGATCCGATCGTAAAGCAATTA 521 L S L T N D D L N P N V R D P I V K Q L 1621 GAAGTTTCGTCTGGGATAAAACAGTATCCATACGATGTACCCGATTACGCG 541 E V S S G I K Q Y P Y D V P D Y A Stop 1 TGA 1 * Restriction Sites Name Seq. Locations AatI AGGCCT none AccI GTMKAC 195 AflII CTTAAG none AgeI ACCGGT none AlwI GGATC 773, 1365(c), 1729(c) AlwNI CAGNNNCTG none ApaI GGGCCC none ApaLI GTGCAC none AscI GGCGCGCC none AseI ATTAAT 63, 1558, 1821, 1836, 1853 AvaI CYCGRG none AvaII GGWCC none AvrII CCTAGG none BamHI GGATCC none BbsI GAAGAC 1628(c) BbvI GCAGC 253, 259, 289, 301, 169(c), 205(c), 292(c), 310(c), 1910(c) BclI TGATCA 1439 BglI GCCNNNNNGGC 998 BglII AGATCT 723, 1198 BlpI GCTNAGC none BsaI GGTCTC none BsmAI GTCTC 1607(c) BsmBI CGTCTC none BstEII GGTNACC none BstXI CCANNNNNNTGG none ClaI ATCGAT 547 DraIII CACNNNGTG none EagI CGGCCG 15, 1905 EarI CTCTTC none EcoRI GAATTC 8 EcoRV GATATC 1274 FokI GGATG 559, 1569, 275(c), 514(c) FseI GGCCGGCC none HindIII AAGCTT none KasI GGCGCC 244 KpnI GGTACC none MluI ACGCGT 1796 NarI GGCGCC 244 NcoI CCATGG none NdeI CATATG none NheI GCTAGC none NotI GCGGCCGC 14, 1904 NsiI ATGCAT none PacI TTAATTAA none PciI ACATGT none PmeI GTTTAAAC none PstI CTGCAG 1911 PvuI CGATCG 1733 PvuII CAGCTG 290 SacI GAGCTC none SacII CCGCGG none SalI GTCGAC none SapI GCTCTTC none SfiI GGCCNNNNNGGCC none SgrAI CRCCGGYG 246 SmaI CCCGGG none SpeI ACTAGT 1897 SphI GCATGC none SspI AATATT 55, 66 StuI AGGCCT none SwaI ATTTAAAT none TliI CTCGAG none XbaI TCTAGA 23 XhoI CTCGAG none XmaI CCCGGG none XmnI GAANNNNTTC none Codon Usage Table AmAcid Codon Number /1000 Fraction END TAA 0 0.0 0.0 END TGA 1 1.79 1.0 END TAG 0 0.0 0.0 ALA GCT 20 35.84 0.31 ALA GCA 15 26.88 0.23 ALA GCC 22 39.42 0.34 ALA GCG 7 12.54 0.10 CYS TGT 3 5.37 0.5 CYS TGC 3 5.37 0.5 ASP GAT 27 48.38 0.81 ASP GAC 6 10.75 0.18 GLU GAA 18 32.25 0.72 GLU GAG 7 12.54 0.28 PHE TTT 14 25.08 0.7 PHE TTC 6 10.75 0.3 GLY GGT 19 34.05 0.47 GLY GGA 7 12.54 0.17 GLY GGC 9 16.12 0.22 GLY GGG 5 8.96 0.12 HIS CAT 14 25.08 0.63 HIS CAC 8 14.33 0.36 ILE ATT 18 32.25 0.56 ILE ATA 7 12.54 0.21 ILE ATC 7 12.54 0.21 LYS AAA 20 35.84 0.60 LYS AAG 13 23.29 0.39 LEU TTG 14 25.08 0.28 LEU TTA 13 23.29 0.26 LEU CTA 6 10.75 0.12 LEU CTT 6 10.75 0.12 LEU CTG 10 17.92 0.20 LEU CTC 0 0.0 0.0 MET ATG 9 16.12 1.0 ASN AAT 11 19.71 0.39 ASN AAC 17 30.46 0.60 PRO CCA 9 16.12 0.39 PRO CCT 8 14.33 0.34 PRO CCC 4 7.16 0.17 PRO CCG 2 3.58 0.08 GLN CAA 14 25.08 0.77 GLN CAG 4 7.16 0.22 ARG AGA 15 26.88 0.68 ARG AGG 3 5.37 0.13 ARG CGT 4 7.16 0.18 ARG CGA 0 0.0 0.0 ARG CGC 0 0.0 0.0 ARG CGG 0 0.0 0.0 SER TCT 14 25.08 0.33 SER TCA 8 14.33 0.19 SER AGT 6 10.75 0.14 SER TCC 8 14.33 0.19 SER AGC 4 7.16 0.09 SER TCG 2 3.58 0.04 THR ACT 13 23.29 0.39 THR ACA 10 17.92 0.30 THR ACC 3 5.37 0.09 THR ACG 7 12.54 0.21 VAL GTT 13 23.29 0.36 VAL GTA 6 10.75 0.16 VAL GTC 6 10.75 0.16 VAL GTG 11 19.71 0.30 TRP TGG 4 7.16 1.0 TYR TAT 11 19.71 0.61 TYR TAC 7 12.54 0.38 GC Percentage: 40.0%