Gene PaVE DNA Sequence Optimized DNA Sequence Protein

advertisement
Gene
HPV6 E1
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
ATGGCGGACGATTCAGGTACAGAAAATGAG
GGGTCTGGGTGTACAGGATGGTTTATGGTAG
AAGCTATAGTGCAACACCCAACAGGTACACA
AATATCAGACGATGAGGATGAGGAGGTGGA
GGACAGTGGGTATGACATGGTGGACTTTATT
GATGACAGCAATATTACACACAATTCACTGG
AAGCACAGGCATTGTTTAACAGGCAGGAGGC
GGACACCCATTATGCGACTGTGCAGGACCTA
AAACGAAAGTATTTAGGTAGTCCATATGTTA
GTCCTATAAACACTATAGCCGAGGCAGTGGA
AAGTGAAATAAGTCCACGATTGGACGCCATT
AAACTTACAAGACAGCCAAAAAAGGTAAAGC
GACGGCTGTTTCAAACCAGGGAACTAACGGA
CAGTGGATATGGCTATTCTGAAGTGGAAGCT
GGAACGGGAACGCAGGTAGAGAAACATGGC
GTACCGGAAAATGGGGGAGATGGTCAGGAA
AAGGACACAGGAAGGGACATAGAGGGGGA
GGAACATACAGAGGCGGAAGCGCCCACAAA
CAGTGTACGGGAGCATGCAGGCACAGCAGG
AATATTGGAATTGTTAAAATGTAAAGATTTAC
GGGCAGCATTACTTGGTAAGTTTAAAGAATG
CTTTGGGCTGTCTTTTATAGATTTAATTAGGC
CATTTAAAAGTGATAAAACAACATGTTTAGAT
TGGGTGGTAGCAGGGTTTGGTATACATCATA
GCATATCAGAGGCATTTCAAAAATTAATTGA
GCCATTAAGTTTATATGCACATATACAATGGC
TAACAAATGCATGGGGAATGGTATTGTTAGT
ATTATTAAGATTTAAAGTAAATAAAAGTAGA
AGTACCGTTGCACGTACACTTGCAACGCTATT
AAATATACCTGAAAACCAAATGTTAATAGAG
CCACCAAAAATACAAAGTGGTGTTGCAGCCC
TGTATTGGTTTCGTACAGGTATATCAAATGCC
AGTACAGTTATAGGGGAAGCACCAGAATGG
ATAACACGCCAAACAGTTATTGAACACGGGT
TGGCAGACAGTCAGTTTAAATTAACAGAAAT
GGTGCAGTGGGCGTATGATAATGACATATGC
GAGGAGAGTGAAATTGCATTTGAATATGCAC
AAAGGGGAGATTTTGATTCTAATGCACGAGC
ATTTTTAAATAGCAATATGCAGGCAAAATATG
TGAAAGATTGTGCAACTATGTGTAGACATTAT
AAACATGCAGAAATGAGGAAGATGTCTATAA
AACAATGGATAAAACATAGGGGTTCTAAAAT
AGAAGGCACAGGAAATTGGAAACCAATTGTA
CAATTCCTACGACATCAAAATATAGAATTCAT
TCCTTTTTTAACTAAATTTAAATTATGGCTGCA
CGGTACGCCAAAAAAAAACTGCATAGCCATA
GTAGGCCCTCCAGATACTGGGAAATCGTACT
TTTGTATGAGTTTAATAAGCTTTCTAGGAGGT
ACAGTTATTAGTCATGTAAATTCCAGCAGCCA
TTTTTGGTTGCAACCGTTAGTAGATGCTAAGG
TAGCATTGTTAGATGATGCAACACAGCCATG
TTGGATATATATGGATACATATATGAGAAATT
TGTTAGATGGTAATCCTATGAGTATTGACAG
AAAGCATAAAGCATTGACATTAATTAAATGTC
CACCTCTGCTAGTAACGTCCAACATAGATATT
ACTAAAGAAGATAAATATAAGTATTTACATAC
TAGAGTAACAACATTTACATTTCCAAATCCAT
TCCCTTTTGACAGAAATGGGAATGCAGTGTA
TGAACTGTCAAATACAAACTGGAAATGTTTTT
TTGAAAGACTGTCGTCAAGCCTAGACATTCA
GGATTCTGAGGACGAGGAAGATGGAAGCAA
TAGCCAAGCGTTTAGATGCGTGCCAGGAACA
GTTGTTAGAACTTTATGA
ATGGCAGACGATTCAGGGACAGAAAATGAG
GGGAGCGGGTGTACTGGGTGGTTTATGGTG
GAGGCTATTGTGCAGCATCCTACTGGAACCC
AGATCTCCGACGATGAGGACGAGGAAGTGG
AAGATTCTGGGTACGACATGGTCGATTTTATC
GACGATTCTAACATTACACACAATAGTCTGGA
GGCTCAGGCACTGTTCAACAGACAGGAAGCA
GACACACATTATGCCACTGTGCAGGATCTGA
AGAGGAAATACCTGGGCAGTCCCTATGTGTC
ACCTATCAATACCATTGCCGAGGCTGTCGAGT
CTGAAATCAGTCCACGACTGGACGCCATCAA
GCTGACACGGCAGCCCAAGAAAGTGAAGCG
GAGACTGTTTCAGACCCGCGAGCTGACAGAT
AGCGGGTACGGATATTCCGAGGTCGAAGCC
GGCACAGGGACTCAGGTGGAGAAACACGGA
GTCCCAGAAAACGGAGGGGACGGACAGGAG
AAGGACACAGGACGCGATATCGAAGGCGAG
GAACACACAGAGGCAGAAGCCCCCACTAATA
GCGTGCGAGAGCATGCCGGCACTGCTGGGA
TCCTGGAACTGCTGAAGTGCAAAGACCTGCG
GGCCGCTCTGCTGGGCAAGTTCAAAGAGTGT
TTTGGGCTGAGTTTCATCGATCTGATTAGACC
TTTCAAGTCAGACAAAACCACATGCCTGGATT
GGGTGGTCGCTGGATTTGGCATCCACCATTC
CATTTCTGAGGCATTCCAGAAGCTGATCGAA
CCACTGTCCCTGTACGCACACATTCAGTGGCT
GACTAACGCCTGGGGCATGGTGCTGCTGGTC
CTGCTGAGGTTTAAGGTGAACAAGAGTAGGT
CAACCGTCGCTCGCACCCTGGCAACACTGCT
GAACATCCCCGAGAATCAGATGCTGATCGAA
CCCCCTAAGATTCAGAGCGGAGTGGCAGCCC
TGTATTGGTTCCGCACAGGGATCTCAAACGC
TAGCACTGTGATTGGAGAGGCACCTGAATGG
ATCACTCGGCAGACCGTCATTGAGCACGGCC
TGGCCGACTCTCAGTTTAAGCTGACCGAAAT
GGTGCAGTGGGCTTACGACAACGATATCTGT
GAGGAAAGCGAGATTGCCTTCGAATATGCTC
AGAGAGGGGACTTTGATTCAAATGCTAGGGC
ATTCCTGAACAGCAATATGCAGGCCAAGTAC
GTGAAAGATTGCGCTACCATGTGTCGCCACT
ATAAGCATGCCGAGATGCGAAAGATGAGCAT
CAAACAGTGGATTAAGCATAGAGGATCCAAA
ATCGAAGGGACAGGAAACTGGAAGCCTATT
GTGCAGTTTCTGAGGCACCAGAATATCGAGT
TCATTCCTTTTCTGACCAAGTTCAAACTGTGG
CTGCATGGCACACCAAAGAAAAACTGCATCG
CCATTGTGGGGCCACCCGACACAGGAAAATC
TTACTTTTGTATGTCCCTGATCTCTTTCCTGGG
AGGCACTGTGATTAGTCACGTCAATAGCTCCT
CTCATTTTTGGCTGCAGCCCCTGGTGGACGCA
AAGGTCGCCCTGCTGGACGATGCAACTCAGC
CTTGCTGGATCTACATGGATACCTATATGAG
GAACCTGCTGGACGGCAATCCAATGAGCATC
GATCGCAAGCACAAAGCTCTGACCCTGATCA
AGTGTCCTCCACTGCTGGTGACCTCCAACATC
GACATTACAAAGGAGGATAAGTACAAATATC
TGCATACTCGAGTGACTACCTTCACCTTTCCC
AATCCTTTCCCATTTGATCGGAACGGCAATGC
CGTCTACGAGCTGTCTAACACCAATTGGAAG
TGCTTCTTTGAACGGCTGAGTTCAAGCCTGG
ACATCCAGGATAGCGAGGACGAGGAAGATG
GCAGCAACTCCCAGGCCTTCCGGTGTGTGCC
AGGAACTGTGGTCAGAACCCTGTGA
MADDSGTENEGSGCTGWFMVEAI
VQHPTGTQISDDEDEEVEDSGYDM
VDFIDDSNITHNSLEAQALFNRQEAD
THYATVQDLKRKYLGSPYVSPINTIAE
AVESEISPRLDAIKLTRQPKKVKRRLF
QTRELTDSGYGYSEVEAGTGTQVEK
HGVPENGGDGQEKDTGRDIEGEEH
TEAEAPTNSVREHAGTAGILELLKCK
DLRAALLGKFKECFGLSFIDLIRPFKSD
KTTCLDWVVAGFGIHHSISEAFQKLI
EPLSLYAHIQWLTNAWGMVLLVLLR
FKVNKSRSTVARTLATLLNIPENQML
IEPPKIQSGVAALYWFRTGISNASTVI
GEAPEWITRQTVIEHGLADSQFKLTE
MVQWAYDNDICEESEIAFEYAQRG
DFDSNARAFLNSNMQAKYVKDCAT
MCRHYKHAEMRKMSIKQWIKHRGS
KIEGTGNWKPIVQFLRHQNIEFIPFLT
KFKLWLHGTPKKNCIAIVGPPDTGKS
YFCMSLISFLGGTVISHVNSSSHFWL
QPLVDAKVALLDDATQPCWIYMDT
YMRNLLDGNPMSIDRKHKALTLIKC
PPLLVTSNIDITKEDKYKYLHTRVTTFT
FPNPFPFDRNGNAVYELSNTNWKCF
FERLSSSLDIQDSEDEEDGSNSQAFR
CVPGTVVRTL
Page 1 of 48
% Nucleotide
Change
41.3%
Gene
HPV6 E2
HPV6 E4
HPV6 E5A
HPV6 E5B
HPV6 E6
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
ATGGAAGCAATAGCCAAGCGTTTAGATGCGT
GCCAGGAACAGTTGTTAGAACTTTATGAAGA
AAACAGTACTGACCTACACAAACATGTATTGC
ATTGGAAATGCATGAGACATGAAAGTGTATT
ATTATATAAAGCAAAACAAATGGGCCTAAGC
CACATAGGAATGCAAGTAGTGCCACCATTAA
AGGTGTCCGAAGCAAAAGGACATAATGCCAT
TGAAATGCAAATGCATTTAGAATCATTATTAA
GGACTGAGTATAGTATGGAACCGTGGACATT
ACAAGAAACAAGTTATGAAATGTGGCAAACA
CCACCTAAACGCTGTTTTAAAAAACGGGGCA
AAACTGTAGAAGTTAAATTTGATGGCTGTGC
AAACAATACAATGGATTATGTGGTATGGACA
GATGTGTATGTGCAGGACAATGACACCTGGG
TAAAGGTGCATAGTATGGTAGATGCTAAGGG
TATATATTACACATGTGGACAATTTAAAACAT
ATTATGTAAACTTTGTAAAAGAGGCAGAAAA
GTATGGGAGCACCAAACATTGGGAAGTATGT
TATGGCAGCACAGTTATATGTTCTCCTGCATC
TGTATCTAGCACTACACAAGAAGTATCCATTC
CTGAATCTACTACATACACCCCCGCACAGACC
TCCACCCTTGTGTCCTCAAGCACCAAGGAAG
ACGCAGTGCAAACGCCGCCTAGGAAACGAG
CACGAGGAGTCCAACAGTCCCCTTGCAACGC
CTTGTGTGTGGCCCACATTGGACCCGTGGAC
AGTGGAAACCACAACCTCATCACTAACAATC
ACGACCAGCACCAAAGACGGAACAACAGTA
ACAGTTCAGCTACGCCTATAGTGCAATTTCAA
GGTGAATCCAATTGTTTAAAGTGTTTTAGATA
TAGGCTAAATGACAGACACAGACATTTATTT
GATTTAATATCATCAACGTGGCACTGGGCCTC
CTCAAAGGCACCACATAAACATGCCATTGTA
ACTGTAACATATGATAGTGAGGAACAAAGGC
AACAGTTTTTAGATGTTGTAAAAATACCCCCT
ACCATTAGCCACAAACTGGGATTTATGTCACT
GCACCTATTGTAA
ATGGGAGCACCAAACATTGGGAAGTATGTTA
TGGCAGCACAGTTATATGTTCTCCTGCATCTG
TATCTAGCACTACACAAGAAGTATCCATTCCT
GAATCTACTACATACACCCCCGCACAGACCTC
CACCCTTGTGTCCTCAAGCACCAAGGAAGAC
GCAGTGCAAACGCCGCCTAGGAAACGAGCA
CGAGGAGTCCAACAGTCCCCTTGCAACGCCT
TGTGTGTGGCCCACATTGGACCCGTGGACAG
TGGAAACCACAACCTCATCACTAACAATCACG
ACCAGCACCAAAGACGGAACAACAGTAACA
GTTCAGCTACGCCTATAG
ATGGAAGTGGTGCCTGTACAAATAGCTGCAG
GAACAACCAGCACATTCATACTGCCTGTTATA
ATTGCATTTGTTGTATGTTTTGTTAGCATCATA
CTTATTGTATGGATATCTGAGTTTATTGTGTA
CACATCTGTGCTAGTACTAACACTGCTTTTAT
ATTTACTATTGTGGCTGCTATTAACAACCCCC
TTGCAATTTTTCCTACTAACTCTACTTGTGTGT
TACTGTCCCGCATTGTATATACACTACTATATT
GTTACCACACAGCAATGA
ATGATGCTAACATGTCAATTTAATGATGGAG
ATACCTGGCTGGGTTTGTGGTTGTTATGTGCC
TTTATTGTAGGGATGTTGGGGTTATTATTGAT
GCACTATAGAGCTGTACAAGGGGATAAACAC
ACCAAATGTAAGAAGTGTAACAAACACAACT
GTAATGATGATTATGTAACTATGCATTATACT
ACTGATGGTGATTATATATATATGAATTAG
ATGGAAAGTGCAAATGCCTCCACGTCTGCAA
CGACCATAGACCAGTTGTGCAAGACGTTTAA
TCTATCTATGCATACGTTGCAAATTAATTGTG
TGTTTTGCAAGAATGCACTGACCACAGCAGA
GATTTATTCATATGCATATAAACACCTAAAGG
TCCTGTTTCGAGGCGGCTATCCATATGCAGCC
TGCGCGTGCTGCCTAGAATTTCATGGAAAAA
TAAACCAATATAGACACTTTGATTATGCTGGA
TATGCAACAACAGTTGAAGAAGAAACTAAAC
AAGACATCTTAGACGTGCTAATTCGGTGCTAC
ATGGAGGCTATTGCTAAAAGACTGGACGCTT
GTCAGGAACAGCTGCTGGAACTGTACGAGG
AAAACTCTACCGATCTGCATAAACATGTGCTG
CACTGGAAGTGCATGCGCCATGAGTCCGTGC
TGCTGTACAAGGCCAAACAGATGGGCCTGTC
TCACATCGGGATGCAGGTGGTCCCCCCTCTG
AAGGTGAGTGAAGCTAAAGGCCACAACGCA
ATTGAGATGCAGATGCATCTGGAAAGCCTGC
TGCGGACCGAGTACTCCATGGAACCATGGAC
TCTGCAGGAGACCTCCTATGAAATGTGGCAG
ACCCCACCCAAGCGATGCTTCAAGAAAAGGG
GCAAGACAGTGGAGGTCAAATTTGACGGGT
GTGCCAACAATACCATGGACTACGTGGTCTG
GACAGATGTGTATGTCCAGGACAACGATACA
TGGGTGAAGGTCCACTCTATGGTGGATGCTA
AAGGGATCTACTATACATGTGGACAGTTCAA
GACTTACTATGTGAATTTTGTCAAGGAGGCA
GAAAAATACGGATCAACAAAACATTGGGAG
GTGTGCTATGGCAGCACTGTCATCTGTTCCCC
TGCATCTGTGAGCTCCACCACACAGGAGGTC
AGTATTCCAGAATCAACTACCTATACCCCCGC
CCAGACAAGCACTCTGGTGTCTAGTTCAACTA
AGGAAGACGCCGTGCAGACCCCTCCAAGGA
AACGAGCTCGAGGAGTGCAGCAGTCCCCTTG
CAACGCCCTGTGTGTGGCTCACATCGGACCA
GTCGACTCTGGCAACCATAATCTGATTACTAA
CAATCACGATCAGCATCAGCGGAGAAACAAT
AGTAATAGCTCCGCCACCCCCATTGTGCAGTT
CCAGGGAGAGTCAAACTGCCTGAAGTGTTTT
CGGTACAGACTGAATGACAGGCACCGCCATC
TGTTCGATCTGATCTCTAGTACATGGCACTGG
GCATCAAGCAAGGCCCCTCACAAACATGCTA
TTGTGACCGTCACATATGACTCCGAGGAACA
GAGACAGCAGTTCCTGGATGTGGTCAAGATC
CCCCCTACTATTTCTCACAAACTGGGGTTTAT
GAGTCTGCATCTGCTG
ATGGGGGCTCCTAATATCGGAAAGTATGTCA
TGGCCGCTCAGCTGTATGTCCTGCTGCATCTG
TATCTGGCACTGCACAAGAAGTATCCCTTCCT
GAACCTGCTGCACACTCCCCCTCATAGGCCAC
CACCTCTGTGCCCACAGGCACCACGAAAGAC
CCAGTGTAAACGGAGACTGGGCAACGAGCA
CGAGGAATCTAATAGTCCTCTGGCTACACCAT
GCGTGTGGCCCACACTGGACCCTTGGACTGT
CGAAACCACAACTAGCTCCCTGACCATCACCA
CATCCACAAAGGATGGGACTACCGTGACCGT
CCAGCTGCGACTG
ATGGAGGTCGTCCCCGTCCAGATTGCCGCCG
GAACCACTTCAACTTTCATCCTGCCCGTCATC
ATTGCTTTCGTCGTGTGTTTCGTGTCTATCATT
CTGATCGTGTGGATTAGCGAGTTCATCGTCTA
CACATCCGTGCTGGTCCTGACTCTGCTGCTGT
ATCTGCTGCTGTGGCTGCTGCTGACCACACCC
CTGCAGTTCTTTCTGCTGACCCTGCTGGTGTG
CTACTGTCCTGCCCTGTATATCCACTACTATAT
TGTCACTACCCAGCAG
ATGATGCTGACTTGTCAGTTCAACGATGGCG
ATACTTGGCTGGGGCTGTGGCTGCTGTGTGC
TTTTATTGTCGGAATGCTGGGGCTGCTGCTG
ATGCACTACCGGGCCGTGCAGGGCGACAAG
CATACTAAATGCAAGAAATGTAACAAGCACA
ACTGCAATGACGATTACGTCACCATGCATTAT
ACCACAGACGGGGATTACATCTATATGAAT
ATGGAGTCCGCTAACGCTTCTACTTCCGCAAC
TACTATCGACCAGCTGTGTAAGACCTTCAACC
TGTCAATGCATACCCTGCAGATTAACTGCGTG
TTCTGTAAGAATGCTCTGACCACAGCAGAAA
TCTACAGCTATGCCTACAAGCACCTGAAAGTC
CTGTTTAGGGGCGGGTATCCCTACGCCGCTT
GCGCTTGCTGTCTGGAGTTCCACGGAAAAAT
TAACCAGTATCGCCATTTTGACTATGCAGGCT
ACGCCACTACCGTGGAGGAAGAGACCAAGC
AGGACATCCTGGATGTCCTGATTCGATGCTA
MEAIAKRLDACQEQLLELYEENSTDL
HKHVLHWKCMRHESVLLYKAKQM
GLSHIGMQVVPPLKVSEAKGHNAIE
MQMHLESLLRTEYSMEPWTLQETS
YEMWQTPPKRCFKKRGKTVEVKFD
GCANNTMDYVVWTDVYVQDNDT
WVKVHSMVDAKGIYYTCGQFKTYY
VNFVKEAEKYGSTKHWEVCYGSTVI
CSPASVSSTTQEVSIPESTTYTPAQTS
TLVSSSTKEDAVQTPPRKRARGVQQ
SPCNALCVAHIGPVDSGNHNLITNN
HDQHQRRNNSNSSATPIVQFQGES
NCLKCFRYRLNDRHRHLFDLISSTWH
WASSKAPHKHAIVTVTYDSEEQRQQ
FLDVVKIPPTISHKLGFMSLHLL
Page 2 of 48
% Nucleotide
Change
25.5%
MGAPNIGKYVMAAQLYVLLHLYLAL
HKKYPFLNLLHTPPHRPPPLCPQAPR
KTQCKRRLGNEHEESNSPLATPCVW
PTLDPWTVETTTSSLTITTSTKDGTTV
TVQLRL
23.6%
MEVVPVQIAAGTTSTFILPVIIAFVVC
FVSIILIVWISEFIVYTSVLVLTLLLYLLL
WLLLTTPLQFFLLTLLVCYCPALYIHYY
IVTTQQ
27.2%
MMLTCQFNDGDTWLGLWLLCAFIV
GMLGLLLMHYRAVQGDKHTKCKKC
NKHNCNDDYVTMHYTTDGDYIYM
N
22.4%
MESANASTSATTIDQLCKTFNLSMH
TLQINCVFCKNALTTAEIYSYAYKHLK
VLFRGGYPYAACACCLEFHGKINQYR
HFDYAGYATTVEEETKQDILDVLIRCY
LCHKPLCEVEKVKHILTKARFIKLNCT
WKGRCLHCWTTCMEDMLP
22.5%
Gene
HPV6 E7
HPV6 L1
PaVE DNA Sequence
Optimized DNA Sequence
CTGTGTCACAAACCGCTGTGTGAAGTAGAAA
AGGTAAAACATATACTAACCAAGGCGCGGTT
CATAAAGCTAAATTGTACGTGGAAGGGTCGC
TGCCTACACTGCTGGACAACATGCATGGAAG
ACATGTTACCCTAA
CCTGTGTCACAAACCCCTGTGTGAAGTGGAG
AAGGTCAAACATATCCTGACCAAGGCCCGGT
TCATCAAGCTGAACTGCACATGGAAGGGGAG
ATGCCTGCATTGTTGGACAACTTGTATGGAA
GATATGCTGCCT
ATGCATGGAAGACATGTTACCCTAAAGGATA
TTGTATTAGACCTGCAACCTCCAGACCCTGTA
GGGTTACATTGCTATGAGCAATTAGTAGACA
GCTCAGAAGATGAGGTGGACGAAGTGGACG
GACAAGATTCACAACCTTTAAAACAACATTTC
CAAATAGTGACCTGTTGCTGTGGATGTGACA
GCAACGTTCGACTGGTTGTGCAGTGTACAGA
AACAGACATCAGAGAAGTGCAACAGCTTCTG
TTGGGAACACTAAACATAGTGTGTCCCATCTG
CGCACCGAAGACCTAA
ATGTGGCGGCCTAGCGACAGCACAGTATATG
TGCCTCCTCCTAACCCTGTATCCAAAGTTGTT
GCCACGGATGCTTATGTTACTCGCACCAACAT
ATTTTATCATGCCAGCAGTTCTAGACTTCTTG
CAGTGGGACATCCTTATTTTTCCATAAAACGG
GCTAACAAAACTGTTGTGCCAAAGGTGTCAG
GATATCAATACAGGGTATTTAAGGTGGTGTT
ACCAGATCCTAACAAATTTGCATTGCCTGACT
CGTCTCTTTTCGATCCCACAACACAACGTTTA
GTATGGGCATGCACAGGCCTAGAGGTGGGC
AGGGGACAGCCATTAGGTGTGGGTGTAAGT
GGACATCCTTTCCTAAATAAATATGATGATGT
TGAAAATTCAGGGAGTGGTGGTAACCCTGGA
CAGGATAACAGGGTTAATGTAGGTATGGATT
ATAAACAAACACAATTATGCATGGTTGGATG
TGCCCCCCCTTTGGGCGAGCATTGGGGTAAA
GGTAAACAGTGTACTAATACACCTGTACAGG
CTGGTGACTGCCCGCCCTTAGAACTTATTACC
AGTGTTATACAGGATGGCGATATGGTTGACA
CAGGCTTTGGTGCTATGAATTTTGCTGATTTG
CAGACCAATAAATCAGATGTTCCTATTGACAT
ATGTGGCACTACATGTAAATATCCAGATTATT
TACAAATGGCTGCAGACCCATATGGTGATAG
ATTATTTTTTTTTCTACGGAAGGAACAAATGT
TTGCCAGACATTTTTTTAACAGGGCTGGCGA
GGTGGGGGAACCTGTGCCTGATACACTTATA
ATTAAGGGTAGTGGAAATCGCACGTCTGTAG
GGAGTAGTATATATGTTAACACCCCGAGCGG
CTCTTTGGTGTCCTCTGAGGCACAATTGTTTA
ATAAGCCATATTGGCTACAAAAAGCCCAGGG
ACATAACAATGGTATTTGTTGGGGTAATCAA
CTGTTTGTTACTGTGGTAGATACCACACGCAG
TACCAACATGACATTATGTGCATCCGTAACTA
CATCTTCCACATACACCAATTCTGATTATAAA
GAGTACATGCGTCATGTGGAAGAGTATGATT
TACAATTTATTTTTCAATTATGTAGCATTACAT
TGTCTGCTGAAGTAATGGCCTATATTCACACA
ATGAATCCCTCTGTTTTGGAAGACTGGAACTT
TGGGTTATCGCCTCCCCCAAATGGTACATTAG
AAGATACCTATAGGTATGTGCAGTCACAGGC
CATTACCTGTCAAAAGCCCACTCCTGAAAAG
GAAAAGCCAGATCCCTATAAGAACCTTAGTTT
TTGGGAGGTTAATTTAAAAGAAAAGTTTTCTA
GTGAATTGGATCAGTATCCTTTGGGACGCAA
GTTTTTGTTACAAAGTGGATATAGGGGACGG
TCCTCTATTCGTACAGGTGTTAAGCGCCCTGC
TGTTTCCAAAGCCTCTGCTGCCCCTAAACGTA
AGCGCGCCAAAACTAAAAGGTAA
ATGCACGGAAGACACGTCACCCTGAAAGATA
TTGTCCTGGACCTGCAGCCTCCCGACCCTGTG
GGCCTGCATTGCTATGAACAGCTGGTGGACA
GCTCCGAGGACGAAGTGGATGAGGTCGACG
GCCAGGATTCTCAGCCCCTGAAGCAGCACTT
CCAGATCGTGACATGCTGTTGCGGGTGTGAC
AGCAACGTCCGGCTGGTGGTCCAGTGCACCG
AGACAGATATTAGAGAAGTGCAGCAGCTGCT
GCTGGGCACTCTGAATATCGTCTGTCCCATTT
GCGCCCCTAAAACC
ATGTGGCGGCCTTCAGATTCAACTGTCTATGT
GCCCCCTCCAAACCCCGTGTCAAAAGTCGTCG
CTACCGATGCTTATGTCACCAGAACCAATATC
TTTTACCACGCTAGCTCCTCTAGGCTGCTGGC
AGTGGGCCATCCATATTTCTCAATTAAGCGCG
CCAACAAGACAGTGGTCCCCAAGGTGTCTGG
CTACCAGTATAGGGTCTTTAAGGTGGTCCTG
CCTGACCCAAACAAATTTGCTCTGCCCGACAG
TTCACTGTTCGATCCTACCACACAGCGGCTGG
TGTGGGCATGCACTGGCCTGGAAGTCGGAA
GAGGACAGCCACTGGGAGTGGGAGTCTCCG
GACACCCCTTCCTGAATAAGTACGACGATGT
GGAGAACAGCGGATCCGGAGGAAATCCAGG
ACAGGACAACCGAGTGAATGTCGGCATGGAT
TATAAACAGACCCAGCTGTGCATGGTGGGAT
GTGCACCACCTCTGGGAGAACATTGGGGCAA
GGGGAAACAGTGCACTAACACCCCTGTGCAG
GCTGGAGATTGTCCACCCCTGGAGCTGATCA
CCTCCGTGATTCAGGACGGCGATATGGTCGA
CACAGGATTTGGCGCTATGAACTTCGCAGAT
CTGCAGACAAATAAGAGCGACGTGCCTATCG
ATATTTGCGGGACTACCTGTAAATACCCTGAC
TATCTGCAGATGGCCGCTGACCCATACGGAG
ATCGCCTGTTCTTTTTCCTGCGAAAGGAACAG
ATGTTCGCCCGACACTTTTTCAATCGAGCTGG
AGAAGTGGGAGAACCAGTCCCTGATACCCTG
ATCATCAAGGGGAGTGGAAATAGGACATCA
GTGGGGAGCTCCATCTACGTCAACACTCCTTC
TGGAAGTCTGGTGTCTAGTGAGGCACAGCTG
TTTAACAAGCCATATTGGCTGCAGAAAGCCC
AGGGGCATAACAATGGAATTTGCTGGGGCA
ATCAGCTGTTCGTGACCGTGGTCGACACAAC
TCGAAGCACCAACATGACACTGTGTGCCTCC
GTGACCACATCAAGCACATACACTAACTCCG
ACTACAAGGAGTATATGCGCCACGTGGAGGA
ATATGATCTGCAGTTTATCTTCCAGCTGTGCT
CCATTACTCTGTCTGCCGAAGTGATGGCTTAC
ATCCATACCATGAACCCATCTGTCCTGGAGGA
CTGGAATTTTGGACTGAGTCCTCCACCCAACG
GCACTCTGGAGGATACCTACAGATATGTGCA
GAGTCAGGCAATTACATGTCAGAAGCCAACT
CCCGAGAAGGAAAAACCTGACCCATATAAAA
ACCTGTCTTTTTGGGAAGTGAATCTGAAGGA
AAAATTCTCCTCTGAGCTGGATCAGTACCCCC
TGGGCCGGAAGTTCCTGCTGCAGAGCGGATA
TCGGGGCAGAAGTTCAATCAGAACAGGGGT
GAAGAGGCCCGCAGTCTCAAAAGCCAGCGC
AGCCCCTAAGAGGAAACGCGCTAAGACTAAA
AGA
Page 3 of 48
Protein Sequence
% Nucleotide
Change
MHGRHVTLKDIVLDLQPPDPVGLHC
YEQLVDSSEDEVDEVDGQDSQPLKQ
HFQIVTCCCGCDSNVRLVVQCTETDI
REVQQLLLGTLNIVCPICAPKT
22.9%
MWRPSDSTVYVPPPNPVSKVVATD
AYVTRTNIFYHASSSRLLAVGHPYFSI
KRANKTVVPKVSGYQYRVFKVVLPD
PNKFALPDSSLFDPTTQRLVWACTG
LEVGRGQPLGVGVSGHPFLNKYDDV
ENSGSGGNPGQDNRVNVGMDYKQ
TQLCMVGCAPPLGEHWGKGKQCT
NTPVQAGDCPPLELITSVIQDGDMV
DTGFGAMNFADLQTNKSDVPIDICG
TTCKYPDYLQMAADPYGDRLFFFLR
KEQMFARHFFNRAGEVGEPVPDTLII
KGSGNRTSVGSSIYVNTPSGSLVSSE
AQLFNKPYWLQKAQGHNNGICWG
NQLFVTVVDTTRSTNMTLCASVTTS
STYTNSDYKEYMRHVEEYDLQFIFQL
CSITLSAEVMAYIHTMNPSVLEDWN
FGLSPPPNGTLEDTYRYVQSQAITCQ
KPTPEKEKPDPYKNLSFWEVNLKEKF
SSELDQYPLGRKFLLQSGYRGRSSIRT
GVKRPAVSKASAAPKRKRAKTKR
25.2%
Gene
HPV6 L2
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
ATGGCACATAGTAGGGCCCGACGACGCAAG
CGTGCGTCAGCTACACAGCTATATCAAACAT
GTAAACTCACTGGAACATGCCCCCCAGATGT
AATTCCTAAGGTGGAGCACAACACCATTGCA
GATCAAATATTAAAATGGGGAAGTTTGGGGG
TGTTTTTTGGAGGGTTGGGTATAGGCACGGG
TTCCGGCACTGGGGGTCGTACTGGCTATGTT
CCCTTACAAACTTCTGCAAAACCTTCTATTACT
AGTGGGCCTATGGCTCGTCCTCCTGTGGTGG
TGGAGCCTGTGGCCCCTTCGGATCCATCTATT
GTGTCTTTAATTGAAGAATCGGCAATCATTAA
CGCAGGGGCGCCTGAAATTGTGCCCCCTGCA
CACGGTGGGTTTACAATTACATCCTCTGAAAC
AACTACCCCTGCAATATTGGATGTATCAGTTA
CTAGTCACACTACTACTAGTATATTTAGAAAT
CCTGTCTTTACAGAACCTTCTGTAACACAACC
CCAACCACCCGTGGAGGCTAATGGACATATA
TTAATTTCTGCACCCACTGTAACGTCACACCC
TATAGAGGAAATTCCTTTAGATACTTTTGTGG
TATCATCTAGTGATAGCGGTCCTACATCCAGT
ACCCCTGTTCCTGGTACTGCACCTCGGCCTCG
TGTGGGCCTATATAGTCGTGCATTGCACCAG
GTGCAGGTTACAGACCCTGCATTTCTTTCCAC
TCCTCAACGCTTAATTACATATGATAACCCTG
TATATGAAGGGGAGGATGTTAGTGTACAATT
TAGTCATGATTCTATACACAATGCACCTGATG
AGGCTTTTATGGACATAATTCGTTTGCACAGA
CCTGCCATTGCGTCCCGACGTGGCCTTGTGC
GGTACAGTCGCATTGGACAACGGGGGTCTAT
GCACACTCGCAGCGGAAAGCACATAGGGGC
CCGCATTCATTATTTTTATGATATTTCACCTAT
TGCACAGGCTGCAGAAGAAATAGAAATGCAC
CCTCTTGTGGCTGCACAGGATGATACATTTGA
TATTTATGCTGAATCTTTTGAACCTGGCATTA
ACCCTACCCAACACCCTGTTACAAATATATCA
GATACATATTTAACTTCCACACCTAATACAGT
TACACAACCGTGGGGTAACACCACAGTTCCA
TTGTCACTTCCTAATGACCTGTTTTTACAATCT
GGCCCTGATATAACTTTTCCTACTGCACCTAT
GGGAACACCCTTTAGTCCTGTAACTCCTGCTT
TACCTACAGGCCCTGTTTTCATTACAGGTTCT
GGATTTTATTTGCATCCTGCATGGTATTTTGC
ACGTAAACGCCGTAAACGTATTCCCTTATTTT
TTTCAGATGTGGCGGCCTAG
ATGGCACACTCAAGGGCACGAAGAAGAAAG
AGGGCATCCGCTACCCAGCTGTACCAGACCT
GTAAACTGACCGGCACCTGTCCACCTGATGTC
ATCCCTAAGGTGGAGCATAACACCATCGCCG
ACCAGATTCTGAAATGGGGCAGTCTGGGGGT
GTTCTTTGGCGGGCTGGGCATTGGGACTGGA
TCAGGAACCGGAGGACGAACAGGATACGTG
CCACTGCAGACTAGCGCTAAGCCCTCTATCAC
CAGTGGACCTATGGCAAGACCCCCTGTGGTC
GTGGAACCTGTCGCCCCATCAGATCCCAGCA
TCGTGTCCCTGATTGAGGAAAGCGCTATCATT
AATGCAGGAGCTCCAGAGATCGTGCCACCAG
CACATGGGGGCTTCACCATTACCAGCTCCGA
AACCACAACTCCTGCTATCCTGGACGTCTCTG
TGACCAGTCACACCACAACTTCCATCTTCAGG
AACCCTGTCTTTACTGAGCCATCTGTGACCCA
GCCCCAGCCTCCAGTCGAAGCAAATGGACAT
ATCCTGATTAGTGCCCCAACAGTGACTTCACA
CCCTATCGAGGAAATTCCACTGGACACCTTTG
TCGTGTCTAGTTCAGATTCCGGACCAACAAGC
TCCACTCCAGTCCCTGGAACAGCACCACGACC
ACGAGTGGGACTGTACTCCCGAGCTCTGCAT
CAGGTCCAGGTGACCGATCCAGCATTCCTGT
CTACCCCCCAGCGCCTGATTACATACGATAAC
CCCGTGTATGAGGGGGAAGATGTCAGCGTG
CAGTTTTCACACGACAGCATCCATAATGCTCC
AGACGAGGCATTCATGGATATCATTAGACTG
CACAGGCCCGCAATTGCCTCTCGGAGAGGCC
TGGTGCGCTATAGTCGAATCGGACAGAGGG
GCTCCATGCACACACGCTCTGGGAAACATAT
CGGAGCCCGCATTCACTACTTTTATGACATCA
GCCCCATTGCTCAGGCCGCTGAGGAAATTGA
GATGCATCCTCTGGTGGCAGCCCAGGACGAT
ACCTTCGATATCTACGCCGAGAGCTTTGAACC
AGGCATTAACCCCACACAGCACCCTGTGACT
AATATCAGCGACACCTATCTGACCTCCACACC
TAACACTGTCACCCAGCCATGGGGGAATACC
ACAGTGCCACTGTCACTGCCCAACGATCTGTT
CCTGCAGAGCGGACCTGACATCACCTTTCCTA
CAGCACCAATGGGCACACCCTTCAGTCCTGTC
ACACCAGCCCTGCCCACTGGCCCTGTGTTCAT
TACTGGGTCTGGATTTTACCTGCACCCTGCCT
GGTATTTCGCTCGGAAGAGGCGCAAAAGAAT
CCCACTGTTCTTTTCCGATGTGGCTGCA
MAHSRARRRKRASATQLYQTCKLTG
TCPPDVIPKVEHNTIADQILKWGSLG
VFFGGLGIGTGSGTGGRTGYVPLQT
SAKPSITSGPMARPPVVVEPVAPSDP
SIVSLIEESAIINAGAPEIVPPAHGGFTI
TSSETTTPAILDVSVTSHTTTSIFRNPV
FTEPSVTQPQPPVEANGHILISAPTV
TSHPIEEIPLDTFVVSSSDSGPTSSTPV
PGTAPRPRVGLYSRALHQVQVTDPA
FLSTPQRLITYDNPVYEGEDVSVQFS
HDSIHNAPDEAFMDIIRLHRPAIASR
RGLVRYSRIGQRGSMHTRSGKHIGA
RIHYFYDISPIAQAAEEIEMHPLVAAQ
DDTFDIYAESFEPGINPTQHPVTNIS
DTYLTSTPNTVTQPWGNTTVPLSLP
NDLFLQSGPDITFPTAPMGTPFSPVT
PALPTGPVFITGSGFYLHPAWYFARK
RRKRIPLFFSDVAA
Page 4 of 48
% Nucleotide
Change
26.1%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV11 E1
ATGGCGGACGATTCAGGTACAGAAAATGAG
GGGTCGGGGTGTACAGGATGGTTTATGGTA
GAAGCCATAGTAGAGCACACTACAGGTACAC
AAATATCAGAAGATGAGGAAGAGGAGGTGG
AGGACAGTGGGTATGACATGGTGGACTTTAT
TGATGACAGGCATATTACACAAAATTCTGTG
GAAGCACAGGCATTGTTTAATAGGCAGGAG
GCGGATGCTCATTATGCGACTGTGCAGGACC
TAAAACGAAAGTATTTAGGCAGTCCATATGT
AAGTCCTATAAGCAATGTAGCTAATGCAGTA
GAAAGTGAGATAAGTCCACGGTTAGACGCCA
TTAAACTTACAACACAGCCAAAAAAGGTAAA
GCGACGGCTGTTTGAAACACGGGAATTAACG
GACAGTGGATATGGCTATTCTGAAGTGGAAG
CTGCAACGCAGGTAGAGAAACATGGCGACCC
GGAAAATGGGGGAGATGGTCAGGAAAGGG
ACACAGGGAGGGACATAGAGGGTGAGGGG
GTGGAACATAGAGAGGCGGAAGCAGTAGAC
GACAGCACCCGAGAGCATGCAGACACATCAG
GAATATTAGAATTACTAAAATGTAAGGATAT
ACGATCTACATTACATGGTAAGTTTAAAGACT
GCTTTGGGCTGTCATTTGTTGATTTAATTAGG
CCATTTAAAAGTGATAGAACCACATGTGCCG
ATTGGGTGGTTGCAGGATTTGGTATACATCA
TAGCATAGCAGATGCATTTCAAAAGTTAATTG
AGCCATTAAGTTTATATGCACATATACAATGG
CTTACAAATGCATGGGGAATGGTACTATTAG
TATTAATAAGGTTTAAAGTAAATAAGAGCAG
ATGTACCGTGGCACGTACATTAGGTACGTTA
TTAAATATACCTGAAAATCACATGTTAATTGA
GCCTCCTAAAATACAAAGTGGCGTACGAGCC
CTGTATTGGTTTAGGACAGGCATTTCAAATGC
AAGTACAGTTATAGGGGAGGCGCCGGAATG
GATAACGCGCCAGACCGTTATTGAACATAGT
TTGGCTGACAGTCAATTTAAATTAACTGAAAT
GGTGCAGTGGGCATATGATAATGATATTTGT
GAAGAAAGTGAGATAGCATTTGAATATGCAC
AGCGTGGAGACTTTGACTCCAATGCAAGGGC
CTTTTTAAATAGTAATATGCAGGCTAAATATG
TAAAAGATTGTGCAATTATGTGCAGACATTAT
AAACATGCAGAAATGAAAAAGATGTCTATTA
AACAATGGATTAAGTATAGGGGTACTAAAGT
TGACAGTGTAGGTAACTGGAAGCCAATTGTG
CAGTTTCTAAGACATCAAAACATAGAATTTAT
TCCATTTTTAAGCAAACTAAAATTATGGCTGC
ACGGAACGCCCAAAAAAAATTGTATAGCCAT
TGTAGGGCCACCTGACACTGGGAAGTCGTGC
TTTTGCATGAGTTTAATTAAGTTTTTGGGGGG
AACAGTTATTAGTTATGTTAATTCCTGCAGCC
ATTTCTGGCTACAGCCACTAACGGATGCAAA
AGTGGCATTATTGGATGATGCCACACAACCA
TGTTGGACATATATGGATACATATATGAGAA
ACCTATTAGATGGTAATCCTATGAGCATAGAT
AGAAAACATAGAGCATTAACATTAATTAAGT
GTCCACCGCTACTGGTTACATCAAATATAGAC
ATTAGCAAAGAGGAGAAATACAAATATTTAC
ATAGTAGAGTTACCACATTTACATTTCCAAAT
CCATTCCCCTTTGACAGAAATGGGAATGCAG
TATATGAACTATCAGATGCAAACTGGAAATG
TTTCTTTGAAAGACTGTCGTCCAGCCTAGACA
TTGAGGATTCAGAGGACGAGGAAGATGGAA
GCAATAGCCAAGCGTTTAGATGCGTGCCAGG
ATCAGTTGTTAGAACTTTATGA
ATGGCAGATGACAGCGGGACCGAGAATGAG
GGGAGCGGATGCACTGGGTGGTTTATGGTG
GAGGCAATCGTGGAACACACTACTGGCACCC
AGATCAGCGAGGACGAGGAAGAGGAAGTG
GAAGATTCCGGATACGACATGGTCGATTTTA
TCGACGATCGGCACATTACTCAGAACAGCGT
GGAGGCACAGGCCCTGTTCAATAGACAGGA
AGCTGACGCACATTATGCCACCGTGCAGGAT
CTGAAGAGGAAATACCTGGGCAGCCCCTATG
TCTCTCCTATCAGTAACGTGGCCAATGCTGTC
GAGTCAGAAATCAGCCCAAGACTGGACGCCA
TTAAGCTGACCACACAGCCCAAGAAAGTGAA
ACGGAGACTGTTTGAGACAAGGGAACTGACT
GATAGCGGGTACGGATATTCCGAGGTGGAA
GCCGCTACCCAGGTCGAGAAGCACGGCGACC
CAGAAAACGGAGGGGATGGACAGGAGCGA
GACACAGGGCGGGATATCGAGGGCGAAGGG
GTGGAGCACAGAGAGGCAGAAGCCGTCGAC
GATTCCACTAGGGAGCATGCCGACACCTCTG
GGATCCTGGAACTGCTGAAGTGCAAAGATAT
TCGCTCCACCCTGCATGGAAAGTTCAAAGACT
GTTTTGGCCTGTCTTTCGTGGATCTGATCCGC
CCATTCAAGAGTGACCGAACTACCTGCGCCG
ATTGGGTGGTCGCTGGATTTGGCATCCACCA
TTCAATTGCTGACGCATTCCAGAAACTGATCG
AGCCCCTGAGCCTGTACGCACACATTCAGTG
GCTGACAAACGCCTGGGGCATGGTGCTGCTG
GTCCTGATCCGCTTTAAGGTGAACAAGTCTA
GATGTACTGTCGCCAGGACCCTGGGGACACT
GCTGAACATTCCTGAGAATCATATGCTGATCG
AACCCCCTAAGATTCAGAGTGGAGTGCGGGC
TCTGTATTGGTTCAGAACAGGCATCTCCAACG
CATCTACTGTGATTGGGGAGGCCCCAGAATG
GATCACTCGGCAGACCGTCATTGAGCACAGT
CTGGCTGACTCACAGTTTAAGCTGACCGAGA
TGGTGCAGTGGGCATACGACAACGATATCTG
CGAGGAAAGCGAGATTGCTTTCGAATATGCA
CAGAGGGGCGACTTTGATAGTAATGCCCGCG
CTTTCCTGAACTCAAATATGCAGGCTAAGTAC
GTGAAAGACTGCGCAATCATGTGTAGGCACT
ATAAGCATGCCGAGATGAAGAAAATGTCCAT
CAAGCAGTGGATCAAGTACCGCGGGACTAA
GGTGGATTCTGTCGGAAACTGGAAACCCATT
GTGCAGTTTCTGCGGCACCAGAATATCGAGT
TCATTCCTTTTCTGTCCAAGCTGAAACTGTGG
CTGCATGGCACACCAAAGAAAAACTGCATCG
CCATTGTGGGGCCACCCGACACTGGAAAGTC
TTGCTTTTGTATGAGTCTGATCAAATTCCTGG
GAGGCACAGTGATTTCTTATGTCAATAGTTGC
TCACACTTCTGGCTGCAGCCCCTGACTGACGC
AAAGGTGGCCCTGCTGGACGATGCAACCCAG
CCTTGTTGGACCTACATGGATACATATATGAG
AAACCTGCTGGACGGGAATCCCATGAGCATC
GATAGGAAGCACCGCGCTCTGACCCTGATCA
AGTGTCCTCCACTGCTGGTGACATCAAACATC
GATATTAGCAAGGAGGAAAAGTACAAATATC
TGCATAGCCGCGTGACAACTTTCACCTTTCCC
AACCCTTTCCCATTTGACCGAAACGGCAATGC
CGTCTACGAGCTGTCCGATGCTAATTGGAAA
TGCTTCTTTGAAAGGCTGAGCTCCTCTCTGGA
CATCGAGGATAGTGAAGACGAGGAAGATGG
AAGCAATTCCCAGGCCTTCCGATGTGTGCCT
GGCTCAGTGGTCCGGACACTG
MADDSGTENEGSGCTGWFMVEAI
VEHTTGTQISEDEEEEVEDSGYDMV
DFIDDRHITQNSVEAQALFNRQEAD
AHYATVQDLKRKYLGSPYVSPISNVA
NAVESEISPRLDAIKLTTQPKKVKRRL
FETRELTDSGYGYSEVEAATQVEKHG
DPENGGDGQERDTGRDIEGEGVEH
REAEAVDDSTREHADTSGILELLKCK
DIRSTLHGKFKDCFGLSFVDLIRPFKS
DRTTCADWVVAGFGIHHSIADAFQK
LIEPLSLYAHIQWLTNAWGMVLLVLI
RFKVNKSRCTVARTLGTLLNIPENHM
LIEPPKIQSGVRALYWFRTGISNASTV
IGEAPEWITRQTVIEHSLADSQFKLTE
MVQWAYDNDICEESEIAFEYAQRG
DFDSNARAFLNSNMQAKYVKDCAI
MCRHYKHAEMKKMSIKQWIKYRGT
KVDSVGNWKPIVQFLRHQNIEFIPFL
SKLKLWLHGTPKKNCIAIVGPPDTGK
SCFCMSLIKFLGGTVISYVNSCSHFW
LQPLTDAKVALLDDATQPCWTYMD
TYMRNLLDGNPMSIDRKHRALTLIK
CPPLLVTSNIDISKEEKYKYLHSRVTTF
TFPNPFPFDRNGNAVYELSDANWK
CFFERLSSSLDIEDSEDEEDGSNSQAF
RCVPGSVVRTL
Page 5 of 48
% Nucleotide
Change
25.7%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV11 E2
ATGGAAGCAATAGCCAAGCGTTTAGATGCGT
GCCAGGATCAGTTGTTAGAACTTTATGAAGA
AAACAGTATTGATATACACAAACACATTATGC
ATTGGAAATGCATACGATTGGAAAGTGTATT
ACTACACAAAGCAAAACAAATGGGCCTGAGC
CACATCGGGTTACAAGTAGTACCACCATTAAC
TGTGTCAGAGACTAAAGGACATAATGCTATT
GAAATGCAAATGCATTTAGAATCCTTAGCAA
AAACTCAGTATGGTGTGGAACCTTGGACATT
ACAGGACACCAGTTATGAAATGTGGCTAACA
CCACCCAAACGGTGCTTTAAAAAACAGGGAA
ATACTGTGGAGGTAAAATTTGATGGCTGTGA
AGACAATGTAATGGAGTATGTGGTATGGACA
CATATATACCTGCAGGACAACGACTCATGGG
TAAAAGTAACTAGTTCCGTAGATGCCAAGGG
CATATATTATACATGTGGACAATTTAAAACAT
ATTATGTAAATTTTAATAAAGAGGCACAAAA
GTATGGTAGTACCAATCATTGGGAAGTATGT
TATGGCAGCACAGTTATATGTTCTCCTGCATC
TGTATCTAGCACTGTACGAGAAGTATCCATTG
CTGAACCTACTACATACACCCCCGCACAGACC
ACCGCCCCTACAGTGTCCGCCTGCACCACGG
AAGACGGCGTGTCGGCGCCGCCTAGGAAGC
GAGCACGTGGACCGTCCACTAACAACACCCT
GTGTGTGGCCAACATCAGATCCGTGGACAGT
ACAATCAACAACATCGTCACTGACAATTACAA
CAAGCACCAAAGAAGGAACAACTGTCACAGT
GCAGCTACGCCTATAGTGCAACTGCAAGGTG
ATTCCAATTGTTTAAAATGTTTTAGATATAGA
CTGAATGACAAATATAAACATTTGTTTGAATT
AGCATCTTCAACGTGGCATTGGGCCTCACCT
GAGGCACCACATAAAAATGCAATTGTAACAT
TAACATATAGCAGTGAGGAACAACGTCAGCA
ATTTTTAAACAGTGTAAAAATACCACCCACCA
TTAGGCATAAGGTGGGGTTTATGTCATTACA
TTTATTGTAA
ATGGTAGTACCAATCATTGGGAAGTATGTTA
TGGCAGCACAGTTATATGTTCTCCTGCATCTG
TATCTAGCACTGTACGAGAAGTATCCATTGCT
GAACCTACTACATACACCCCCGCACAGACCAC
CGCCCCTACAGTGTCCGCCTGCACCACGGAA
GACGGCGTGTCGGCGCCGCCTAGGAAGCGA
GCACGTGGACCGTCCACTAACAACACCCTGT
GTGTGGCCAACATCAGATCCGTGGACAGTAC
AATCAACAACATCGTCACTGACAATTACAACA
AGCACCAAAGAAGGAACAACTGTCACAGTGC
AGCTACGCCTTAG
ATGGAGGTAGTGCCTGTACAAATTGCTGCAG
CAACAACTACAACATTGATATTGCCTGTTGTT
ATTGCATTTGCAGTATGTATTCTTAGTATTGT
ACTTATAATATTAATATCTGATTTTGTAGTATA
TACATCTGTGCTGGTACTAACACTTCTTTTATA
TTTGCTTTTGTGGCTTTTATTAACAACCCCTTT
GCAATTCTTTTTACTAACACTGTGTGTGTGCT
ATTTTCCTGCCTTTTATATACACATATACATTG
TGCAAACGCAACAATAA
ATGGTGATGTTAACCTGTCACTTAAATGATG
GTGATACATGGTTGTTTCTGTGGTTGTTTACT
GCATTTGTTGTAGCTGTACTTGGATTGTTGTT
ACTACATTACAGGGCTGTACATGGTACTGAA
AAAACTAAATGTGCTAAGTGTAAATCAAACC
GCAATACTACTGTGGATTATGTGTATATGTCA
CATGGTGATAATGGAGATTATGTGTACATGA
ACTAG
ATGGAAAGTAAAGATGCCTCCACGTCTGCAA
CATCTATAGACCAGTTGTGCAAGACGTTTAAT
CTTTCTTTGCACACTCTGCAAATTCAGTGCGT
GTTTTGCAGGAATGCACTGACCACCGCAGAG
ATATATGCATATGCCTATAAGAACCTAAAGGT
TGTGTGGCGAGACAACTTTCCCTTTGCAGCGT
GTGCCTGTTGCTTAGAACTGCAAGGGAAAAT
TAACCAATATAGACACTTTAATTATGCTGCAT
ATGCACCTACAGTAGAAGAAGAAACCAATGA
ATGGAAGCAATCGCAAAGAGACTGGATGCCT
GTCAGGATCAGCTGCTGGAACTGTATGAGGA
AAATAGTATCGACATTCACAAACATATCATGC
ACTGGAAGTGCATTCGGCTGGAGTCCGTGCT
GCTGCACAAGGCCAAACAGATGGGCCTGTCT
CATATCGGGCTGCAGGTGGTCCCCCCTCTGA
CAGTGTCTGAAACTAAGGGGCACAACGCAAT
TGAGATGCAGATGCATCTGGAAAGTCTGGCC
AAAACCCAGTACGGAGTGGAGCCTTGGACCC
TGCAGGACACATCTTATGAAATGTGGCTGAC
ACCACCCAAGCGGTGCTTCAAGAAACAGGGG
AACACTGTGGAGGTCAAATTTGACGGATGTG
AGGATAATGTGATGGAATACGTGGTCTGGAC
CCACATCTATCTGCAGGACAACGATAGCTGG
GTGAAGGTCACAAGCTCCGTGGATGCAAAAG
GAATCTACTATACTTGCGGCCAGTTCAAGACC
TACTATGTCAACTTTAATAAGGAGGCTCAGA
AATACGGCTCAACTAATCATTGGGAAGTGTG
CTATGGGAGCACCGTCATCTGTAGTCCAGCTT
CAGTGTCTAGTACCGTGCGAGAGGTCAGCAT
TGCAGAACCTACCACATACACCCCAGCACAG
ACTACCGCCCCCACAGTGTCTGCCTGCACAAC
TGAGGACGGAGTCAGTGCCCCTCCAAGGAAA
CGCGCTCGAGGCCCTTCCACTAACAATACCCT
GTGTGTGGCCAATATCAGGAGCGTCGACTCC
ACAATCAACAACATCGTGACTGATAACTACA
ATAAGCACCAGCGGAGAAACAATTGTCATAG
TGCCGCTACTCCCATTGTGCAGCTGCAGGGC
GACTCAAACTGCCTGAAATGTTTCCGGTACA
GACTGAATGATAAGTATAAACACCTGTTTGA
GCTGGCCTCAAGCACATGGCACTGGGCTTCC
CCAGAAGCACCCCATAAGAACGCTATCGTGA
CCCTGACATACTCCTCTGAGGAACAGAGGCA
GCAGTTCCTGAATAGCGTGAAGATCCCCCCT
ACCATTCGCCACAAAGTCGGGTTTATGTCCCT
GCATCTGCTG
ATGGTCGTCCCTATTATTGGAAAGTATGTGAT
GGCCGCTCAGCTGTATGTCCTGCTGCACCTGT
ATCTGGCTCTGTATGAGAAGTATCCACTGCTG
AACCTGCTGCACACCCCACCTCATCGACCACC
ACCTCTGCAGTGCCCACCAGCACCACGAAAG
ACAGCTTGTCGGAGAAGGCTGGGCAGCGAG
CACGTGGACAGACCTCTGACCACACCATGCG
TCTGGCCCACTTCTGATCCTTGGACCGTGCAG
AGTACTACCAGCTCCCTGACAATCACAACTTC
TACTAAAGAAGGGACCACAGTGACCGTCCAG
CTGCGGCTG
ATGGAGGTCGTGCCAGTCCAGATTGCTGCCG
CCACCACCACTACCCTGATCCTGCCAGTCGTC
ATTGCCTTCGCTGTGTGTATCCTGTCTATTGT
GCTGATCATTCTGATCAGCGACTTCGTGGTCT
ACACTTCCGTGCTGGTCCTGACCCTGCTGCTG
TATCTGCTGCTGTGGCTGCTGCTGACCACACC
CCTGCAGTTCTTTCTGCTGACACTGTGCGTGT
GTTACTTCCCTGCCTTTTACATCCACATCTACA
TCGTCCAGACCCAGCAG
ATGGTCATGCTGACTTGTCACCTGAACGATG
GAGATACCTGGCTGTTTCTGTGGCTGTTTACC
GCCTTTGTGGTCGCAGTGCTGGGCCTGCTGC
TGCTGCACTACCGGGCCGTGCATGGCACTGA
GAAGACCAAATGCGCTAAGTGTAAAAGCAAC
AGAAATACCACAGTGGACTACGTCTATATGT
CCCACGGCGACAACGGGGATTACGTCTATAT
GAAT
ATGGAGAGCAAGGACGCCTCAACATCCGCAA
CAAGCATCGACCAGCTGTGTAAAACTTTCAAC
CTGTCACTGCATACCCTGCAGATTCAGTGCGT
GTTCTGTAGGAACGCCCTGACCACAGCTGAA
ATCTACGCTTATGCATACAAGAACCTGAAAGT
GGTCTGGCGCGACAATTTCCCATTTGCCGCTT
GCGCATGCTGTCTGGAACTGCAGGGCAAGAT
TAACCAGTATCGACACTTCAATTATGCAGCCT
ACGCTCCCACAGTGGAGGAAGAGACCAATG
MEAIAKRLDACQDQLLELYEENSIDI
HKHIMHWKCIRLESVLLHKAKQMGL
SHIGLQVVPPLTVSETKGHNAIEMQ
MHLESLAKTQYGVEPWTLQDTSYE
MWLTPPKRCFKKQGNTVEVKFDGC
EDNVMEYVVWTHIYLQDNDSWVK
VTSSVDAKGIYYTCGQFKTYYVNFNK
EAQKYGSTNHWEVCYGSTVICSPAS
VSSTVREVSIAEPTTYTPAQTTAPTVS
ACTTEDGVSAPPRKRARGPSTNNTL
CVANIRSVDSTINNIVTDNYNKHQRR
NNCHSAATPIVQLQGDSNCLKCFRY
RLNDKYKHLFELASSTWHWASPEAP
HKNAIVTLTYSSEEQRQQFLNSVKIPP
TIRHKVGFMSLHLL
HPV11 E4
HPV11
E5A
HPV11
E5B
HPV11 E6
Page 6 of 48
% Nucleotide
Change
24.1%
MVVPIIGKYVMAAQLYVLLHLYLALY
EKYPLLNLLHTPPHRPPPLQCPPAPR
KTACRRRLGSEHVDRPLTTPCVWPT
SDPWTVQSTTSSLTITTSTKEGTTVTV
QLRL
26.1%
MEVVPVQIAAATTTTLILPVVIAFAVC
ILSIVLIILISDFVVYTSVLVLTLLLYLLL
WLLLTTPLQFFLLTLCVCYFPAFYIHIY
IVQTQQ
30.4%
MVMLTCHLNDGDTWLFLWLFTAFV
VAVLGLLLLHYRAVHGTEKTKCAKCK
SNRNTTVDYVYMSHGDNGDYVYM
N
24.9%
MESKDASTSATSIDQLCKTFNLSLHTL
QIQCVFCRNALTTAEIYAYAYKNLKV
VWRDNFPFAACACCLELQGKINQYR
HFNYAAYAPTVEEETNEDILKVLIRCY
LCHKPLCEIEKLKHILGKARFIKLNNQ
WKGRCLHCWTTCMEDLLP
24.8%
Gene
HPV11 E7
HPV11 L1
PaVE DNA Sequence
Optimized DNA Sequence
AGATATTTTAAAAGTGTTAATTCGTTGTTACC
TGTGTCACAAGCCGTTGTGTGAAATAGAAAA
ACTAAAGCACATATTGGGAAAGGCACGCTTC
ATAAAACTAAATAACCAGTGGAAGGGTCGTT
GCTTACACTGCTGGACAACATGCATGGAAGA
CTTGTTACCCTAA
AGGATATCCTGAAGGTCCTGATTAGATGCTA
CCTGTGTCACAAACCTCTGTGCGAAATCGAG
AAGCTGAAACATATTCTGGGCAAGGCCCGGT
TTATCAAACTGAACAATCAGTGGAAAGGGAG
ATGCCTGCATTGTTGGACTACCTGTATGGAG
GACCTGCTGCCC
ATGCATGGAAGACTTGTTACCCTAAAGGATA
TAGTACTAGACCTGCAGCCTCCTGACCCTGTA
GGGTTACATTGCTATGAGCAATTAGAAGACA
GCTCAGAAGATGAGGTGGACAAGGTGGACA
AACAAGACGCACAACCTTTAACACAACATTAC
CAAATACTGACCTGTTGCTGTGGATGTGACA
GCAACGTCCGACTGGTTGTGGAGTGCACAGA
CGGAGACATCAGACAACTACAAGACCTTTTG
CTGGGCACACTAAATATTGTGTGTCCCATCTG
CGCACCAAAACCATAA
ATGTGGCGGCCTAGCGACAGCACAGTATATG
TGCCTCCTCCCAACCCTGTATCCAAGGTTGTT
GCCACGGATGCGTATGTTAAACGCACCAACA
TATTTTATCATGCCAGCAGTTCTAGACTCCTT
GCTGTGGGACATCCATATTACTCTATCAAAAA
AGTTAACAAAACAGTTGTACCAAAGGTGTCT
GGATATCAATATAGAGTGTTTAAGGTAGTGT
TGCCAGATCCTAACAAGTTTGCATTACCTGAT
TCATCCCTGTTTGACCCCACTACACAGCGTTT
AGTATGGGCGTGCACAGGGTTGGAGGTAGG
CAGGGGTCAACCTTTAGGCGTTGGTGTTAGT
GGGCATCCATTGCTAAACAAATATGATGATG
TAGAAAATAGTGGTGGGTATGGTGGTAATCC
TGGTCAGGATAATAGGGTTAATGTAGGTATG
GATTATAAACAAACCCAGCTATGTATGGTGG
GCTGTGCTCCACCGTTAGGTGAACATTGGGG
TAAGGGTACACAATGTTCAAATACCTCTGTAC
AAAATGGTGACTGCCCCCCGTTGGAACTTATT
ACCAGTGTTATACAGGATGGGGACATGGTTG
ATACAGGCTTTGGTGCTATGAATTTTGCAGAC
TTACAAACCAATAAATCGGATGTTCCCCTTGA
TATTTGTGGAACTGTCTGCAAATATCCTGATT
ATTTGCAAATGGCTGCAGACCCTTATGGTGA
TAGGTTGTTTTTTTATTTGCGAAAGGAACAAA
TGTTTGCTAGACACTTTTTTAATAGGGCCGGT
ACTGTGGGGGAACCTGTGCCTGATGACCTGT
TGGTAAAAGGGGGTAATAACAGATCATCTGT
AGCTAGTAGTATTTATGTACATACACCTAGTG
GCTCATTGGTGTCTTCAGAGGCTCAATTATTT
AATAAACCATATTGGCTTCAAAAGGCTCAGG
GACATAACAATGGTATTTGCTGGGGAAACCA
CTTGTTTGTTACTGTGGTAGATACCACACGCA
GTACAAATATGACACTATGTGCATCTGTGTCT
AAATCTGCTACATACACTAATTCAGATTATAA
GGAATACATGCGCCATGTGGAGGAGTTTGAT
TTACAGTTTATTTTTCAATTGTGTAGCATTACA
TTATCTGCAGAAGTCATGGCCTATATACACAC
AATGAATCCTTCTGTTTTGGAGGACTGGAACT
TTGGTTTATCGCCTCCACCAAATGGTACACTG
GAGGATACTTATAGATATGTACAGTCACAGG
CCATTACCTGTCAGAAACCCACACCTGAAAAA
GAAAAACAGGATCCCTATAAGGATATGAGTT
TTTGGGAGGTTAACTTAAAAGAAAAGTTTTC
AAGTGAATTAGATCAGTTTCCCCTTGGACGTA
AGTTTTTATTGCAAAGTGGATATCGAGGACG
GACGTCTGCTCGTACAGGTATAAAGCGCCCA
GCTGTGTCTAAGCCCTCTACAGCCCCCAAACG
AAAACGTACCAAAACCAAAAAGTAA
ATGCACGGAAGACTGGTGACACTGAAAGAC
ATCGTCCTGGATCTGCAGCCCCCTGACCCCGT
CGGACTGCACTGCTATGAACAGCTGGAGGAC
AGCTCCGAGGACGAAGTGGATAAGGTCGAC
AAACAGGATGCCCAGCCACTGACCCAGCACT
ACCAGATCCTGACATGCTGTTGCGGCTGTGA
CTCTAACGTGCGGCTGGTGGTCGAATGCACT
GACGGCGATATTAGACAGCTGCAGGATCTGC
TGCTGGGGACCCTGAATATCGTCTGTCCCATT
TGCGCTCCCAAGCCT
ATGTGGCGGCCTTCTGATTCTACAGTCTATGT
GCCTCCTCCAAACCCCGTGAGCAAAGTCGTC
GCTACCGATGCCTACGTGAAAAGAACCAACA
TCTTCTACCACGCAAGCTCCTCTCGACTGCTG
GCCGTGGGCCATCCCTACTACAGTATTAAGA
AAGTCAACAAGACAGTGGTCCCTAAAGTGTC
AGGCTACCAGTATCGCGTCTTTAAGGTGGTC
CTGCCTGACCCAAACAAGTTCGCTCTGCCCGA
CAGTTCACTGTTTGATCCTACCACACAGCGAC
TGGTGTGGGCATGCACCGGACTGGAAGTCG
GAAGAGGACAGCCACTGGGAGTGGGCGTCT
CTGGACACCCACTGCTGAACAAGTACGACGA
TGTGGAGAATAGTGGAGGATATGGAGGAAA
CCCAGGACAGGACAACAGGGTGAATGTCGG
AATGGATTACAAGCAGACACAGCTGTGCATG
GTGGGATGTGCACCACCTCTGGGAGAACATT
GGGGGAAAGGAACTCAGTGCAGTAACACCT
CAGTGCAGAATGGAGACTGTCCACCCCTGGA
GCTGATCACCAGCGTGATTCAGGACGGCGAT
ATGGTCGACACAGGCTTCGGGGCAATGAATT
TTGCCGATCTGCAGACAAACAAGTCCGACGT
GCCTCTGGATATCTGCGGGACTGTCTGTAAA
TACCCTGATTATCTGCAGATGGCCGCTGACCC
ATACGGAGATCGCCTGTTCTTTTATCTGCGAA
AGGAACAGATGTTCGCTCGACACTTCTTTAAC
CGAGCAGGAACTGTGGGAGAGCCAGTCCCT
GACGATCTGCTGGTGAAAGGGGGAAACAAT
CGCAGCTCCGTGGCCTCTAGTATCTACGTCCA
TACACCAAGTGGCTCACTGGTGTCAAGCGAG
GCACAGCTGTTCAATAAGCCCTATTGGCTGCA
GAAAGCCCAGGGCCACAACAATGGGATTTGC
TGGGGAAACCATCTGTTTGTGACCGTGGTCG
ACACTACCAGGTCTACTAATATGACCCTGTGT
GCCAGCGTGTCCAAGTCTGCTACATACACTAA
CTCCGACTACAAAGAATATATGCGCCACGTG
GAGGAATTCGATCTGCAGTTCATCTTTCAGCT
GTGCTCTATTACTCTGAGTGCTGAAGTGATG
GCATATATCCATACCATGAATCCCTCCGTCCT
GGAGGACTGGAACTTTGGACTGTCTCCTCCA
CCCAATGGCACACTGGAGGATACTTACAGAT
ATGTGCAGAGCCAGGCCATTACATGTCAGAA
GCCAACTCCCGAGAAGGAAAAACAGGACCCT
TACAAAGATATGTCCTTCTGGGAAGTGAATCT
GAAGGAAAAATTTTCCTCTGAGCTGGATCAG
TTCCCACTGGGCAGAAAGTTTCTGCTGCAGTC
AGGGTATCGGGGAAGAACCAGCGCTAGAAC
AGGGATTAAGAGGCCCGCCGTGAGCAAACCT
TCCACCGCTCCAAAGAGGAAACGCACCAAGA
CAAAGAAA
Page 7 of 48
Protein Sequence
% Nucleotide
Change
MHGRLVTLKDIVLDLQPPDPVGLHC
YEQLEDSSEDEVDKVDKQDAQPLTQ
HYQILTCCCGCDSNVRLVVECTDGDI
RQLQDLLLGTLNIVCPICAPKP
25.9%
MWRPSDSTVYVPPPNPVSKVVATD
AYVKRTNIFYHASSSRLLAVGHPYYSI
KKVNKTVVPKVSGYQYRVFKVVLPD
PNKFALPDSSLFDPTTQRLVWACTG
LEVGRGQPLGVGVSGHPLLNKYDDV
ENSGGYGGNPGQDNRVNVGMDYK
QTQLCMVGCAPPLGEHWGKGTQC
SNTSVQNGDCPPLELITSVIQDGDM
VDTGFGAMNFADLQTNKSDVPLDIC
GTVCKYPDYLQMAADPYGDRLFFYL
RKEQMFARHFFNRAGTVGEPVPDD
LLVKGGNNRSSVASSIYVHTPSGSLV
SSEAQLFNKPYWLQKAQGHNNGIC
WGNHLFVTVVDTTRSTNMTLCASV
SKSATYTNSDYKEYMRHVEEFDLQFI
FQLCSITLSAEVMAYIHTMNPSVLED
WNFGLSPPPNGTLEDTYRYVQSQAI
TCQKPTPEKEKQDPYKDMSFWEVN
LKEKFSSELDQFPLGRKFLLQSGYRGR
TSARTGIKRPAVSKPSTAPKRKRTKTK
K
25.7%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV11 L2
ATGAAACCTAGGGCACGCAGACGTAAACGTG
CGTCAGCCACACAACTATATCAAACATGCAA
GGCCACTGGTACATGTCCCCCAGATGTAATTC
CTAAAGTTGAACATACTACTATTGCAGATCAA
ATATTAAAATGGGGAAGCTTAGGGGTTTTTT
TTGGTGGGTTAGGTATTGGTACAGGGGCTGG
TAGTGGCGGTCGTGCAGGGTATATACCCTTG
GGAAGCTCTCCCAAGCCTGCTATTACTGGGG
GGCCAGCAGCACGTCCGCCAGTGCTTGTGGA
GCCTGTTGCCCCTTCCGATCCCTCCATTGTGT
CCTTAATTGAGGAGTCTGCTATTATTAATGCT
GGTGCACCTGAGGTGGTACCCCCTACACAGG
GTGGCTTTACTATAACATCATCTGAATCGACT
ACACCTGCTATTTTAGATGTGTCTGTTACCAA
TCACACTACCACTAGTGTGTTTCAAAATCCCC
TGTTTACAGAACCGTCTGTAATACAGCCCCAA
CCACCTGTGGAGGCCAGTGGTCACATACTTA
TATCTGCCCCAACAATAACATCCCAACATGTA
GAAGACATTCCACTAGACACTTTTGTTGTATC
CTCTAGTGATAGTGGACCTACATCCAGTACTC
CTCTTCCTCGTGCTTTTCCTCGGCCTCGGGTG
GGTTTGTATAGTCGTGCCTTACAGCAGGTAC
AGGTTACGGACCCCGCGTTTTTGTCCACGCCA
CAGCGATTGGTAACTTATGACAACCCTGTCTA
TGAAGGAGAAGATGTAAGTTTACAATTTACC
CATGAGTCTATCCACAATGCACCTGATGAAG
CATTTATGGATATTATTAGACTACATAGACCA
GCTATAACGTCCAGACGGGGTCTTGTGCGTT
TTAGTCGCATTGGGCAACGGGGGTCCATGTA
CACACGCAGTGGACAACATATAGGTGCCCGC
ATACATTATTTTCAGGACATTTCACCAGTTAC
ACAAGCTGCAGAGGAAATAGAACTGCACCCT
CTAGTGGCTGCAGAAAATGACACGTTTGATA
TTTATGCTGAACCATTTGACCCTATCCCTGAC
CCTGTCCAACATTCTGTTACACAGTCTTATCTT
ACCTCCACACCTAATACCCTTTCACAATCGTG
GGGTAATACCACAGTCCCATTGTCAATCCCTA
GTGACTGGTTTGTGCAGTCTGGGCCTGACAT
AACTTTTCCTACTGCATCTATGGGAACACCCT
TTAGTCCTGTAACTCCTGCTTTACCTACAGGC
CCTGTTTTTATTACAGGTTCTGACTTCTATTTG
CATCCTACATGGTACTTTGCACGCAGACGCCG
TAAACGTATTCCCTTATTTTTTACAGATGTGG
CGGCCTAG
ATGAAACCAAGAGCAAGAAGAAGAAAAAGA
GCATCCGCAACTCAGCTGTATCAGACCTGTAA
AGCCACTGGAACCTGTCCCCCCGACGTGATC
CCCAAGGTCGAGCACACCACAATCGCCGACC
AGATTCTGAAATGGGGCTCTCTGGGGGTGTT
CTTTGGCGGGCTGGGAATCGGAACCGGAGC
AGGAAGCGGAGGACGAGCTGGATACATCCC
ACTGGGAAGCTCCCCAAAGCCTGCTATTACA
GGAGGACCTGCAGCTAGACCACCTGTGCTGG
TCGAACCCGTCGCACCTTCCGACCCATCTATC
GTGAGTCTGATTGAGGAAAGCGCCATCATTA
ACGCAGGAGCACCAGAGGTGGTCCCACCAAC
CCAGGGCGGGTTTACCATCACATCTAGTGAA
TCCACTACCCCTGCCATTCTGGATGTCAGTGT
CACCAACCATACAACTACCTCAGTGTTCCAGA
ATCCTCTGTTTACAGAGCCATCCGTCATCCAG
CCCCAGCCTCCAGTGGAAGCATCCGGCCACA
TCCTGATTTCTGCCCCAACTATCACCAGTCAG
CATGTGGAGGACATTCCCCTGGATACCTTTGT
GGTCTCAAGCTCCGACAGCGGCCCTACCTCT
AGTACACCACTGCCCCGAGCCTTCCCTAGACC
AAGGGTGGGGCTGTATTCTCGCGCTCTGCAG
CAGGTGCAGGTCACTGATCCCGCATTCCTGA
GTACCCCTCAGAGGCTGGTGACATACGACAA
CCCCGTCTATGAGGGAGAAGATGTGTCCCTG
CAGTTTACCCACGAGTCTATCCATAATGCTCC
AGACGAAGCATTCATGGATATCATTCGCCTG
CACCGACCCGCTATCACAAGCCGGAGAGGCC
TGGTGCGGTTTTCCAGAATTGGACAGAGGGG
CTCAATGTACACTCGCAGCGGGCAGCACATC
GGAGCACGCATTCATTATTTCCAGGATATCAG
CCCTGTGACTCAGGCAGCCGAGGAAATTGAG
CTGCACCCACTGGTGGCTGCAGAAAATGACA
CCTTCGATATCTACGCCGAGCCATTTGACCCC
ATTCCTGATCCAGTGCAGCATTCCGTCACACA
GTCTTATCTGACAAGTACTCCCAACACTCTGT
CACAGAGCTGGGGCAATACAACTGTCCCACT
GTCAATCCCCAGCGACTGGTTCGTGCAGTCT
GGGCCTGATATTACTTTTCCAACCGCCTCAAT
GGGAACACCCTTCAGCCCTGTCACACCAGCTC
TGCCCACTGGACCTGTGTTCATCACTGGCAGC
GACTTTTACCTGCACCCTACCTGGTATTTCGC
CAGGCGCCGACGGAAAAGGATTCCACTGTTC
TTTACCGATGTGGCCGCT
MKPRARRRKRASATQLYQTCKATGT
CPPDVIPKVEHTTIADQILKWGSLGV
FFGGLGIGTGAGSGGRAGYIPLGSSP
KPAITGGPAARPPVLVEPVAPSDPSI
VSLIEESAIINAGAPEVVPPTQGGFTI
TSSESTTPAILDVSVTNHTTTSVFQNP
LFTEPSVIQPQPPVEASGHILISAPTIT
SQHVEDIPLDTFVVSSSDSGPTSSTPL
PRAFPRPRVGLYSRALQQVQVTDPA
FLSTPQRLVTYDNPVYEGEDVSLQFT
HESIHNAPDEAFMDIIRLHRPAITSRR
GLVRFSRIGQRGSMYTRSGQHIGARI
HYFQDISPVTQAAEEIELHPLVAAEN
DTFDIYAEPFDPIPDPVQHSVTQSYL
TSTPNTLSQSWGNTTVPLSIPSDWF
VQSGPDITFPTASMGTPFSPVTPALP
TGPVFITGSDFYLHPTWYFARRRRKR
IPLFFTDVAA
Page 8 of 48
% Nucleotide
Change
26.6%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV16 E1
ATGGCTGATCCTGCAGGTACCAATGGGGAAG
AGGGTACGGGATGTAATGGATGGTTTTATGT
AGAGGCTGTAGTGGAAAAAAAAACAGGGGA
TGCTATATCAGATGACGAGAACGAAAATGAC
AGTGATACAGGTGAAGATTTGGTAGATTTTA
TAGTAAATGATAATGATTATTTAACACAGGCA
GAAACAGAGACAGCACATGCGTTGTTTACTG
CACAGGAAGCAAAACAACATAGAGATGCAG
TACAGGTTCTAAAACGAAAGTATTTGGGTAG
TCCACTTAGTGATATTAGTGGATGTGTAGAC
AATAATATTAGTCCTAGATTAAAAGCTATATG
TATAGAAAAACAAAGTAGAGCTGCAAAAAG
GAGATTATTTGAAAGCGAAGACAGCGGGTAT
GGCAATACTGAAGTGGAAACTCAGCAGATGT
TACAGGTAGAAGGGCGCCATGAGACTGAAA
CACCATGTAGTCAGTATAGTGGTGGAAGTGG
GGGTGGTTGCAGTCAGTACAGTAGTGGAAG
TGGGGGAGAGGGTGTTAGTGAAAGACACAC
TATATGCCAAACACCACTTACAAATATTTTAA
ATGTACTAAAAACTAGTAATGCAAAGGCAGC
AATGTTAGCAAAATTTAAAGAGTTATACGGG
GTGAGTTTTTCAGAATTAGTAAGACCATTTAA
AAGTAATAAATCAACGTGTTGCGATTGGTGT
ATTGCTGCATTTGGACTTACACCCAGTATAGC
TGACAGTATAAAAACACTATTACAACAATATT
GTTTATATTTACACATTCAAAGTTTAGCATGT
TCATGGGGAATGGTTGTGTTACTATTAGTAA
GATATAAATGTGGAAAAAATAGAGAAACAAT
TGAAAAATTGCTGTCTAAACTATTATGTGTGT
CTCCAATGTGTATGATGATAGAGCCTCCAAA
ATTGCGTAGTACAGCAGCAGCATTATATTGG
TATAAAACAGGTATATCAAATATTAGTGAAG
TGTATGGAGACACGCCAGAATGGATACAAAG
ACAAACAGTATTACAACATAGTTTTAATGATT
GTACATTTGAATTATCACAGATGGTACAATG
GGCCTACGATAATGACATAGTAGACGATAGT
GAAATTGCATATAAATATGCACAATTGGCAG
ACACTAATAGTAATGCAAGTGCCTTTCTAAAA
AGTAATTCACAGGCAAAAATTGTAAAGGATT
GTGCAACAATGTGTAGACATTATAAACGAGC
AGAAAAAAAACAAATGAGTATGAGTCAATG
GATAAAATATAGATGTGATAGGGTAGATGAT
GGAGGTGATTGGAAGCAAATTGTTATGTTTT
TAAGGTATCAAGGTGTAGAGTTTATGTCATTT
TTAACTGCATTAAAAAGATTTTTGCAAGGCAT
ACCTAAAAAAAATTGCATATTACTATATGGTG
CAGCTAACACAGGTAAATCATTATTTGGTATG
AGTTTAATGAAATTTCTGCAAGGGTCTGTAAT
ATGTTTTGTAAATTCTAAAAGCCATTTTTGGT
TACAACCATTAGCAGATGCCAAAATAGGTAT
GTTAGATGATGCTACAGTGCCCTGTTGGAAC
TACATAGATGACAATTTAAGAAATGCATTGG
ATGGAAATTTAGTTTCTATGGATGTAAAGCAT
AGACCATTGGTACAACTAAAATGCCCTCCATT
ATTAATTACATCTAACATTAATGCTGGTACAG
ATTCTAGGTGGCCTTATTTACATAATAGATTG
GTGGTGTTTACATTTCCTAATGAGTTTCCATTT
GACGAAAACGGAAATCCAGTGTATGAGCTTA
ATGATAAGAACTGGAAATCCTTTTTCTCAAGG
ACGTGGTCCAGATTAAGTTTGCACGAGGACG
AGGACAAGGAAAACGATGGAGACTCTTTGCC
AACGTTTAAATGTGTGTCAGGACAAAATACT
AACACATTATGA
ATGGCAGACCCCGCTGGGACTAACGGAGAA
GAAGGCACTGGCTGTAATGGCTGGTTTTATG
TGGAGGCTGTGGTGGAGAAGAAGACAGGCG
ACGCCATCTCAGACGATGAGAACGAAAATGA
CAGCGATACCGGGGAGGACCTGGTGGATTTC
ATTGTCAACGACAATGATTACCTGACCCAGG
CAGAGACCGAAACAGCACACGCCCTGTTTAC
AGCACAGGAAGCCAAGCAGCATAGGGATGC
CGTGCAGGTCCTGAAGCGCAAATATCTGGGG
AGCCCCCTGTCCGACATCTCTGGATGCGTGG
ATAACAATATTAGCCCTCGACTGAAGGCCATC
TGTATTGAGAAACAGAGCCGGGCCGCTAAGC
GGAGACTGTTCGAGAGTGAAGACTCAGGCTA
CGGGAACACTGAGGTGGAAACCCAGCAGAT
GCTCCAGGTCGAGGGCAGGCACGAGACTGA
AACCCCATGCAGCCAGTACTCCGGAGGGTCT
GGAGGAGGGTGTTCACAGTATAGCTCCGGA
AGCGGAGGAGAGGGCGTGTCCGAACGCCAT
ACTATCTGCCAGACCCCCCTGACAAACATTCT
GAATGTCCTGAAGACCAGCAACGCCAAAGCA
GCCATGCTGGCTAAGTTCAAAGAGCTGTACG
GGGTGTCTTTCAGTGAACTGGTCCGGCCTTTT
AAGAGTAACAAGAGCACCTGCTGTGACTGGT
GTATCGCTGCATTTGGCCTGACTCCAAGTATC
GCTGATTCAATTAAGACCCTGCTCCAGCAGTA
CTGCCTGTATCTGCACATTCAGAGCCTGGCCT
GTTCCTGGGGGATGGTGGTCCTGCTGCTGGT
GCGCTATAAGTGCGGAAAAAACCGAGAGAC
TATCGAAAAGCTGCTGTCTAAACTGCTGTGC
GTGAGTCCTATGTGTATGATGATTGAGCCCC
CTAAACTGCGGAGCACAGCCGCTGCACTGTA
CTGGTATAAGACTGGCATCAGCAATATTTCCG
AGGTGTACGGGGACACCCCAGAATGGATTCA
GAGACAGACAGTCCTCCAGCACTCCTTCAAC
GATTGTACCTTTGAGCTGTCTCAGATGGTGCA
GTGGGCTTATGACAATGATATCGTGGACGAT
TCCGAAATTGCATACAAATATGCTCAGCTGGC
AGACACCAACTCTAATGCTAGTGCATTCCTGA
AGTCAAACAGCCAGGCAAAGATCGTGAAAG
ATTGCGCCACAATGTGCCGGCACTACAAGCG
GGCTGAGAAGAAACAGATGTCCATGTCTCAG
TGGATCAAATATAGGTGCGACCGCGTGGACG
ATGGGGGAGATTGGAAGCAGATTGTGATGT
TCCTGAGATACCAGGGAGTCGAGTTCATGTC
CTTTCTGACTGCCCTGAAGCGGTTCCTCCAGG
GCATCCCCAAGAAAAACTGCATTCTGCTGTAT
GGGGCCGCTAATACCGGAAAATCTCTGTTCG
GCATGAGTCTGATGAAGTTTCTCCAGGGGTC
TGTGATCTGTTTCGTCAATAGTAAATCACACT
TTTGGCTCCAGCCACTGGCCGACGCTAAGAT
CGGAATGCTGGACGATGCCACCGTGCCCTGC
TGGAACTACATTGACGATAACCTGCGCAATG
CTCTGGACGGCAATCTGGTGAGCATGGATGT
CAAACACCGACCCCTGGTGCAGCTGAAGTGT
CCACCCCTGCTGATCACATCCAACATTAATGC
CGGCACTGACTCTCGGTGGCCCTACCTGCAT
AACAGACTGGTGGTCTTCACATTTCCTAATGA
GTTCCCATTTGACGAAAACGGCAATCCTGTGT
ATGAGCTGAACGATAAGAACTGGAAATCATT
CTTTAGCAGAACATGGTCCAGGCTGTCTCTGC
ATGAGGACGAAGATAAAGAAAACGACGGAG
ATAGTCTGCCTACTTTTAAGTGCGTGAGCGG
CCAGAACACAAATACTCTG
MADPAGTNGEEGTGCNGWFYVEA
VVEKKTGDAISDDENENDSDTGEDL
VDFIVNDNDYLTQAETETAHALFTA
QEAKQHRDAVQVLKRKYLGSPLSDIS
GCVDNNISPRLKAICIEKQSRAAKRRL
FESEDSGYGNTEVETQQMLQVEGR
HETETPCSQYSGGSGGGCSQYSSGS
GGEGVSERHTICQTPLTNILNVLKTS
NAKAAMLAKFKELYGVSFSELVRPFK
SNKSTCCDWCIAAFGLTPSIADSIKTL
LQQYCLYLHIQSLACSWGMVVLLLV
RYKCGKNRETIEKLLSKLLCVSPMCM
MIEPPKLRSTAAALYWYKTGISNISEV
YGDTPEWIQRQTVLQHSFNDCTFEL
SQMVQWAYDNDIVDDSEIAYKYAQ
LADTNSNASAFLKSNSQAKIVKDCAT
MCRHYKRAEKKQMSMSQWIKYRC
DRVDDGGDWKQIVMFLRYQGVEF
MSFLTALKRFLQGIPKKNCILLYGAA
NTGKSLFGMSLMKFLQGSVICFVNS
KSHFWLQPLADAKIGMLDDATVPC
WNYIDDNLRNALDGNLVSMDVKHR
PLVQLKCPPLLITSNINAGTDSRWPY
LHNRLVVFTFPNEFPFDENGNPVYEL
NDKNWKSFFSRTWSRLSLHEDEDKE
NDGDSLPTFKCVSGQNTNTL
Page 9 of 48
% Nucleotide
Change
26.6%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV16 E2
ATGGAGACTCTTTGCCAACGTTTAAATGTGTG
TCAGGACAAAATACTAACACATTATGAAAAT
GATAGTACAGACCTACGTGACCATATAGACT
ATTGGAAACACATGCGCCTAGAATGTGCTAT
TTATTACAAGGCCAGAGAAATGGGATTTAAA
CATATTAACCACCAGGTGGTGCCAACACTGG
CTGTATCAAAGAATAAAGCATTACAAGCAAT
TGAACTGCAACTAACGTTAGAAACAATATAT
AACTCACAATATAGTAATGAAAAGTGGACAT
TACAAGACGTTAGCCTTGAAGTGTATTTAACT
GCACCAACAGGATGTATAAAAAAACATGGAT
ATACAGTGGAAGTGCAGTTTGATGGAGACAT
ATGCAATACAATGCATTATACAAACTGGACA
CATATATATATTTGTGAAGAAGCATCAGTAAC
TGTGGTAGAGGGTCAAGTTGACTATTATGGT
TTATATTATGTTCATGAAGGAATACGAACATA
TTTTGTGCAGTTTAAAGATGATGCAGAAAAA
TATAGTAAAAATAAAGTATGGGAAGTTCATG
CGGGTGGTCAGGTAATATTATGTCCTACATCT
GTGTTTAGCAGCAACGAAGTATCCTCTCCTGA
AATTATTAGGCAGCACTTGGCCAACCACCCC
GCCGCGACCCATACCAAAGCCGTCGCCTTGG
GCACCGAAGAAACACAGACGACTATCCAGCG
ACCAAGATCAGAGCCAGACACCGGAAACCCC
TGCCACACCACTAAGTTGTTGCACAGAGACTC
AGTGGACAGTGCTCCAATCCTCACTGCATTTA
ACAGCTCACACAAAGGACGGATTAACTGTAA
TAGTAACACTACACCCATAGTACATTTAAAAG
GTGATGCTAATACTTTAAAATGTTTAAGATAT
AGATTTAAAAAGCATTGTACATTGTATACTGC
AGTGTCGTCTACATGGCATTGGACAGGACAT
AATGTAAAACATAAAAGTGCAATTGTTACACT
TACATATGATAGTGAATGGCAACGTGACCAA
TTTTTGTCTCAAGTTAAAATACCAAAAACTAT
TACAGTGTCTACTGGATTTATGTCTATATGA
METLCQRLNVCQDKILTHYENDSTD
LRDHIDYWKHMRLECAIYYKAREMG
FKHINHQVVPTLAVSKNKALQAIELQ
LTLETIYNSQYSNEKWTLQDVSLEVY
LTAPTGCIKKHGYTVEVQFDGDICNT
MHYTNWTHIYICEEASVTVVEGQVD
YYGLYYVHEGIRTYFVQFKDDAEKYS
KNKVWEVHAGGQVILCPTSVFSSNE
VSSPEIIRQHLANHPAATHTKAVALG
TEETQTTIQRPRSEPDTGNPCHTTKL
LHRDSVDSAPILTAFNSSHKGRINCN
SNTTPIVHLKGDANTLKCLRYRFKKH
CTLYTAVSSTWHWTGHNVKHKSAIV
TLTYDSEWQRDQFLSQVKIPKTITVS
TGFMSI
HPV16 E4
TATTATGTCCTACATCTGTGTTTAGCAGCAAC
GAAGTATCCTCTCCTGAAATTATTAGGCAGCA
CTTGGCCAACCACCCCGCCGCGACCCATACCA
AAGCCGTCGCCTTGGGCACCGAAGAAACACA
GACGACTATCCAGCGACCAAGATCAGAGCCA
GACACCGGAAACCCCTGCCACACCACTAAGT
TGTTGCACAGAGACTCAGTGGACAGTGCTCC
AATCCTCACTGCATTTAACAGCTCACACAAAG
GACGGATTAACTGTAATAGTAACACTACACC
CATAG
ATGACAAATCTTGATACTGCATCCACAACATT
ACTGGCGTGCTTTTTGCTTTGCTTTTGTGTGCT
TTTGTGTGTCTGCCTATTAATACGTCCGCTGC
TTTTGTCTGTGTCTACATACACATCATTAATAA
TATTGGTATTACTATTGTGGATAACAGCAGCC
TCTGCGTTTAGGTGTTTTATTGTATATATTATA
TTTGTTTATATACCATTATTTTTAATACATACA
CATGCACGCTTTTTAATTACATAA
ATGCACCAAAAGAGAACTGCAATGTTTCAGG
ACCCACAGGAGCGACCCAGAAAGTTACCACA
GTTATGCACAGAGCTGCAAACAACTATACAT
GATATAATATTAGAATGTGTGTACTGCAAGC
AACAGTTACTGCGACGTGAGGTATATGACTT
TGCTTTTCGGGATTTATGCATAGTATATAGAG
ATGGGAATCCATATGCTGTATGTGATAAATG
TTTAAAGTTTTATTCTAAAATTAGTGAGTATA
GACATTATTGTTATAGTTTGTATGGAACAACA
TTAGAACAGCAATACAACAAACCGTTGTGTG
ATTTGTTAATTAGGTGTATTAACTGTCAAAAG
CCACTGTGTCCTGAAGAAAAGCAAAGACATC
TGGACAAAAAGCAAAGATTCCATAATATAAG
GGGTCGGTGGACCGGTCGATGTATGTCTTGT
TGCAGATCATCAAGAACACGTAGAGAAACCC
AGCTGTAA
ATGGAGACTCTGTGCCAGCGGCTGAACGTGT
GCCAGGATAAGATTCTGACTCACTACGAAAA
TGACTCAACCGACCTGCGGGACCACATCGAC
TACTGGAAGCACATGCGACTGGAGTGCGCCA
TCTACTATAAGGCTCGGGAAATGGGCTTCAA
ACACATCAATCATCAGGTGGTCCCCACCCTGG
CCGTGAGCAAGAACAAGGCCCTCCAGGCAAT
CGAGCTGCAACTGACCCTGGAAACAATCTAC
AATAGTCAGTATTCAAACGAGAAGTGGACAC
TCCAGGACGTGAGCCTGGAAGTCTACCTGAC
TGCACCTACCGGATGTATTAAGAAACACGGC
TATACCGTGGAGGTCCAGTTTGACGGCGATA
TCTGCAATACAATGCATTACACAAACTGGACT
CACATCTATATTTGTGAGGAAGCTAGCGTGA
CTGTGGTCGAGGGGCAGGTCGATTACTATGG
ACTGTACTATGTGCATGAAGGGATTCGCACC
TACTTCGTGCAGTTTAAGGACGATGCTGAGA
AATATTCTAAGAACAAGGTCTGGGAAGTCCA
CGCAGGAGGACAGGTCATCCTGTGCCCTACC
AGTGTGTTCAGCTCCAATGAGGTCTCTAGTCC
AGAAATCATTCGACAGCACCTGGCCAACCAT
CCCGCCGCTACCCACACAAAGGCAGTGGCCC
TGGGAACCGAGGAAACACAGACCACAATTCA
GCGGCCCAGATCCGAGCCTGACACAGGCAAT
CCTTGCCATACTACCAAGCTGCTGCACAGAG
ACAGCGTGGATTCCGCACCAATCCTGACTGC
CTTCAACTCAAGCCATAAAGGCAGGATCAAC
TGTAATTCTAACACAACTCCAATTGTCCACCT
GAAGGGGGATGCCAATACCCTGAAATGCCTG
CGGTACAGATTCAAGAAACACTGTACTCTGT
ATACCGCCGTGTCCTCTACATGGCACTGGACT
GGGCATAACGTGAAGCACAAATCAGCTATCG
TCACTCTGACCTACGACAGCGAGTGGCAGAG
GGATCAGTTCCTGTCCCAGGTGAAGATCCCC
AAAACAATTACTGTCTCTACAGGCTTCATGAG
TATC
ATGTATGTGCTGCATCTGTGCCTGGCTGCTAC
CAAGTACCCCCTGCTGAAACTGCTGGGATCA
ACCTGGCCTACCACCCCCCCTCGGCCCATCCC
TAAGCCATCTCCCTGGGCCCCTAAGAAACACC
GGCGGCTGAGCAGCGACCAGGATCAGTCAC
AGACTCCTGAGACCCCAGCTACACCCCTGAG
CTGCTGTACCGAAACACAGTGGACAGTGCTC
CAGAGCAGCCTGCACCTGACTGCCCATACCA
AAGACGGCCTGACAGTGATTGTCACTCTGCA
TCCC
ATGACCAACCTGGATACTGCTTCTACTACCCT
GCTGGCTTGTTTCCTGCTGTGTTTCTGTGTCCT
GCTGTGCGTGTGCCTGCTGATTAGGCCCCTG
CTGCTGAGCGTGTCCACCTACACATCTCTGAT
CATTCTGGTCCTGCTGCTGTGGATCACAGCCG
CTAGCGCATTCCGGTGCTTCATCGTGTACATC
ATCTTCGTCTACATCCCTCTGTTTCTGATTCAC
ACTCATGCCAGATTCCTGATCACC
ATGCACCAGAAGAGAACCGCCATGTTTCAGG
ACCCTCAGGAACGACCTCGCAAACTGCCCCA
GCTGTGTACCGAACTGCAGACAACTATCCAC
GACATCATTCTGGAGTGCGTGTACTGTAAGC
AGCAGCTGCTGCGGAGAGAAGTCTATGACTT
CGCCTTTCGCGATCTGTGCATCGTGTACCGAG
ACGGAAACCCCTACGCCGTCTGCGATAAGTG
TCTGAAGTTCTACTCTAAGATTAGTGAGTATC
GGCATTACTGTTATAGCCTGTACGGCACCACA
CTGGAACAGCAGTATAACAAACCCCTGTGCG
ACCTGCTGATCAGATGCATTAATTGTCAGAA
GCCCCTGTGTCCTGAGGAAAAACAGAGGCAC
CTGGATAAGAAACAGCGCTTTCATAATATTCG
AGGCCGGTGGACAGGGAGGTGCATGTCTTG
CTGTAGAAGCTCCAGGACTAGGCGCGAGACC
CAGCTG
HPV16 E5
HPV16 E6
Page 10 of 48
% Nucleotide
Change
25.9%
YYVLHLCLAATKYPLLKLLGSTWPTTP
PRPIPKPSPWAPKKHRRLSSDQDQS
QTPETPATPLSCCTETQWTVLQSSLH
LTAHTKDGLTVIVTLHP
29.4%
MTNLDTASTTLLACFLLCFCVLLCVCL
LIRPLLLSVSTYTSLIILVLLLWITAASAF
RCFIVYIIFVYIPLFLIHTHARFLIT
35.0%
MHQKRTAMFQDPQERPRKLPQLCT
ELQTTIHDIILECVYCKQQLLRREVYD
FAFRDLCIVYRDGNPYAVCDKCLKFY
SKISEYRHYCYSLYGTTLEQQYNKPLC
DLLIRCINCQKPLCPEEKQRHLDKKQ
RFHNIRGRWTGRCMSCCRSSRTRRE
TQL
25.4%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV16 E7
ATGCATGGAGATACACCTACATTGCATGAAT
ATATGTTAGATTTGCAACCAGAGACAACTGA
TCTCTACTGTTATGAGCAATTAAATGACAGCT
CAGAGGAGGAGGATGAAATAGATGGTCCAG
CTGGACAAGCAGAACCGGACAGAGCCCATTA
CAATATTGTAACCTTTTGTTGCAAGTGTGACT
CTACGCTTCGGTTGTGCGTACAAAGCACACA
CGTAGACATTCGTACTTTGGAAGACCTGTTAA
TGGGCACACTAGGAATTGTGTGCCCCATCTG
TTCTCAGAAACCATAA
ATGTCTCTTTGGCTGCCTAGTGAGGCCACTGT
CTACTTGCCTCCTGTCCCAGTATCTAAGGTTG
TAAGCACGGATGAATATGTTGCACGCACAAA
CATATATTATCATGCAGGAACATCCAGACTAC
TTGCAGTTGGACATCCCTATTTTCCTATTAAA
AAACCTAACAATAACAAAATATTAGTTCCTAA
AGTATCAGGATTACAATACAGGGTATTTAGA
ATACATTTACCTGACCCCAATAAGTTTGGTTT
TCCTGACACCTCATTTTATAATCCAGATACAC
AGCGGCTGGTTTGGGCCTGTGTAGGTGTTGA
GGTAGGTCGTGGTCAGCCATTAGGTGTGGGC
ATTAGTGGCCATCCTTTATTAAATAAATTGGA
TGACACAGAAAATGCTAGTGCTTATGCAGCA
AATGCAGGTGTGGATAATAGAGAATGTATAT
CTATGGATTACAAACAAACACAATTGTGTTTA
ATTGGTTGCAAACCACCTATAGGGGAACACT
GGGGCAAAGGATCCCCATGTACCAATGTTGC
AGTAAATCCAGGTGATTGTCCACCATTAGAG
TTAATAAACACAGTTATTCAGGATGGTGATAT
GGTTGATACTGGCTTTGGTGCTATGGACTTTA
CTACATTACAGGCTAACAAAAGTGAAGTTCC
ACTGGATATTTGTACATCTATTTGCAAATATC
CAGATTATATTAAAATGGTGTCAGAACCATAT
GGCGACAGCTTATTTTTTTATTTACGAAGGGA
ACAAATGTTTGTTAGACATTTATTTAATAGGG
CTGGTACTGTTGGTGAAAATGTACCAGACGA
TTTATACATTAAAGGCTCTGGGTCTACTGCAA
ATTTAGCCAGTTCAAATTATTTTCCTACACCTA
GTGGTTCTATGGTTACCTCTGATGCCCAAATA
TTCAATAAACCTTATTGGTTACAACGAGCACA
GGGCCACAATAATGGCATTTGTTGGGGTAAC
CAACTATTTGTTACTGTTGTTGATACTACACG
CAGTACAAATATGTCATTATGTGCTGCCATAT
CTACTTCAGAAACTACATATAAAAATACTAAC
TTTAAGGAGTACCTACGACATGGGGAGGAAT
ATGATTTACAGTTTATTTTTCAACTGTGCAAA
ATAACCTTAACTGCAGACGTTATGACATACAT
ACATTCTATGAATTCCACTATTTTGGAGGACT
GGAATTTTGGTCTACAACCTCCCCCAGGAGG
CACACTAGAAGATACTTATAGGTTTGTAACAT
CCCAGGCAATTGCTTGTCAAAAACATACACCT
CCAGCACCTAAAGAAGATCCCCTTAAAAAAT
ACACTTTTTGGGAAGTAAATTTAAAGGAAAA
GTTTTCTGCAGACCTAGATCAGTTTCCTTTAG
GACGCAAATTTTTACTACAAGCAGGATTGAA
GGCCAAACCAAAATTTACATTAGGAAAACGA
AAAGCTACACCCACCACCTCATCTACCTCTAC
AACTGCTAAACGCAAAAAACGTAAGCTGTAA
ATGCACGGCGACACTCCTACTCTGCACGAAT
ACATGCTGGACCTGCAGCCCGAAACTACTGA
CCTGTACTGCTACGAACAGCTGAATGACAGC
TCCGAGGAAGAGGACGAAATCGATGGACCT
GCCGGCCAGGCTGAGCCTGACAGGGCCCACT
ACAACATTGTGACTTTCTGCTGTAAGTGCGAT
TCTACCCTGCGGCTGTGTGTGCAGAGTACCC
ATGTGGACATCAGAACCCTGGAGGACCTGCT
GATGGGAACACTGGGCATCGTCTGCCCAATT
TGTTCCCAGAAACCC
ATGCAGGTCACTTTTATCTATATCCTGGTCAT
TACTTGCTACGAGAACGATGTCAACGTCTATC
ATATTTTCTTTCAGATGTCCCTGTGGCTGCCA
TCCGAGGCAACCGTCTACCTGCCCCCTGTGCC
CGTCTCTAAAGTGGTCAGTACAGATGAATAT
GTGGCCCGCACTAACATCTACTATCACGCCG
GGACATCTCGACTGCTGGCTGTCGGACATCC
CTACTTCCCTATCAAGAAACCCAACAACAACA
AAATTCTGGTCCCTAAGGTGAGTGGCCTCCA
GTATAGGGTGTTCCGCATTCACCTGCCAGATC
CCAATAAGTTCGGGTTTCCTGACACCAGCTTT
TACAACCCAGATACACAGCGACTGGTCTGGG
CATGCGTGGGAGTCGAAGTGGGAAGAGGAC
AGCCTCTGGGAGTGGGAATCAGCGGACATCC
ACTGCTGAACAAGCTGGACGATACCGAGAAC
GCTTCCGCATACGCCGCTAATGCTGGCGTGG
ACAACCGGGAATGTATTTCTATGGATTATAA
GCAGACACAGCTGTGCCTGATCGGATGTAAA
CCACCCATTGGAGAGCACTGGGGCAAGGGG
TCCCCATGCACTAATGTCGCCGTGAACCCCG
GCGACTGTCCTCCACTGGAACTGATCAATACC
GTCATTCAGGACGGAGATATGGTGCATACAG
GATTCGGCGCAATGGATTTTACCACACTCCAG
GCCAACAAGAGTGAGGTGCCCCTGGACATCT
GCACCTCAATTTGTAAGTACCCCGATTACATC
AAGATGGTGTCCGAGCCTTACGGGGACTCTC
TGTTCTTTTATCTGCGGAGAGAACAGATGTTC
GTGAGACACCTGTTTAATAGGGCAGGCACTG
TCGGAGAAAACGTGCCAGACGATCTGTACAT
CAAGGGGTCAGGAAGCACCGCAAATCTGGC
CAGCTCCAACTATTTCCCTACTCCATCCGGCT
CTATGGTGACCTCTGACGCCCAGATTTTCAAC
AAGCCTTACTGGCTCCAGCGGGCCCAGGGAC
ATAATAACGGCATTTGCTGGGGGAATCAGCT
GTTCGTGACAGTGGTCGATACTACCCGCTCA
ACTAACATGAGCCTGTGTGCAGCCATCAGTA
CCTCAGAGACAACTTACAAGAACACAAACTT
CAAGGAATACCTGAGACACGGAGAGGAATA
TGACCTCCAGTTCATCTTTCAGCTGTGCAAGA
TTACACTGACTGCCGATGTGATGACTTACATC
CATAGCATGAACAGCACCATTCTGGAGGACT
GGAACTTCGGACTCCAGCCACCTCCAGGCGG
GACCCTGGAAGATACATATAGGTTTGTGACA
CAGGCCATCGCTTGTCAGAAACACACTCCCCC
TGCTCCAAAGGAGGACGATCCCCTGAAGAAA
TACACCTTCTGGGAGGTGAACCTGAAGGAAA
AGTTCAGCGCCGACCTGGACCAGTTCCCACT
GGGCAGGAAGTTTCTGCTCCAGGCTGGGCTG
AAGGCAAAACCTAAGTTCACACTGGGCAAAC
GCAAGGCTACTCCAACCACATCTAGTACCAG
CACTACCGCAAAACGAAAGAAACGGAAGCT
G
MHGDTPTLHEYMLDLQPETTDLYCY
EQLNDSSEEEDEIDGPAGQAEPDRA
HYNIVTFCCKCDSTLRLCVQSTHVDI
RTLEDLLMGTLGIVCPICSQKP
HPV16 L1
Page 11 of 48
MSLWLPSEATVYLPPVPVSKVVSTD
EYVARTNIYYHAGTSRLLAVGHPYFPI
KKPNNNKILVPKVSGLQYRVFRIHLP
DPNKFGFPDTSFYNPDTQRLVWAC
VGVEVGRGQPLGVGISGHPLLNKLD
DTENASAYAANAGVDNRECISMDY
KQTQLCLIGCKPPIGEHWGKGSPCT
NVAVNPGDCPPLELINTVIQDGDMV
DTGFGAMDFTTLQANKSEVPLDICT
SICKYPDYIKMVSEPYGDSLFFYLRRE
QMFVRHLFNRAGTVGENVPDDLYIK
GSGSTANLASSNYFPTPSGSMVTSD
AQIFNKPYWLQRAQGHNNGICWG
NQLFVTVVDTTRSTNMSLCAAISTSE
TTYKNTNFKEYLRHGEEYDLQFIFQL
CKITLTADVMTYIHSMNSTILEDWNF
GLQPPPGGTLEDTYRFVTSQAIACQK
HTPPAPKEDPLKKYTFWEVNLKEKFS
ADLDQFPLGRKFLLQAGLKAKPKFTL
GKRKATPTTSSTSTTAKRKKRKL
% Nucleotide
Change
25.6%
30.0%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV16 L2
ATGCGACACAAACGTTCTGCAAAACGCACAA
AACGTGCATCGGCTACCCAACTTTATAAAACA
TGCAAACAGGCAGGTACATGTCCACCTGACA
TTATACCTAAGGTTGAAGGCAAAACTATTGCT
GATCAAATATTACAATATGGAAGTATGGGTG
TATTTTTTGGTGGGTTAGGAATTGGAACAGG
GTCGGGTACAGGCGGACGCACTGGGTATATT
CCATTGGGAACAAGGCCTCCCACAGCTACAG
ATACACTTGCTCCTGTAAGACCCCCTTTAACA
GTAGATCCTGTGGGCCCTTCTGATCCTTCTAT
AGTTTCTTTAGTGGAAGAAACTAGTTTTATTG
ATGCTGGTGCACCAACATCTGTACCTTCCATT
CCCCCAGATGTATCAGGATTTAGTATTACTAC
TTCAACTGATACCACACCTGCTATATTAGATA
TTAATAATACTGTTACTACTGTTACTACACATA
ATAATCCCACTTTCACTGACCCATCTGTATTG
CAGCCTCCAACACCTGCAGAAACTGGAGGGC
ATTTTACACTTTCATCATCCACTATTAGTACAC
ATAATTATGAAGAAATTCCTATGGATACATTT
ATTGTTAGCACAAACCCTAACACAGTAACTAG
TAGCACACCCATACCAGGGTCTCGCCCAGTG
GCACGCCTAGGATTATATAGTCGCACAACAC
AACAGGTTAAAGTTGTAGACCCTGCTTTTGTA
ACCACTCCCACTAAACTTATTACATATGATAA
TCCTGCATATGAAGGTATAGATGTGGATAAT
ACATTATATTTTTCTAGTAATGATAATAGTATT
AATATAGCTCCAGATCCTGACTTTTTGGATAT
AGTTGCTTTACATAGGCCAGCATTAACCTCTA
GGCGTACTGGCATTAGGTACAGTAGAATTGG
TAATAAACAAACACTACGTACTCGTAGTGGA
AAATCTATAGGTGCTAAGGTACATTATTATTA
TGATTTAAGTACTATTGATCCTGCAGAAGAAA
TAGAATTACAAACTATAACACCTTCTACATAT
ACTACCACTTCACATGCAGCCTCACCTACTTCT
ATTAATAATGGATTATATGATATTTATGCAGA
TGACTTTATTACAGATACTTCTACAACCCCGG
TACCATCTGTACCCTCTACATCTTTATCAGGTT
ATATTCCTGCAAATACAACAATTCCTTTTGGT
GGTGCATACAATATTCCTTTAGTATCAGGTCC
TGATATACCCATTAATATAACTGACCAAGCTC
CTTCATTAATTCCTATAGTTCCAGGGTCTCCA
CAATATACAATTATTGCTGATGCAGGTGACTT
TTATTTACATCCTAGTTATTACATGTTACGAA
AACGACGTAAACGTTTACCATATTTTTTTTCA
GATGTCTCTTTGGCTGCCTAG
ATGAGACATAAGCGGAGTGCTAAGAGGACT
AAAAGAGCCAGCGCCACCCAGCTGTATAAGA
CTTGTAAACAGGCCGGAACTTGCCCTCCTGAC
ATCATTCCAAAGGTGGAGGGGAAAACCATTG
CCGAACAGATCCTCCAGTATGGCAGTATGGG
GGTCTTCTTTGGCGGGCTGGGAATTGGCACA
GGGTCTGGAACTGGAGGCAGGACCGGGTAC
ATCCCACTGGGAACACGCCCCCCTACAGCAA
CTGATACCCTGGCACCCGTGAGACCACCCCT
GACCGTGGACCCAGTCGGACCAAGCGATCCT
TCCATTGTGTCTCTGGTCGAGGAAACCTCCTT
CATCGACGCTGGCGCACCCACAAGTGTGCCT
TCAATTCCTCCAGATGTCAGCGGGTTTTCCAT
CACCACATCTACAGACACTACCCCAGCCATTC
TGGATATCAACAATACTGTGACAACTGTCACC
ACACACAACAATCCAACATTCACTGACCCCTC
CGTGCTCCAGCCACCTACACCCGCTGAGACT
GGAGGACACTTTACACTGAGCAGCAGCACCA
TCAGCACACATAACTATGAGGAAATTCCAAT
GGATACATTCATCGTGAGCACTAACCCCAATA
CCGTCACAAGTTCAACCCCAATCCCCGGCAGC
CGGCCCGTGGCACGACTGGGCCTGTACTCTC
GAACTACCCAGCAGGTGAAGGTGGTGGACC
CCGCTTTTGTCACAACTCCAACCAAACTGATT
ACCTACGACAACCCCGCATACGAAGGCATCG
ACGTGGATAATACCCTGTATTTCAGCTCCAAC
GACAATAGCATCAACATTGCCCCTGACCCAG
ATTTTCTGGATATCGTGGCTCTGCATCGCCCC
GCACTGACTTCTCGGAGAACCGGCATTAGAT
ACAGTAGGATCGGGAATAAGCAGACTCTGCG
AACCCGGTCTGGAAAGAGTATTGGCGCAAAA
GTGCACTACTATTACGACCTGAGCACCATCGA
TCCTGCCGAGGAAATTGAGCTGCAAACTATC
ACCCCAAGTACTTATACCACAACTTCACATGC
CGCTTCACCCACCAGCATCAACAATGGCCTGT
ATGACATCTACGCAGACGATTTCATCACAGAT
ACTAGCACCACACCCGTGCCTTCCGTCCCTTC
TACCAGTCTGTCAGGATATATTCCCGCCAACA
CTACCATCCCTTTTGGCGGGGCTTACAATATC
CCTCTGGTGAGCGGCCCAGACATCCCCATTA
ATATCACAGATCAGGCTCCATCACTGATTCCT
ATCGTCCCAGGGAGCCCCCAGTATACCATCAT
TGCCGACGCTGGAGATTTCTACCTGCACCCCT
CCTATTACATGCTGCGGAAGAGGCGCAAAAG
ACTGCCTTACTTCTTTTCCGACGTGTCTCTGGC
AGCC
MRHKRSAKRTKRASATQLYKTCKQA
GTCPPDIIPKVEGKTIADQILQYGSM
GVFFGGLGIGTGSGTGGRTGYIPLGT
RPPTATDTLAPVRPPLTVDPVGPSDP
SIVSLVEETSFIDAGAPTSVPSIPPDVS
GFSITTSTDTTPAILDINNTVTTVTTH
NNPTFTDPSVLQPPTPAETGGHFTLS
SSTISTHNYEEIPMDTFIVSTNPNTVT
SSTPIPGSRPVARLGLYSRTTQQVKV
VDPAFVTTPTKLITYDNPAYEGIDVD
NTLYFSSNDNSINIAPDPDFLDIVALH
RPALTSRRTGIRYSRIGNKQTLRTRSG
KSIGAKVHYYYDLSTIDPAEEIELQTIT
PSTYTTTSHAASPTSINNGLYDIYADD
FITDTSTTPVPSVPSTSLSGYIPANTTI
PFGGAYNIPLVSGPDIPINITDQAPSLI
PIVPGSPQYTIIADAGDFYLHPSYYML
RKRRKRLPYFFSDVSLAA
Page 12 of 48
% Nucleotide
Change
28.3%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV18 E1
ATGGCTGATCCAGAAGGTACAGACGGGGAG
GGCACGGGTTGTAACGGCTGGTTTTATGTAC
AAGCTATTGTAGACAAAAAAACAGGAGATGT
AATATCAGATGACGAGGACGAAAATGCAACA
GACACAGGGTCGGATATGGTAGATTTTATTG
ATACACAAGGAACATTTTGTGAACAGGCAGA
GCTAGAGACAGCACAGGCATTGTTCCATGCG
CAGGAGGTCCACAATGATGCACAAGTGTTGC
ATGTTTTAAAACGAAAGTTTGCAGGAGGCAG
CACAGAAAACAGTCCATTAGGGGAGCGGCT
GGAGGTGGATACAGAGTTAAGTCCACGGTTA
CAAGAAATATCTTTAAATAGTGGGCAGAAAA
AGGCAAAAAGGCGGCTGTTTACAATATCAGA
TAGTGGCTATGGCTGTTCTGAAGTGGAAGCA
ACACAGATTCAGGTAACTACAAATGGCGAAC
ATGGCGGCAATGTATGTAGTGGCGGCAGTAC
GGAGGCTATAGACAACGGGGGCACAGAGGG
CAACAACAGCAGTGTAGACGGTACAAGTGAC
AATAGCAATATAGAAAATGTAAATCCACAAT
GTACCATAGCACAATTAAAAGACTTGTTAAA
AGTAAACAATAAACAAGGAGCTATGTTAGCA
GTATTTAAAGACACATATGGGCTATCATTTAC
AGATTTAGTTAGAAATTTTAAAAGTGATAAA
ACCACGTGTACAGATTGGGTTACAGCTATATT
TGGAGTAAACCCAACAATAGCAGAAGGATTT
AAAACACTAATACAGCCATTTATATTATATGC
CCATATTCAATGTCTAGACTGTAAATGGGGA
GTATTAATATTAGCCCTGTTGCGTTACAAATG
TGGTAAGAGTAGACTAACAGTTGCTAAAGGT
TTAAGTACGTTGTTACACGTACCTGAAACTTG
TATGTTAATTCAACCACCAAAATTGCGAAGTA
GTGTTGCAGCACTATATTGGTATAGAACAGG
AATATCAAATATTAGTGAAGTAATGGGAGAC
ACACCTGAGTGGATACAAAGACTTACTATTAT
ACAACATGGAATAGATGATAGCAATTTTGAT
TTGTCAGAAATGGTACAATGGGCATTTGATA
ATGAGCTGACAGATGAAAGCGATATGGCATT
TGAATATGCCTTATTAGCAGACAGCAACAGC
AATGCAGCTGCCTTTTTAAAAAGCAATTGCCA
AGCTAAATATTTAAAAGATTGTGCCACAATGT
GCAAACATTATAGGCGAGCCCAAAAACGACA
AATGAATATGTCACAGTGGATACGATTTAGA
TGTTCAAAAATAGATGAAGGGGGAGATTGG
AGACCAATAGTGCAATTCCTGCGATACCAAC
AAATAGAGTTTATAACATTTTTAGGAGCCTTA
AAATCATTTTTAAAAGGAACCCCCAAAAAAA
ATTGTTTAGTATTTTGTGGACCAGCAAATACA
GGAAAATCATATTTTGGAATGAGTTTTATACA
CTTTATACAAGGAGCAGTAATATCATTTGTGA
ATTCCACTAGTCATTTTTGGTTGGAACCGTTA
ACAGATACTAAGGTGGCCATGTTAGATGATG
CAACGACCACGTGTTGGACATACTTTGATACC
TATATGAGAAATGCGTTAGATGGCAATCCAA
TAAGTATTGATAGAAAGCACAAACCATTAAT
ACAACTAAAATGTCCTCCAATACTACTAACCA
CAAATATACATCCAGCAAAGGATAATAGATG
GCCATATTTAGAAAGTAGAATAACAGTATTT
GAATTTCCAAATGCATTTCCATTTGATAAAAA
TGGCAATCCAGTATATGAAATAAATGACAAA
AATTGGAAATGTTTTTTTGAAAGGACATGGT
CCAGATTAGATTTGCACGAGGAAGAGGAAG
ATGCAGACACCGAAGGAAACCCTTTCGGAAC
GTTTAAGTGCGTTGCAGGACAAAATCATAGA
CCACTATGA
ATGGCAGACCCCGAAGGGACTGACGGCGAA
GGGACTGGATGTAACGGATGGTTTTATGTGC
AGGCTATTGTGGATAAGAAGACTGGCGACGT
GATCAGCGACGATGAGGATGAAAACGCTACC
GACACAGGCTCCGACATGGTCGATTTCATTG
ACACACAGGGGACTTTTTGCGAGCAGGCAGA
GCTGGAAACCGCACAGGCCCTGTTCCACGCT
CAGGAAGTGCATAACGATGCACAGGTGCTGC
ACGTCCTGAAGCGGAAATTTGCCGGCGGGA
GTACAGAGAACAGCCCACTGGGGGAGAGAC
TGGAAGTGGACACTGAGCTGTCTCCCAGGCT
CCAGGAAATCAGCCTGAACTCCGGACAGAAG
AAAGCCAAGCGGAGACTGTTCACCATCTCAG
ATAGCGGGTACGGATGCAGCGAGGTGGAAG
CTACACAGATTCAGGTCACCACAAACGGCGA
GCACGGAGGCAATGTGTGTTCTGGGGGAAG
TACAGAGGCTATTGACAATGGCGGGACTGAA
GGAAACAATAGCTCCGTGGATGGCACTTCAG
ACAACAGCAATATCGAAAACGTCAATCCTCA
GTGCACCATTGCCCAGCTGAAGGATCTGCTG
AAAGTGAACAATAAGCAGGGAGCTATGCTG
GCAGTCTTCAAGGATACCTACGGCCTGAGTT
TCACTGACCTGGTGAGAAACTTTAAGTCAGA
TAAAACTACCTGTACCGACTGGGTGACAGCA
ATCTTTGGGGTCAATCCCACCATTGCCGAGG
GATTCAAAACACTGATCCAGCCTTTTATTCTG
TACGCCCACATCCAGTGCCTGGACTGTAAGT
GGGGCGTGCTGATTCTGGCCCTGCTGCGGTA
TAAGTGCGGAAAATCCAGACTGACTGTGGCT
AAAGGCCTGTCTACCCTGCTGCATGTCCCCGA
GACATGTATGCTGATCCAGCCCCCTAAGCTGC
GCTCTAGTGTGGCCGCTCTGTACTGGTATCG
AACCGGGATCTCCAACATTTCTGAGGTCATG
GGAGACACCCCTGAATGGATTCAGAGGCTGA
CAATCATTCAGCACGGCATTGACGATAGCAA
CTTCGATCTGTCCGAGATGGTGCAGTGGGCT
TTTGACAATGAGCTGACCGATGAATCTGACA
TGGCATTCGAATACGCCCTGCTGGCTGATTCC
AACTCTAATGCAGCCGCTTTTCTGAAAAGTAA
CTGCCAGGCCAAGTACCTGAAAGACTGCGCT
ACAATGTGTAAACATTATAGGCGCGCCCAGA
AGCGCCAGATGAATATGTCACAGTGGATCAG
ATTCCGGTGCAGCAAGATTGATGAGGGAGG
CGACTGGAGGCCAATCGTGCAGTTTCTGCGC
TACCAGCAGATCGAGTTCATCACTTTTCTGGG
CGCCCTGAAGAGCTTCCTGAAGGGGACCCCT
AAGAAAAACTGCCTGGTGTTCTGTGGACCAG
CTAATACAGGCAAATCTTATTTTGGGATGAGT
TTCATCCACTTTATTCAGGGCGCAGTGATCAG
CTTCGTCAACAGTACTTCACATTTTTGGCTGG
AGCCCCTGACTGATACCAAGGTGGCAATGCT
GGACGATGCCACAACTACCTGCTGGACTTAC
TTCGACACCTATATGCGGAACGCCCTGGATG
GGAATCCAATCAGCATTGACAGAAAGCACAA
ACCCCTGATCCAGCTGAAATGTCCACCCATCC
TGCTGACAACTAACATTCATCCCGCAAAGGA
CAATCGATGGCCTTACCTGGAGTCCCGGATC
ACCGTGTTCGAATTTCCTAATGCCTTCCCATTT
GATAAGAACGGCAATCCCGTCTATGAGATTA
ACGACAAGAACTGGAAGTGTTTCTTTGAAAG
AACATGGTCCAGGCTGGATCTGCACGAGGAA
GAGGAAGACGCAGATACTGAGGGCAACCCT
TTCGGGACCTTTAAGCTGCGCGCCGGCCAGA
ATCATCGACCACTG
MADPEGTDGEGTGCNGWFYVQAI
VDKKTGDVISDDEDENATDTGSDM
VDFIDTQGTFCEQAELETAQALFHA
QEVHNDAQVLHVLKRKFAGGSTENS
PLGERLEVDTELSPRLQEISLNSGQKK
AKRRLFTISDSGYGCSEVEATQIQVTT
NGEHGGNVCSGGSTEAIDNGGTEG
NNSSVDGTSDNSNIENVNPQCTIAQ
LKDLLKVNNKQGAMLAVFKDTYGLS
FTDLVRNFKSDKTTCTDWVTAIFGV
NPTIAEGFKTLIQPFILYAHIQCLDCK
WGVLILALLRYKCGKSRLTVAKGLSTL
LHVPETCMLIQPPKLRSSVAALYWYR
TGISNISEVMGDTPEWIQRLTIIQHGI
DDSNFDLSEMVQWAFDNELTDESD
MAFEYALLADSNSNAAAFLKSNCQA
KYLKDCATMCKHYRRAQKRQMNM
SQWIRFRCSKIDEGGDWRPIVQFLR
YQQIEFITFLGALKSFLKGTPKKNCLV
FCGPANTGKSYFGMSFIHFIQGAVIS
FVNSTSHFWLEPLTDTKVAMLDDAT
TTCWTYFDTYMRNALDGNPISIDRK
HKPLIQLKCPPILLTTNIHPAKDNRW
PYLESRITVFEFPNAFPFDKNGNPVY
EINDKNWKCFFERTWSRLDLHEEEE
DADTEGNPFGTFKCVAGQNHRPL
Page 13 of 48
% Nucleotide
Change
25.5%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV18 E2
ATGCAGACACCGAAGGAAACCCTTTCGGAAC
GTTTAAGTGCGTTGCAGGACAAAATCATAGA
CCACTATGAAAATGACAGTAAAGACATAGAC
AGCCAAATACAGTATTGGCAACTAATACGTT
GGGAAAATGCAATATTCTTTGCAGCAAGGGA
ACATGGCATACAGACATTAAACCACCAGGTG
GTGCCAGCCTATAACATTTCAAAAAGTAAAG
CACATAAAGCTATTGAACTGCAAATGGCCCT
ACAAGGCCTTGCACAAAGTGCATACAAAACC
GAGGATTGGACACTGCAAGACACATGCGAG
GAACTATGGAATACAGAACCTACTCACTGCTT
TAAAAAAGGTGGCCAAACAGTACAAGTATAT
TTTGATGGCAACAAAGACAATTGTATGACCT
ATGTAGCATGGGACAGTGTGTATTATATGAC
TGATGCAGGAACATGGGACAAAACGGCTACC
TGTGTAAGTCACAGGGGATTGTATTATGTAA
AGGAAGGGTACAACACGTTTTATATAGAATT
TAAAAGTGAATGTGAAAAATATGGGAACACA
GGTACGTGGGAAGTACATTTTGGGAATAATG
TAATTGATTGTAATGACTCTATGTGCAGTACC
AGTGACGACACGGTATCCGCTACTCAGCTTG
TTAAACAGCTACAGCACACCCCCTCACCGTAT
TCCAGCACCGTGTCCGTGGGCACCGCAAAGA
CCTACGGCCAGACGTCGGCTGCTACACGACC
TGGACACTGTGGACTCGCGGAGAAGCAGCAT
TGTGGACCTGTCAACCCACTTCTCGGTGCAGC
TACACCTACAGGCAACAACAAAAGACGGAAA
CTCTGTAGTGGTAACACTACGCCTATAATACA
TTTAAAAGGTGACAGAAACAGTTTAAAATGT
TTACGGTACAGATTGCGAAAACATAGCGACC
ACTATAGAGATATATCATCCACCTGGCATTGG
ACAGGTGCAGGCAATGAAAAAACAGGAATA
CTGACTGTAACATACCATAGTGAAACACAAA
GAACAAAATTTTTAAATACTGTTGCAATTCCA
GATAGTGTACAAATATTGGTGGGATACATGA
CAATGTAA
ATGACTCTATGTGCAGTACCAGTGACGACAC
GGTATCCGCTACTCAGCTTGTTAAACAGCTAC
AGCACACCCCCTCACCGTATTCCAGCACCGTG
TCCGTGGGCACCGCAAAGACCTACGGCCAGA
CGTCGGCTGCTACACGACCTGGACACTGTGG
ACTCGCGGAGAAGCAGCATTGTGGACCTGTC
AACCCACTTCTCGGTGCAGCTACACCTACAGG
CAACAACAAAAGACGGAAACTCTGTAGTGGT
AACACTACGCCTATAA
ATGTTATCACTTATTTTTTTATTTTGCTTTTGTG
TATGCATGTATGTGTGCTGCCATGTCCCGCTT
TTGCCATCTGTCTGTATGTGTGCGTATGCATG
GGTATTGGTATTTGTGTATATTGTGGTAATAA
CGTCCCCTGCCACAGCATTCACAGTATATGTA
TTTTGTTTTTTATTGCCCATGTTACTATTGCAT
ATACATGCTATATTGTCTTTACAGTAA
ATGGCGCGCTTTGAGGATCCAACACGGCGAC
CCTACAAGCTACCTGATCTGTGCACGGAACT
GAACACTTCACTGCAAGACATAGAAATAACC
TGTGTATATTGCAAGACAGTATTGGAACTTAC
AGAGGTATTTGAATTTGCATTTAAAGATTTAT
TTGTGGTGTATAGAGACAGTATACCGCATGC
TGCATGCCATAAATGTATAGATTTTTATTCTA
GAATTAGAGAATTAAGACATTATTCAGACTCT
GTGTATGGAGACACATTGGAAAAACTAACTA
ACACTGGGTTATACAATTTATTAATAAGGTGC
CTGCGGTGCCAGAAACCGTTGAATCCAGCAG
AAAAACTTAGACACCTTAATGAAAAACGACG
ATTTCACAACATAGCTGGGCACTATAGAGGC
CAGTGCCATTCGTGCTGCAACCGAGCACGAC
AGGAACGACTCCAACGACGCAGAGAAACAC
AAGTATAA
ATGCAGACCCCTAAGGAAACACTGAGCGAAC
GACTGTCATGCGTCCAGGATAAAATCATCGA
CCATTACGAAAACGACTCCAAAGATATCGAC
AGCCAGATTCAGTACTGGCAGCTGATCCGGT
GGGAGAACGCAATTTTCTTTGCCGCTAGAGA
ACACGGAATCCAGACCCTGAACCATCAGGTG
GTCCCCGCCTACAATATCTCAAAGAGCAAAG
CCCACAAGGCTATTGAGCTGCAAATGGCACT
CCAGGGCCTGGCCCAGTCCCGATATAAAACA
GAGGACTGGACTCTCCAGGATACCTGCGAGG
AACTGTGGAATACAGAACCTACTCATTGTTTC
AAGAAAGGCGGGCAGACCGTGCAGGTCTAC
TTTGACGGGAACAAGGATAATTGCATGACCT
ACGTGGCCTGGGATTCCGTCTACTATATGAC
AGACGCTGGAACTTGGGATAAGACTGCAACC
TGTGTGTCTCACAGGGGCCTGTACTATGTGA
AAGAGGGGTACAACACCTTCTATATCGAGTT
CAAGTCTGAGTGCGAAAAATACGGGAATACA
GGAACTTGGGAGGTGCACTTCGGGAACAAT
GTCATTGACTGCAACGATAGCATGTGTTCCAC
CTCTGACGATACAGTGTCCGCCACTCAGCTG
GTCAAGCAGCTCCAGCATACACCCAGTCCTTA
CAGCTCCACTGTGTCAGTCGGAACCGCCAAA
ACCTACGGCCAGACCAGTGCAGCCACACGGC
CAGGCCACTGCGGACTGGCTGAAAAGCAGC
ATTGTGGCCCAGTGAATCCCCTGCTGGGGGC
TGCAACCCCTACAGGAAACAATAAGCGGAGA
AAACTGTGCAGCGGAAACACCACACCAATCA
TTCACCTGAAGGGCGACCGGAACAGCCTGAA
GTGTCTGCGGTACAGACTGCGAAAGCACAGT
GACCATTATCGCGATATCTCTAGTACTTGGCA
CTGGACCGGAGCTGGCAACGAGAAGACCGG
CATTCTGACTGTGACCTACCATTCAGAAACTC
AGCGGACCAAATTTCTGAATACTGTGGCCAT
CCCCGATAGCGTGCAGATTCTGGTCGGGTAT
ATGACAATG
ATGACCCTGTGTGCTGTCCCTGTGACTACAAG
ATACCCCCTGCTGTCCCTGCTGAACTCCTACT
CCACCCCTCCCCATAGAATCCCCGCACCATGC
CCTTGGGCTCCACAGAGACCAACTGCACGGA
GAAGGCTGCTGCACGACCTGGATACCGTGGA
CAGCCGGCGGAGCAGCATCGTGGATCTGTCT
ACACACTTCAGTGTCCAGCTGCACCTCCAGGC
CACCACAAAGGACGGCAACTCTGTGGTCGTG
ACCCTGCGGCTG
ATGCTGAGTCTGATTTTCCTGTTTTGTTTCTGC
GTGTGTATGTATGTCTGCTGTCATGTCCCACT
GCTGCCTTCTGTCTGTATGTGTGCCTACGCTT
GGGTGCTGGTCTTCGTGTATATCGTGGTCATT
ACCAGCCCCGCAACCGCCTTTACAGTCTACGT
GTTCTGCTTTCTGCTGCCTATGCTGCTGCTGC
ACATCCATGCTATTCTGAGCCTCCAG
ATGGCTAGATTTGAGGACCCAACAAGACGCC
CCTATAAACTGCCCGACCTGTGTACCGAACTG
AATACTTCTCTGCAGGACATTGAGATCACATG
CGTGTACTGTAAAACTGTCCTGGAGCTGACC
GAAGTGTTCGAGTTTGCTTTCAAGGACCTGTT
TGTGGTCTACAGAGATTCTATTCCCCACGCCG
CTTGCCATAAATGTATCGACTTCTATAGTCGG
ATTAGAGAACTGAGGCACTACAGCGACTCCG
TCTATGGAGATACTCTGGAGAAGCTGACCAA
CACAGGCCTGTACAATCTGCTGATCCGATGCC
TGAGGTGTCAGAAGCCCCTGAACCCTGCCGA
AAAACTGCGGCACCTGAACGAGAAGCGGAG
ATTTCACAATATTGCAGGCCATTATCGCGGGC
AGTGCCATTCCTGCTGTAATAGGGCCCGCCA
GGAACGACTCCAGAGGCGCCGAGAGACCCA
GGTG
MQTPKETLSERLSALQDKIIDHYEND
SKDIDSQIQYWQLIRWENAIFFAARE
HGIQTLNHQVVPAYNISKSKAHKAIE
LQMALQGLAQSAYKTEDWTLQDTC
EELWNTEPTHCFKKGGQTVQVYFD
GNKDNCMTYVAWDSVYYMTDAGT
WDKTATCVSHRGLYYVKEGYNTFYIE
FKSECEKYGNTGTWEVHFGNNVIDC
NDSMCSTSDDTVSATQLVKQLQHT
PSPYSSTVSVGTAKTYGQTSAATRPG
HCGLAEKQHCGPVNPLLGAATPTG
NNKRRKLCSGNTTPIIHLKGDRNSLK
CLRYRLRKHSDHYRDISSTWHWTGA
GNEKTGILTVTYHSETQRTKFLNTVAI
PDSVQILVGYMTM
HPV18 E4
HPV18 E5
HPV18 E6
Page 14 of 48
% Nucleotide
Change
25.2%
MTLCAVPVTTRYPLLSLLNSYSTPPH
RIPAPCPWAPQRPTARRRLLHDLDT
VDSRRSSIVDLSTHFSVQLHLQATTK
DGNSVVVTLRL
27.6%
MLSLIFLFCFCVCMYVCCHVPLLPSV
CMCAYAWVLVFVYIVVITSPATAFTV
YVFCFLLPMLLLHIHAILSLQ
29.2%
MARFEDPTRRPYKLPDLCTELNTSLQ
DIEITCVYCKTVLELTEVFEFAFKDLFV
VYRDSIPHAACHKCIDFYSRIRELRHY
SDSVYGDTLEKLTNTGLYNLLIRCLRC
QKPLNPAEKLRHLNEKRRFHNIAGH
YRGQCHSCCNRARQERLQRRRETQ
V
27.1%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV18 E7
ATGCATGGACCTAAGGCAACATTGCAAGACA
TTGTATTGCATTTAGAGCCCCAAAATGAAATT
CCGGTTGACCTTCTATGTCACGAGCAATTAAG
CGACTCAGAGGAAGAAAACGATGAAATAGA
TGGAGTTAATCATCAACATTTACCAGCCCGAC
GAGCCGAACCACAACGTCACACAATGTTGTG
TATGTGTTGTAAGTGTGAAGCCAGAATTGAG
CTAGTAGTAGAAAGCTCAGCAGACGACCTTC
GAGCATTCCAGCAGCTGTTTCTGAACACCCTG
TCCTTTGTGTGTCCGTGGTGTGCATCCCAGCA
GTAA
ATGGCTTTGTGGCGGCCTAGTGACAATACCG
TATATCTTCCACCTCCTTCTGTGGCAAGAGTT
GTAAATACCGATGATTATGTGACTCGCACAA
GCATATTTTATCATGCTGGCAGCTCTAGATTA
TTAACTGTTGGTAATCCATATTTTAGGGTTCC
TGCAGGTGGTGGCAATAAGCAGGATATTCCT
AAGGTTTCTGCATACCAATATAGAGTATTTAG
GGTGCAGTTACCTGACCCAAATAAATTTGGTT
TACCTGATACTAGTATTTATAATCCTGAAACA
CAACGTTTAGTGTGGGCCTGTGCTGGAGTGG
AAATTGGCCGTGGTCAGCCTTTAGGTGTTGG
CCTTAGTGGGCATCCATTTTATAATAAATTAG
ATGACACTGAAAGTTCCCATGCCGCCACGTCT
AATGTTTCTGAGGACGTTAGGGACAATGTGT
CTGTAGATTATAAGCAGACACAGTTATGTATT
TTGGGCTGTGCCCCTGCTATTGGGGAACACT
GGGCTAAAGGCACTGCTTGTAAATCGCGTCC
TTTATCACAGGGCGATTGCCCCCCTTTAGAAC
TTAAAAACACAGTTTTGGAAGATGGTGATAT
GGTAGATACTGGATATGGTGCCATGGACTTT
AGTACATTGCAAGATACTAAATGTGAGGTAC
CATTGGATATTTGTCAGTCTATTTGTAAATAT
CCTGATTATTTACAAATGTCTGCAGATCCTTA
TGGGGATTCCATGTTTTTTTGCTTACGGCGTG
AGCAGCTTTTTGCTAGGCATTTTTGGAATAGA
GCAGGTACTATGGGTGACACTGTGCCTCAAT
CCTTATATATTAAAGGCACAGGTATGCGTGCT
TCACCTGGCAGCTGTGTGTATTCTCCCTCTCC
AAGTGGCTCTATTGTTACCTCTGACTCCCAGT
TGTTTAATAAACCATATTGGTTACATAAGGCA
CAGGGTCATAACAATGGTGTTTGCTGGCATA
ATCAATTATTTGTTACTGTGGTAGATACCACT
CGCAGTACCAATTTAACAATATGTGCTTCTAC
ACAGTCTCCTGTACCTGGGCAATATGATGCTA
CCAAATTTAAGCAGTATAGCAGACATGTTGA
GGAATATGATTTGCAGTTTATTTTTCAGTTGT
GTACTATTACTTTAACTGCAGATGTTATGTCC
TATATTCATAGTATGAATAGCAGTATTTTAGA
GGATTGGAACTTTGGTGTTCCCCCCCCGCCAA
CTACTAGTTTGGTGGATACATATCGTTTTGTA
CAATCTGTTGCTATTACCTGTCAAAAGGATGC
TGCACCGGCTGAAAATAAGGATCCCTATGAT
AAGTTAAAGTTTTGGAATGTGGATTTAAAGG
AAAAGTTTTCTTTAGACTTAGATCAATATCCC
CTTGGACGTAAATTTTTGGTTCAGGCTGGATT
GCGTCGCAAGCCCACCATAGGCCCTCGCAAA
CGTTCTGCTCCATCTGCCACTACGTCTTCTAA
ACCTGCCAAGCGTGTGCGTGTACGTGCCAGG
AAGTAA
ATGCATGGACCCAAGGCTACTCTGCAGGATA
TTGTGCTGCACCTGGAACCTCAGAATGAAAT
CCCCGTGGACCTGCTGTGTCACGAACAGCTG
TCTGACAGTGAGGAAGAGAACGACGAGATC
GATGGCGTGAATCACCAGCATCTGCCCGCCC
GGAGAGCTGAACCTCAGAGGCACACCATGCT
GTGCATGTGCTGTAAGTGTGAAGCTCGCATT
GAGCTGGTGGTCGAAAGCTCCGCAGACGATC
TGCGAGCCTTCCAGCAGCTGTTTCTGAACACA
CTGAGCTTCGTCTGCCCCTGGTGTGCCTCTCA
GCAG
ATGTGCCTGTATACCCGCGTGCTGATTCTGCA
TTACCATCTGCTGCCCCTGTACGGACCCCTGT
ACCATCCAAGACCTCTGCCTCTGCACTCCATT
CTGGTGTACATGGTCCACATCATTATCTGCGG
GCATTATATTATCCTGTTCCTGCGCAACGTGA
ATGTCTTCCCTATCTTCCTCCAGATGGCTCTGT
GGCGACCAAGTGACAACACCGTGTACCTGCC
CCCTCCATCAGTCGCACGGGTGGTCAATACA
GACGATTACGTGACCCCTACAAGCATTTTCTA
TCATGCAGGCAGCTCCCGACTGCTGACCGTG
GGAAACCCTTATTTTCGGGTCCCAGCAGGCG
GGGGAAATAAGCAGGATATCCCAAAAGTGTC
TGCCTACCAGTATCGGGTGTTCAGAGTCCAG
CTGCCCGACCCTAACAAGTTTGGCCTGCCAG
ATACTAGCATCTACAATCCCGAGACCCAGAG
ACTGGTGTGGGCTTGTGCAGGAGTCGAAATC
GGAAGGGGACAGCCACTGGGAGTGGGACTG
TCTGGACACCCTTTCTACAACAAGCTGGACGA
TACAGAGTCTAGTCATGCCGCTACTTCAAACG
TGAGCGAAGACGTCAGAGATAATGTGAGCG
TGGACTATAAACAGACTCAGCTGTGCATTCTG
GGATGTGCACCAGCTATCGGAGAGCACTGG
GCTAAGGGAACCGCATGCAAATCTAGGCCTC
TGAGTCAGGGCGACTGTCCCCCTCTGGAGCT
GAAGAATACCGTGCTGGAAGACGGGGATAT
GGTCGATACAGGGTACGGAGCTATGGACTTT
TCTACACTCCAGGATACTAAGTGCGAGGTGC
CTCTGGACATTTGCCAGAGTATCTGTAAATAC
CCAGATTACCTCCAGATGTCCGCTGACCCCTA
TGGCGATTCTATGTTCTTTTGTCTGCGGAGAG
AACAGCTGTTCGCCAGGCACTTTTGGAACCG
CGCTGGCACAATGGGCGACACAGTGCCCCAG
AGCCTGTACATTAAGGGCACAGGAATGCCAG
CATCCCCTGGATCTTGCGTGTATAGTCCATCA
CCCAGCGGCTCCATCGTGACCTCTGACAGTC
AGCTGTTCAATAAGCCATACTGGCTGCACAA
AGCTCAGGGGCATAACAATGGAGTGTGCTG
GCATAACCAGCTGTTTGTCACAGTGGTCGAT
ACCACACCCAGCACTAATCTGACCATCTGTGC
CAGTACACAGTCACCTGTGCCAGGACAGTAC
GACGCTACTAAGTTCAAACAGTACTCCAGAC
ACGTGGAGGAATATGACCTCCAGTTCATCTTT
CAGCTGTGCACTATCACCCTGACAGCCGACG
TGATGTCATACATTCATAGCATGAACTCAAGC
ATCCTGGAGGATTGGAATTTCGGCGTGCCAC
CCCCTCCAACTACCAGCCTGGTGGACACTTAT
CGCTTTGTGCAGTCCGTCGCTATTACCTGTCA
GAAGGATGCAGCCCCCGCAGAGAACAAAGA
CCCTTACGATAAGCTGAAATTCTGGAATGTG
GACCTGAAGGAAAAATTTTCCCTGGACCTGG
ATCAGTATCCCCTGGGACGGAAGTTTCTGGT
GCAGGCAGGACTGAGGCGAAAGCCAACCAT
CGGACCACGAAAACGGAGCGCACCTTCCGCC
ACAACTTCCTCTAAGCCAGCAAAAAGAGTGA
GGGTCCGCGCCCGAAAA
MHGPKATLQDIVLHLEPQNEIPVDLL
CHEQLSDSEEENDEIDGVNHQHLPA
RRAEPQRHTMLCMCCKCEARIELVV
ESSADDLRAFQQLFLNTLSFVCPWC
ASQQ
HPV18 L1
Page 15 of 48
MALWRPSDNTVYLPPPSVARVVNT
DDYVTRTSIFYHAGSSRLLTVGNPYF
RVPAGGGNKQDIPKVSAYQYRVFRV
QLPDPNKFGLPDTSIYNPETQRLVW
ACAGVEIGRGQPLGVGLSGHPFYNK
LDDTESSHAATSNVSEDVRDNVSVD
YKQTQLCILGCAPAIGEHWAKGTAC
KSRPLSQGDCPPLELKNTVLEDGDM
VDTGYGAMDFSTLQDTKCEVPLDIC
QSICKYPDYLQMSADPYGDSMFFCL
RREQLFARHFWNRAGTMGDTVPQ
SLYIKGTGMRASPGSCVYSPSPSGSIV
TSDSQLFNKPYWLHKAQGHNNGVC
WHNQLFVTVVDTTRSTNLTICASTQ
SPVPGQYDATKFKQYSRHVEEYDLQ
FIFQLCTITLTADVMSYIHSMNSSILE
DWNFGVPPPPTTSLVDTYRFVQSVA
ITCQKDAAPAENKDPYDKLKFWNVD
LKEKFSLDLDQYPLGRKFLVQAGLRR
KPTIGPRKRSAPSATTSSKPAKRVRV
RARK
% Nucleotide
Change
23.8%
34.1%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV18 L2
ATGGTATCCCACCGTGCCGCACGACGCAAAC
GGGCTTCGGTAACTGACTTATATAAAACATGT
AAACAATCTGGTACATGTCCACCTGATGTTGT
TCCTAAGGTGGAGGGCACCACGTTAGCAGAT
AAAATATTGCAATGGTCAAGCCTTGGTATATT
TTTGGGTGGACTTGGCATAGGTACTGGCAGT
GGTACAGGGGGTCGTACAGGGTACATTCCAT
TGGGTGGGCGTTCCAATACAGTGGTGGATGT
TGGTCCTACACGTCCCCCAGTGGTTATTGAAC
CTGTGGGCCCCACAGACCCATCTATTGTTACA
TTAATAGAGGACTCCAGTGTGGTTACATCAG
GTGCACCTAGGCCTACGTTTACTGGCACGTCT
GGGTTTGATATAACATCTGCGGGTACAACTA
CACCTGCGGTTTTGGATATCACACCTTCGTCT
ACCTCTGTGTCTATTTCCACAACCAATTTTACC
AATCCTGCATTTTCTGATCCGTCCATTATTGA
AGTTCCACAAACTGGGGAGGTGGCAGGTAAT
GTATTTGTTGGTACCCCTACATCTGGAACACA
TGGGTATGAGGAAATACCTTTACAAACATTT
GCTTCTTCTGGTACGGGGGAGGAACCCATTA
GTAGTACCCCATTGCCTACTGTGCGGCGTGT
AGCAGGTCCCCGCCTTTACAGTAGGGCCTAC
CAACAAGTGTCAGTGGCTAACCCTGAGTTTCT
TACACGTCCATCCTCTTTAATTACATATGACA
ACCCGGCCTTTGAGCCTGTGGACACTACATTA
ACATTTGATCCTCGTAGTGATGTTCCTGATTC
AGATTTTATGGATATTATCCGTCTACATAGGC
CTGCTTTAACATCCAGGCGTGGGACTGTTCG
CTTTAGTAGATTAGGTCAACGGGCAACTATG
TTTACCCGCAGCGGTACACAAATAGGTGCTA
GGGTTCACTTTTATCATGATATAAGTCCTATT
GCACCTTCCCCAGAATATATTGAACTGCAGCC
TTTAGTATCTGCCACGGAGGACAATGACTTG
TTTGATATATATGCAGATGACATGGACCCTGC
AGTGCCTGTACCATCGCGTTCTACTACCTCCT
TTGCATTTTTTAAATATTCGCCCACTATATCTT
CTGCCTCTTCCTATAGTAATGTAACGGTCCCT
TTAACCTCCTCTTGGGATGTGCCTGTATACAC
GGGTCCTGATATTACATTACCATCTACTACCT
CTGTATGGCCCATTGTATCACCCACGGCCCCT
GCCTCTACACAGTATATTGGTATACATGGTAC
ACATTATTATTTGTGGCCATTATATTATTTTAT
TCCTAAGAAACGTAAACGTGTTCCCTATTTTT
TTGCAGATGGCTTTGTGGCGGCCTAG
ATGGTGTCTCATCGCGCAGCACGACGGAAAA
GAGCCAGTGTGACCGACCTGTATAAAACCTG
TAAGCAGAGCGGAACTTGCCCTCCTGACGTG
GTCCCCAAGGTGGAGGGAACCACACTGGCTG
ATAAGATCCTCCAGTGGAGCAGCCTGGGAAT
CTTCCTGGGAGGACTGGGAATTGGGACTGG
AAGCGGCACCGGAGGCCGAACAGGCTACAT
CCCTCTGGGGGGACGCAGCAACACCGTGGT
GGACGTGGGACCAACAAGGCCCCCTGTGGTC
ATTGAGCCTGTGGGGCCAACTGACCCCTCCA
TCGTCACCCTGATTGAAGATTCTAGTGTGGTC
ACATCTGGGGCCCCACGACCAACATTCACTG
GCACCTCCGGGTTTGACATCACCTCTGCTGGA
ACTACCACACCCGCCGTGCTGGACATCACTCC
ATCAAGCACCAGTGTGTCAATTAGCACTACCA
ACTTCACAAATCCAGCCTTTAGTGATCCCTCA
ATCATTGAGGTGCCCCAGACTGGCGAAGTCG
CTGGGAATGTGTTCGTCGGCACACCCACTAG
CGGAACCCACGGCTACGAGGAAATCCCTCTC
CAGACATTTGCATCCTCTGGGACTGGAGAGG
AACCAATTAGTTCAACACCTCTGCCAACTGTG
CGGAGAGTCGCAGGACCACGACTGTACAGC
AGAGCATATCAGCAGGTGTCCGTCGCCAACC
CCGAGTTCCTGACTCGGCCTAGCTCCCTGATC
ACCTATGACAATCCCGCTTTCGAACCTGTGGA
TACAACTCTGACCTTTGACCCTCGGAGCGATG
TGCCAGACAGTGATTTTATGGACATCATTAGA
CTGCATAGGCCAGCACTGACTAGCAGGCGCG
GGACCGTGCGCTTCAGCCGACTGGGACAGA
GGGCCACCATGTTTACACGCTCCGGAACACA
GATTGGCGCTAGGGTGCACTTCTACCATGAT
ATCTCACCAATTGCACCCAGCCCTGAGTATAT
CGAGCTGCAACCCCTGGTGTCTGCCACCGAG
GACAACGATCTGTTCGACATCTACGCAGACG
ATATGGACCCCGCCGTGCCCGTGCCCAGCCG
GAGCACCACATCTTTTGCTTTCTTTAAGTACA
GTCCCACCATCTCTAGTGCATCAAGCTATAGC
AATGTGACCGTCCCTCTGACATCCTCTTGGGA
CGTGCCCGTCTATACAGGCCCTGATATCACTC
TGCCAAGCACTACCTCCGTGTGGCCTATTGTC
AGTCCTACCGCACCAGCCTCAACACAGTACAT
CGGGATTCACGGAACCCATTACTATCTGTGG
CCACTGTACTATTTCATCCCCAAGAAACGAAA
ACGGGTGCCATATTTCTTTGCCGACGGCTTTG
TGGCCGCT
MVSHRAARRKRASVTDLYKTCKQSG
TCPPDVVPKVEGTTLADKILQWSSLG
IFLGGLGIGTGSGTGGRTGYIPLGGR
SNTVVDVGPTRPPVVIEPVGPTDPSI
VTLIEDSSVVTSGAPRPTFTGTSGFDI
TSAGTTTPAVLDITPSSTSVSISTTNFT
NPAFSDPSIIEVPQTGEVAGNVFVGT
PTSGTHGYEEIPLQTFASSGTGEEPIS
STPLPTVRRVAGPRLYSRAYQQVSVA
NPEFLTRPSSLITYDNPAFEPVDTTLT
FDPRSDVPDSDFMDIIRLHRPALTSR
RGTVRFSRLGQRATMFTRSGTQIGA
RVHFYHDISPIAPSPEYIELQPLVSATE
DNDLFDIYADDMDPAVPVPSRSTTS
FAFFKYSPTISSASSYSNVTVPLTSSW
DVPVYTGPDITLPSTTSVWPIVSPTA
PASTQYIGIHGTHYYLWPLYYFIPKKR
KRVPYFFADGFVAA
Page 16 of 48
% Nucleotide
Change
27.2%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV31 E1
ATGGCTGATCCAGCAGGTACAGATGGGGAG
GGGACGGGATGCAATGGTTGGTTTTATGTAG
AAGCAGTAATTGACAGACAGACAGGGGACA
ACATTTCAGAGGACGAAAATGAAGACAGTAG
TGATACTGGGGAGGATATGGTTGACTTTATT
GACAATTGTAATGTATACAACAATCAGGCAG
AAGCAGAGACAGCACAGGCATTGTTTCATGC
ACAGGAAGCGGAGGAACATGCAGAGGCTGT
GCAGGTTCTAAAACGAAAGTATGTAGGTAGT
CCTTTAAGTGATATTAGTAGTTGTGTGGATTA
TAATATTAGTCCACGGTTAAAAGCTATATGCA
TAGAAAATAACAGTAAAACAGCAAAACGAAG
ACTCTTTGAACTTCCAGACAGCGGGTATGGC
AATACTGAAGTGGAAACGCAGCAGATGGTAC
AGGTAGAGGAGCAACAAACAACATTAAGTTG
TAATGGTAGTGACGGGACACATAGTGAACGA
GAGAATGAAACTCCAACACGTAATATATTGC
AAGTGTTAAAAACTAGCAATGGTAAAGCTGC
TATGTTAGGTAAATTTAAAGAATTATATGGTG
TAAGTTTTATGGAACTAATTAGGCCATTTCAA
AGCAATAAAAGCACATGTACTGATTGGTGTG
TAGCTGCGTTTGGAGTTACAGGTACAGTTGC
AGAAGGATTTAAAACCCTATTGCAACCATATT
GTTTGTATTGCCATTTACAAAGTTTAGCATGT
TCCTGGGGCATGGTTATGTTAATGCTTGTGA
GATTTAAATGTGCAAAAAATAGAATAACAAT
TGAAAAATTATTAGAAAAATTATTGTGTATAT
CTACAAATTGTATGTTAATTCAGCCACCCAAA
TTACGTAGCACAGCTGCAGCATTATATTGGTA
CAGAACAGGAATGTCAAACATTAGCGATGTA
TATGGTGAAACACCAGAATGGATAGAAAGAC
AAACAGTATTACAGCATAGTTTTAATGACACA
ACATTTGATTTGTCCCAAATGGTACAATGGGC
ATATGACAATGATGTTATGGATGATAGTGAA
ATTGCCTATAAATATGCACAATTAGCTGACAG
TGATAGTAATGCATGTGCATTTTTAAAAAGTA
ATTCGCAGGCAAAAATAGTTAAAGATTGTGG
AACAATGTGTAGACATTATAAACGAGCAGAA
AAACGACAAATGTCCATGGGACAGTGGATTA
AAAGTAGATGTGACAAAGTTAGTGACGAAG
GTGACTGGAGGGACATAGTAAAGTTTTTAAG
ATATCAACAAATAGAATTTGTGTCATTTTTAT
CTGCATTAAAGCTGTTTTTAAAAGGAGTGCCA
AAGAAAAACTGTATTTTAATACATGGTGCACC
TAATACAGGTAAATCATATTTTGGAATGAGCC
TTATTAGCTTTTTACAAGGATGTATAATATCA
TATGCAAATTCAAAAAGTCATTTTTGGTTACA
ACCACTGGCTGATGCTAAAATAGGCATGTTA
GATGATGCTACAACGCCATGTTGGCATTATAT
AGACAATTACCTACGAAATGCACTAGATGGC
AACCCTGTATCTATAGATGTAAAGCATAAAG
CTTTAATGCAGTTAAAATGTCCTCCTTTATTG
ATTACATCTAATATAAATGCAGGTAAGGATG
ACAGATGGCCATACCTACATAGCAGACTGGT
GGTTTTTACATTTCCAAATCCATTTCCATTTGA
CAAAAACGGAAATCCAGTATATGAATTAAGT
GATAAAAACTGGAAATCCTTTTTCTCAAGGAC
GTGGTGCAGATTAAATTTGCACGAGGAAGAG
GACAAAGAAAACGATGGAGACTCTTTCTCAA
CGTTTAAATGTGTGTCAGGACAAAATATTAG
AACATTATGA
ATGGCAGACCCAGCAGGAACAGACGGAGAG
GGGACAGGATGTAATGGATGGTTTTATGTGG
AGGCAGTGATTGACAGGCAGACCGGCGACA
ACATCAGCGAAGATGAGAATGAAGACAGCTC
CGATACCGGCGAGGACATGGTGGATTTCATT
GACAACTGCAATGTCTACAACAATCAGGCTG
AGGCAGAAACAGCCCAGGCTCTGTTTCACGC
ACAGGAGGCCGAGGAACATGCAGAAGCCGT
GCAGGTCCTGAAGAGAAAATACGTGGGGTCT
CCCCTGAGTGACATCTCTAGTTGCGTCGATTA
TAACATTTCTCCTAGGCTGAAGGCCATCTGTA
TTGAGAACAATAGTAAGACCGCTAAACGGAG
ACTGTTCGAACTGCCAGACTCTGGCTACGGG
AACACCGAGGTGGAAACACAGCAGATGGTG
CAGGTCGAGGAACAGCAGACCACACTGTCAT
GCAATGGAAGCGATGGCACACACAGCGAGA
GGGAAAACGAGACCCCCACACGCAATATCCT
GCAGGTGCTGAAGACTAGCAACGGCAAAGC
CGCTATGCTGGGGAAGTTTAAAGAGCTGTAT
GGCGTGTCCTTCATGGAACTGATTCGCCCCTT
TCAGTCAAATAAGAGCACTTGCACCGACTGG
TGTGTGGCAGCCTTCGGGGTGACAGGAACTG
TCGCCGAGGGCTTCAAGACACTGCTGCAGCC
TTACTGCCTGTATTGTCACCTGCAGTCCCTGG
CCTGCTCTTGGGGCATGGTCATGCTGATGCT
GGTCCGATTCAAGTGTGCTAAAAACCGGATC
ACTATTGAGAAGCTGCTGGAAAAACTGCTGT
GCATCTCAACCAATTGTATGCTGATTCAGCCC
CCTAAGCTGCGCAGCACAGCTGCAGCCCTGT
ACTGGTATCGGACTGGAATGTCCAATATCTCT
GATGTGTACGGCGAAACTCCTGAGTGGATTG
AACGCCAGACCGTCCTGCAGCACAGCTTCAA
CGACACTACCTTTGATCTGTCCCAGATGGTGC
AGTGGGCATATGACAATGATGTCATGGACGA
TTCTGAGATCGCCTACAAGTATGCTCAGCTG
GCAGACTCAGATAGCAACGCTTGCGCATTTCT
GAAATCCAATTCTCAGGCAAAGATCGTGAAA
GACTGCGGCACAATGTGTAGGCATTACAAGC
GCGCCGAGAAACGACAGATGAGCATGGGCC
AGTGGATTAAGAGTCGGTGTGATAAAGTGTC
AGATGAGGGGGACTGGAGAGATATCGTCAA
GTTCCTGAGGTATCAGCAGATTGAATTCGTG
AGTTTTCTGTCAGCTCTGAAGCTGTTTCTGAA
AGGCGTGCCTAAGAAAAACTGCATCCTGATT
CACGGCGCCCCAAATACTGGGAAGAGTTACT
TCGGAATGAGCCTGATCTCCTTTCTGCAGGG
GTGTATCATTAGCTATGCCAACAGTAAGTCAC
ATTTCTGGCTGCAGCCACTGGCCGACGCTAA
AATCGGAATGCTGGACGATGCCACAACTCCC
TGCTGGCACTACATTGATAACTATCTGAGAAA
TGCTCTGGACGGCAATCCCGTGTCCATCGAT
GTCAAGCATAAAGCACTGATGCAGCTGAAGT
GTCCACCCCTGCTGATCACTTCCAACATTAAT
GCCGGGAAAGACGATCGCTGGCCTTACCTGC
ACTCTCGACTGGTGGTCTTCACCTTTCCTAAC
CCATTCCCCTTTGACAAGAACGGAAATCCAGT
GTATGAGCTGAGCGATAAGAATTGGAAATCT
TTCTTTAGTCGGACCTGGTGCAGACTGAACCT
GCATGAGGAAGAGGACAAGGAGAATGACGG
GGATAGCTTCTCCACCTTTAAATGTGTGTCCG
GACAGAACATCAGGACACTG
MADPAGTDGEGTGCNGWFYVEAVI
DRQTGDNISEDENEDSSDTGEDMV
DFIDNCNVYNNQAEAETAQALFHA
QEAEEHAEAVQVLKRKYVGSPLSDIS
SCVDYNISPRLKAICIENNSKTAKRRL
FELPDSGYGNTEVETQQMVQVEEQ
QTTLSCNGSDGTHSERENETPTRNIL
QVLKTSNGKAAMLGKFKELYGVSFM
ELIRPFQSNKSTCTDWCVAAFGVTG
TVAEGFKTLLQPYCLYCHLQSLACSW
GMVMLMLVRFKCAKNRITIEKLLEKL
LCISTNCMLIQPPKLRSTAAALYWYR
TGMSNISDVYGETPEWIERQTVLQH
SFNDTTFDLSQMVQWAYDNDVMD
DSEIAYKYAQLADSDSNACAFLKSNS
QAKIVKDCGTMCRHYKRAEKRQMS
MGQWIKSRCDKVSDEGDWRDIVKF
LRYQQIEFVSFLSALKLFLKGVPKKNCI
LIHGAPNTGKSYFGMSLISFLQGCIIS
YANSKSHFWLQPLADAKIGMLDDA
TTPCWHYIDNYLRNALDGNPVSIDV
KHKALMQLKCPPLLITSNINAGKDDR
WPYLHSRLVVFTFPNPFPFDKNGNP
VYELSDKNWKSFFSRTWCRLNLHEE
EDKENDGDSFSTFKCVSGQNIRTL
Page 17 of 48
% Nucleotide
Change
26.2%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV31 E2
ATGGAGACTCTTTCTCAACGTTTAAATGTGTG
TCAGGACAAAATATTAGAACATTATGAAAAT
GATAGTAAACGACTTTGTGATCATATAGACTA
TTGGAAACATATTCGACTTGAATGTGTATTAA
TGTATAAAGCAAGAGAAATGGGAATACACA
GTATTAACCACCAGGTGGTGCCAGCGTTGTC
AGTATCAAAGGCCAAAGCCTTACAAGCTATT
GAACTACAAATGATGTTGGAAACATTAAATA
ACACTGAATACAAAAATGAGGACTGGACAAT
GCAGCAAACAAGTCTTGAACTGTATTTAACTG
CACCTACAGGGTGTTTAAAAAAACATGGATA
TACTGTAGAGGTGCAATTTGATGGTGATGTA
CACAACACCATGCATTATACTAACTGGAAATT
TATATACCTATGTATAGATGGCCAATGTACTG
TTGTGGAAGGGCAAGTTAATTGTAAGGGCAT
TTATTATGTACATGAAGGACATATAACATATT
TTGTAAATTTTACAGAAGAGGCAAAAAAATA
TGGGACTGGTAAAAAATGGGAAGTGCATGC
GGGTGGTCAGGTAATTGTTTTTCCTGAATCTG
TATTTAGCAGTGACGAAATATCCTTTGCTGGG
ATTGTTACAAAGCTACCAACAGCCAACAACA
CCACCACATCGAATTCCAAAACCTGCGCCTTG
GGCACCAGTGAAGGTGTGCGGCGGGCGACG
ACGTCTACTAAGCGACCAAGAACAGAGCCAG
AGCACAGAAACACCCACCACCCCAACAAGTT
GTTGCGAGGCGACTCCGTGGACAGTGTCAAC
TGTGGGGTTATCAGTGCAGCTGCATGCACAA
ACCAAACAAGGGCTGTCAGTTGTCCTGCAAC
TACACCTATAATACACTTAAAAGGTGATGCAA
ATATATTAAAATGTTTAAGATATAGGCTGTCA
AAATATAAACAATTGTATGAACAAGTGTCATC
TACATGGCATTGGACATGTACAGATGGAAAA
CATAAAAATGCTATTGTAACCTTAACATATAT
AAGTACATCACAAAGAGACGATTTTTTAAATA
CTGTAAAAATACCTAACACAGTATCAGTGTCA
ACAGGATATATGACTATTTAG
TTGTTTTTCCTGAATCTGTATTTAGCAGTGAC
GAAATATCCTTTGCTGGGATTGTTACAAAGCT
ACCAACAGCCAACAACACCACCACATCGAAT
TCCAAAACCTGCGCCTTGGGCACCAGTGAAG
GTGTGCGGCGGGCGACGACGTCTACTAAGC
GACCAAGAACAGAGCCAGAGCACAGAAACA
CCCACCACCCCAACAAGTTGTTGCGAGGCGA
CTCCGTGGACAGTGTCAACTGTGGGGTTATC
AGTGCAGCTGCATGCACAAACCAAACAAGGG
CTGTCAGTTGTCCTGCAACTACACCTATAA
ATGATTGAACTAAATATTTCTACAGTAAGCAT
TGTGCTATGCTTTTTGCTTTGCTTTTGTGTGCT
ACTATTTGTGTGTCTTGTCATACGTCCACTTGT
GCTGTCTGTGTCGGTATATGCAACACTACTAT
TATTAATTGTGATTTTATGGGTTATTGCAACC
TCTCCATTACGTTGTTTTTGTATATATGTTGTG
TTTATATATATTCCATTATTTGTAATTCATACA
CATGCATCTTTTTTAAGTCAACAGTAA
ATGTTCAAAAATCCTGCAGAAAGACCTCGGA
AATTGCATGAACTAAGCTCGGCATTGGAAAT
ACCCTACGATGAACTAAGATTGAATTGTGTCT
ACTGCAAAGGTCAGTTAACAGAAACAGAGGT
ATTAGATTTTGCATTTACAGATTTAACAATAG
TATATAGGGACGACACACCACACGGAGTGTG
TACAAAATGTTTAAGATTTTATTCAAAAGTAA
GTGAATTTAGATGGTATAGATATAGTGTGTA
TGGAACAACATTAGAAAAATTGACAAACAAA
GGTATATGTGATTTGTTAATTAGGTGTATAAC
GTGTCAAAGACCGTTGTGTCCAGAAGAAAAA
CAAAGACATTTGGATAAAAAGAAACGATTCC
ACAACATAGGAGGAAGGTGGACAGGACGTT
GCATAGCATGTTGGAGAAGACCTCGTACTGA
AACCCAAGTGTAA
ATGGAAACTCTGTCCCAGCGACTGAACGTCT
GCCAGGATAAGATTCTGGAACACTACGAAAA
TGATTCTAAAAGACTGTGCGACCACATCGACT
ACTGGAAGCACATTAGACTGGAGTGCGTGCT
GATGTATAAAGCCAGGGAAATGGGCATCCAC
AGCATTAACCATCAGGTGGTCCCCGCACTGTC
AGTGAGCAAGGCCAAAGCTCTGCAGGCCATC
GAGCTGCAGATGATGCTGGAAACCCTGAACA
ATACAGAGTACAAGAATGAAGACTGGACTAT
GCAGCAGACCTCCCTGGAGCTGTACCTGACT
GCCCCTACCGGCTGCCTGAAGAAACACGGGT
ATACAGTGGAAGTCCAGTTCGACGGCGATGT
GCACAACACAATGCATTACACTAACTGGAAG
TTTATCTATCTGTGCATTGATGGGCAGTGTAC
CGTGGTCGAGGGACAGGTGAACTGTAAAGG
CATCTACTATGTCCACGAAGGACATATCACTT
ACTTCGTGAACTTCACCGAGGAAGCTAAGAA
ATATGGAACCGGCAAGAAATGGGAGGTCCA
TGCAGGCGGGCAGGTCATCGTCTTCCCTGAG
TCAGTGTTTAGCTCCGATGAAATCAGCTTCGC
TGGCATTGTCACCAAGCTGCCAACAGCAAAC
AATACCACAACTTCCAACTCTAAAACATGCGC
ACTGGGAACTTCCGAGGGAGTGCGGAGAGC
TACCACATCTACCAAGAGGCCCCGCACAGAG
CCTGAACACCGCAACACCCACCATCCAAACA
AGCTGCTGCGAGGGGACTCTGTGGATAGTGT
CAACTGCGGAGTGATCAGTGCCGCTGCATGT
ACAAATCAGACTAGGGCAGTCAGCTGCCCAG
CCACTACCCCCATCATTCATCTGAAGGGCGAC
GCTAACATTCTGAAATGTCTGCGATACCGGCT
GTCTAAGTACAAACAGCTGTATGAGCAGGTG
TCTAGTACATGGCACTGGACATGTACTGATG
GGAAGCATAAAAATGCCATCGTGACCCTGAC
ATACATTAGTACCTCACAGCGGGACGATTTTC
TGAACACAGTGAAGATCCCCAATACTGTGAG
CGTCTCCACTGGCTATATGACCATT
ATGTTTTTCCTGAACCTGTATCTGGCCGTGAC
AAAGTATCCTCTGCTGGGCCTGCTGCAGTCTT
ATCAGCAGCCTACCACCCCCCCTCACCGAATC
CCAAAGCCTGCACCATGGGCTCCAGTGAAAG
TCTGCGGAGGACGAAGAAGGCTGCTGTCAG
ACCAGGAGCAGAGCCAGTCCACTGAAACCCC
CACCACACCTACAAGCTGCTGTGAGGCAACA
CCCTGGACTGTGTCTACCGTCGGACTGAGTG
TGCAGCTGCACGCCCAGACCAAGCAGGGCCT
GTCTGTGGTCCTGCAGCTGCATCTG
ATGATTGAACTGAATATCTCTACCGTGTCCAT
TGTCCTGTGTTTTCTGCTGTGTTTCTGCGTCCT
GCTGTTTGTCTGCCTGGTCATCCGGCCCCTGG
TGCTGAGCGTGTCCGTCTACGCCACCCTGCTG
CTGCTGATCGTGATTCTGTGGGTCATCGCTAC
ATCCCCCCTGAGATGCTTCTGTATCTACGTGG
TCTTTATCTATATTCCTCTGTTCGTGATCCACA
CCCATGCCTCTTTTCTGAGTCAGCAG
ATGTTTAAGAACCCCGCCGAGAGACCAAGAA
AGCTGCACGAACTGTCATCCGCCCTGGAAAT
CCCCTACGACGAACTGAGGCTGAACTGCGTG
TACTGTAAAGGGCAGCTGACTGAGACCGAAG
TCCTGGACTTCGCCTTTACCGATCTGACAATC
GTGTATAGGGACGATACTCCACACGGAGTCT
GCACCAAATGTCTGCGGTTCTACAGCAAGGT
GTCCGAGTTTAGGTGGTACCGCTATTCTGTCT
ATGGAACCACACTGGAAAAACTGACAAACAA
GGGCATTTGCGACCTGCTGATCAGATGCATT
ACTTGTCAGAGGCCCCTGTGTCCTGAGGAAA
AGCAGCGCCACCTGGATAAGAAAAAGCGATT
CCATAATATCGGAGGACGATGGACCGGACG
ATGCATTGCTTGTTGGCGGAGACCCCGGACA
GAGACTCAGGTG
METLSQRLNVCQDKILEHYENDSKRL
CDHIDYWKHIRLECVLMYKAREMGI
HSINHQVVPALSVSKAKALQAIELQ
MMLETLNNTEYKNEDWTMQQTSL
ELYLTAPTGCLKKHGYTVEVQFDGD
VHNTMHYTNWKFIYLCIDGQCTVVE
GQVNCKGIYYVHEGHITYFVNFTEEA
KKYGTGKKWEVHAGGQVIVFPESVF
SSDEISFAGIVTKLPTANNTTTSNSKT
CALGTSEGVRRATTSTKRPRTEPEHR
NTHHPNKLLRGDSVDSVNCGVISAA
ACTNQTRAVSCPATTPIIHLKGDANIL
KCLRYRLSKYKQLYEQVSSTWHWTC
TDGKHKNAIVTLTYISTSQRDDFLNT
VKIPNTVSVSTGYMTI
HPV31 E4
HPV31 E5
HPV31 E6
Page 18 of 48
% Nucleotide
Change
24.7%
LFFLNLYLAVTKYPLLGLLQSYQQPTT
PPHRIPKPAPWAPVKVCGGRRRLLS
DQEQSQSTETPTTPTSCCEATPWTV
STVGLSVQLHAQTKQGLSVVLQLHL
26.8%
MIELNISTVSIVLCFLLCFCVLLFVCLVI
RPLVLSVSVYATLLLLIVILWVIATSPL
RCFCIYVVFIYIPLFVIHTHASFLSQQ
28.6%
MFKNPAERPRKLHELSSALEIPYDELR
LNCVYCKGQLTETEVLDFAFTDLTIVY
RDDTPHGVCTKCLRFYSKVSEFRWY
RYSVYGTTLEKLTNKGICDLLIRCITCQ
RPLCPEEKQRHLDKKKRFHNIGGRW
TGRCIACWRRPRTETQV
27.5%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV31 E7
ATGCGTGGAGAAACACCTACGTTGCAAGACT
ATGTGTTAGATTTGCAACCTGAGGCAACTGA
CCTCCACTGTTATGAGCAATTACCCGACAGCT
CAGATGAGGAGGATGTCATAGACAGTCCAGC
TGGACAAGCAGAACCGGACACATCCAATTAC
AATATCGTTACCTTTTGTTGTCAGTGTAAGTC
TACACTTCGTTTGTGTGTACAGAGCACACAAG
TAGATATTCGCATATTGCAAGAGCTGTTAATG
GGCTCATTTGGAATCGTGTGCCCCAACTGTTC
TACTAGACTGTAA
ATGTCTCTGTGGCGGCCTAGCGAGGCTACTG
TCTACTTACCACCTGTCCCAGTGTCTAAAGTT
GTAAGCACGGATGAATATGTAACACGAACCA
ACATATATTATCACGCAGGCAGTGCTAGGCT
GCTTACAGTAGGCCATCCATATTATTCCATAC
CTAAATCTGACAATCCTAAAAAAATAGTTGTA
CCAAAGGTGTCAGGATTACAATATAGGGTAT
TTAGGGTTCGTTTACCAGATCCAAACAAATTT
GGATTTCCTGATACATCTTTTTATAATCCTGA
AACTCAACGCTTAGTTTGGGCCTGTGTTGGTT
TAGAGGTAGGTCGCGGGCAGCCATTAGGTG
TAGGTATTAGTGGTCATCCATTATTAAATAAA
TTTGATGACACTGAAAACTCTAATAGATATGC
CGGTGGTCCTGGCACTGATAATAGGGAATGT
ATATCAATGGATTATAAACAAACACAACTGT
GTTTACTTGGTTGCAAACCACCTATTGGAGA
GCATTGGGGTAAAGGTAGTCCTTGTAGTAAC
AATGCTATTACCCCTGGTGATTGTCCTCCATT
AGAATTAAAAAATTCAGTTATACAAGATGGG
GATATGGTTGATACAGGCTTTGGAGCTATGG
ATTTTACTGCTTTACAAGACACTAAAAGTAAT
GTTCCTTTGGACATTTGTAATTCTATTTGTAAA
TATCCAGATTATCTTAAAATGGTTGCTGAGCC
ATATGGCGATACATTATTTTTTTATTTACGTA
GGGAACAAATGTTTGTAAGGCATTTTTTTAAT
AGATCAGGCACGGTTGGTGAATCGGTCCCTA
CTGACTTATATATTAAAGGCTCCGGTTCAACA
GCTACTTTAGCTAACAGTACATACTTTCCTAC
ACCTAGCGGCTCCATGGTTACTTCAGATGCAC
AAATTTTTAATAAACCATATTGGATGCAACGT
GCTCAGGGACACAATAATGGTATTTGTTGGG
GCAATCAGTTATTTGTTACTGTGGTAGATACC
ACACGTAGTACCAATATGTCTGTTTGTGCTGC
AATTGCAAACAGTGATACTACATTTAAAAGTA
GTAATTTTAAAGAGTATTTAAGACATGGTGA
GGAATTTGATTTACAATTTATATTTCAGTTAT
GCAAAATAACATTATCTGCAGACATAATGAC
ATATATTCACAGTATGAATCCTGCTATTTTGG
AAGATTGGAATTTTGGATTGACCACACCTCCC
TCAGGTTCTTTGGAGGATACCTATAGGTTTGT
CACCTCACAGGCCATTACATGTCAAAAAACTG
CCCCCCAAAAGCCCAAGGAAGATCCATTTAA
AGATTATGTATTTTGGGAGGTTAATTTAAAA
GAAAAGTTTTCTGCAGATTTAGATCAGTTTCC
ACTGGGTCGCAAATTTTTATTACAGGCAGGA
TATAGGGCACGTCCTAAATTTAAAGCAGGTA
AACGTAGTGCACCCTCAGCATCTACCACTACA
CCAGCAAAACGTAAAAAAACTAAAAAGTAA
ATGAGAGGAGAAACCCCAACACTGCAGGACT
ATGTGCTGGACCTGCAGCCCGAAGCCACTGA
TCTGCATTGCTACGAACAGCTGCCAGACAGC
TCCGATGAGGAAGACGTGATCGATTCCCCAG
CAGGACAGGCTGAGCCTGACACCAGTAACTA
CAATATTGTCACATTCTGCTGTCAGTGCAAGT
CTACTCTGCGGCTGTGTGTGCAGAGTACCCA
GGTCGATATCAGAATTCTGCAGGAACTGCTG
ATGGGCTCATTTGGGATCGTGTGCCCCAACT
GTAGCACAAGGCTG
ATGTCTCTGTGGCGACCTAGTGAAGCAACTG
TCTACCTGCCTCCTGTCCCTGTGTCCAAAGTG
GTGTCTACCGACGAGTATGTGACTCGGACTA
ATATCTACTATCACGCAGGATCCGCAAGACT
GCTGACCGTGGGGCATCCCTACTATTCTATCC
CTAAGAGTGACAACCCAAAGAAAATTGTGGT
CCCTAAAGTGTCCGGACTGCAGTACAGGGTG
TTCAGGGTCCGCCTGCCAGACCCTAATAAGTT
CGGCTTTCCCGATACATCTTTTTATAACCCTG
AGACTCAGAGGCTGGTGTGGGCATGCGTCG
GACTGGAAGTGGGACGAGGACAGCCACTGG
GAGTGGGAATTTCAGGACACCCTCTGCTGAA
TAAGTTCGACGATACCGAGAACAGCAATCGA
TACGCTGGAGGACCAGGAACAGACAACCGA
GAATGTATCTCTATGGATTATAAACAGACCCA
GCTGTGCCTGCTGGGCTGTAAGCCCCCTATC
GGCGAGCATTGGGGCAAAGGGAGCCCTTGC
TCCAACAATGCCATTACACCAGGCGACTGTCC
ACCCCTGGAACTGAAGAATTCCGTCATCCAG
GACGGGGATATGGTGGATACTGGATTCGGC
GCTATGGACTTTACCGCACTGCAGGATACAA
AAAGTAACGTCCCCCTGGACATCTGCAATTCA
ATCTGTAAGTACCCAGATTATCTGAAGATGGT
GGCCGAGCCCTACGGGGACACACTGTTCTTT
TATCTGCGGAGAGAACAGATGTTCGTGAGAC
ACTTCTTTAATAGGTCCGGAACCGTCGGAGA
GTCTGTGCCAACAGACCTGTACATTAAGGGG
TCTGGAAGTACCGCTACACTGGCAAACTCTAC
TTATTTCCCAACCCCCTCAGGCAGCATGGTGA
CCAGTGATGCACAGATTTTTAATAAGCCTTAC
TGGATGCAGCGGGCCCAGGGACATAACAAT
GGCATCTGCTGGGGGAACCAGCTGTTCGTCA
CAGTGGTCGACACCACAAGATCAACTAACAT
GAGCGTGTGTGCCGCTATCGCCAATAGCGAT
ACTACCTTCAAGAGCTCCAACTTTAAAGAGTA
CCTGAGACACGGCGAGGAATTTGACCTGCAG
TTCATCTTTCAGCTGTGCAAGATTACTCTGAG
CGCTGATATCATGACCTATATTCATTCCATGA
ACCCAGCAATTCTGGAGGACTGGAATTTCGG
GCTGACAACTCCTCCATCCGGATCTCTGGAAG
ATACTTACAGGTTTGTGACCAGCCAGGCCATC
ACATGTCAGAAGACTGCTCCTCAGAAGCCAA
AAGAAGACCCCTTCAAAGATTATGTCTTTTGG
GAGGTGAACCTGAAGGAAAAATTCAGTGCC
GACCTGGATCAGTTCCCTCTGGGACGCAAGT
TTCTGCTGCAGGCAGGATACCGAGCACGACC
AAAGTTTAAAGCCGGCAAGCGATCCGCCCCA
AGTGCTTCAACCACAACTCCCGCTAAACGCAA
GAAAACCAAGAAA
MRGETPTLQDYVLDLQPEATDLHCY
EQLPDSSDEEDVIDSPAGQAEPDTS
NYNIVTFCCQCKSTLRLCVQSTQVDI
RILQELLMGSFGIVCPNCSTRL
HPV31 L1
Page 19 of 48
MSLWRPSEATVYLPPVPVSKVVSTD
EYVTRTNIYYHAGSARLLTVGHPYYSI
PKSDNPKKIVVPKVSGLQYRVFRVRL
PDPNKFGFPDTSFYNPETQRLVWAC
VGLEVGRGQPLGVGISGHPLLNKFD
DTENSNRYAGGPGTDNRECISMDYK
QTQLCLLGCKPPIGEHWGKGSPCSN
NAITPGDCPPLELKNSVIQDGDMVD
TGFGAMDFTALQDTKSNVPLDICNSI
CKYPDYLKMVAEPYGDTLFFYLRREQ
MFVRHFFNRSGTVGESVPTDLYIKGS
GSTATLANSTYFPTPSGSMVTSDAQI
FNKPYWMQRAQGHNNGICWGNQ
LFVTVVDTTRSTNMSVCAAIANSDT
TFKSSNFKEYLRHGEEFDLQFIFQLCK
ITLSADIMTYIHSMNPAILEDWNFGL
TTPPSGSLEDTYRFVTSQAITCQKTAP
QKPKEDPFKDYVFWEVNLKEKFSAD
LDQFPLGRKFLLQAGYRARPKFKAGK
RSAPSASTTTPAKRKKTKK
% Nucleotide
Change
25.9%
27.0%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV31 L2
ATGCGGTCCAAACGCTCTACAAAACGCACTA
AACGTGCGTCTGCTACACAATTATATCAAACA
TGTAAAGCAGCAGGTACTTGTCCATCAGACG
TTATACCTAAAATAGAACATACTACCATTGCA
GACCAAATATTAAGGTATGGTAGTATGGGTG
TTTTTTTTGGTGGGTTGGGTATTGGGTCCGGC
TCTGGTACTGGGGGTCGCACTGGATATGTCC
CTCTTAGTACACGTCCTTCTACAGTATCTGAG
GCAAGTATACCTATTAGACCACCAGTTAGCAT
TGACCCTGTAGGTCCCTTGGACCCCTCTATAG
TAAGTCTTGTTGAAGAATCTGGAATTGTTGAT
GTTGGTGCCCCTGCTCCTATACCACACCCTCC
TACAACATCTGGGTTTGACATTGCTACAACTG
CAGACACAACACCTGCAATTTTAGATGTAACA
AGTGTTAGCACACATGAAAATCCTACTTTTAC
TGATCCATCTGTATTGCAGCCTCCTACACCTG
CAGAAACATCAGGTCATTTACTACTTTCATCA
TCATCTATTAGCACACATAATTATGAGGAAAT
ACCTATGGATACATTTATTGTTTCTACTAATA
ATGAAAACATAACAAGTAGCACACCCATTCC
AGGGGTGCGCCGTCCTGCACGTTTAGGGTTA
TATAGTAAGGCTACACAACAAGTAAAAGTTA
TTGATCCAACGTTTCTTAGTGCTCCAAAACAG
CTAATTACATATGAAAACCCTGCCTATGAAAC
TGTAAATGCTGAAGAATCTTTATACTTTTCCA
ATACATCGCATAATATAGCCCCTGATCCCGAC
TTTCTAGATATTATAGCATTACATAGGCCTGC
CCTTACCTCACGTAGGAACACTGTTAGATATA
GTAGACTAGGTAATAAACAAACTTTGCGCAC
TCGTAGTGGTGCTACTATTGGTGCAAGGGTG
CATTATTATTATGATATTAGTAGTATTAATCCT
GCAGGTGAAAGTATTGAAATGCAACCTTTAG
GGGCGTCTGCAACTACTACTTCTACTTTAAAT
GATGGCTTATATGACATTTATGCAGACACTGA
TTTTACTGTGGATACACCTGCCACACATAATG
TTTCCCCTTCTACTGCTGTACAGTCCACATCTG
CTGTGTCTGCCTATGTACCTACAAATACCACT
GTGCCACTAAGTACAGGTTTTGACATTCCCAT
ATTTTCTGGGCCTGATGTACCTATAGAGCATG
CACCTACACAGGTTTTCCCATTTCCTTTGGCCC
CTACAACGCCACAAGTGTCTATTTTTGTTGAT
GGGGGTGATTTTTATTTGCACCCTAGTTATTA
TATGTTAAAACGTCGACGTAAACGTGTATCAT
ATTTTTTTACAGATGTCTCTGTGGCGGCCTAG
ATGAGAAGCAAAAGAAGCACTAAAAGAACT
AAAAGAGCCTCCGCAACCCAGCTGTATCAGA
CCTGTAAAGCCGCAGGAACTTGTCCATCTGA
CGTGATCCCTAAGATTGAGCACACCACAATC
GCCGATCAGATTCTGCGCTATGGGAGCATGG
GAGTGTTCTTTGGCGGGCTGGGCATTGGGAG
TGGATCAGGCACAGGAGGCAGAACTGGCTA
CGTGCCTCTGAGTACCAGGCCATCCACAGTCT
CTGAAGCCAGTATCCCAATTAGACCCCCTGTG
AGCATCGACCCCGTCGGACCTCTGGATCCAT
CAATCGTGAGCCTGGTCGAGGAAAGCGGAA
TTGTGGACGTCGGAGCACCAGCACCTATCCC
ACACCCACCCACTACCTCCGGCTTCGACATTG
CCACAACTGCTGATACCACACCCGCTATCCTG
GACGTGACTAGCGTCTCCACCCATGAGAACC
CCACCTTTACAGATCCTTCCGTGCTGCAGCCT
CCAACACCCGCAGAAACTTCTGGGCACCTGC
TGCTGAGCTCCTCTAGTATCAGTACCCATAAC
TATGAGGAAATCCCTATGGACACCTTCATTGT
GTCTACAAACAATGAGAATATCACTTCAAGC
ACCCCCATTCCTGGGGTCCGGAGACCAGCTA
GGCTGGGACTGTACTCCAAGGCAACACAGCA
GGTGAAAGTCATTGATCCAACCTTTCTGTCTG
CCCCCAAGCAGCTGATCACCTATGAGAACCC
CGCATACGAAACAGTGAATGCCGAGGAAAG
CCTGTATTTCTCCAACACCTCTCACAATATCGC
CCCAGACCCCGATTTTCTGGATATCATTGCCC
TGCATCGCCCTGCTCTGACTTCTAGGCGCAAC
ACCGTGCGATACAGTCGGCTGGGCAACAAGC
AGACACTGAGGACTCGCAGCGGCGCTACAAT
TGGGGCACGAGTGCACTACTATTACGACATC
TCCTCTATTAACCCAGCCGGAGAGTCCATCGA
AATGCAGCCCCTGGGCGCTAGTGCAACTACC
ACATCAACCCTGAATGACGGCCTGTATGATAT
CTACGCTGACACCGATTTCACAGTGGATACTC
CCGCCACCCATAACGTGTCTCCTAGTACAGCT
GTCCAGTCAACTAGCGCAGTGAGCGCCTACG
TCCCTACTAATACTACCGTGCCACTGTCAACC
GGCTTCGACATCCCTATTTTTAGCGGGCCCGA
TGTGCCTATTGAGCACGCACCAACACAGGTC
TTCCCTTTTCCACTGGCCCCAACAACTCCCCA
GGTGTCAATCTTCGTCGACGGGGGAGATTTT
TATCTGCATCCCAGCTATTACATGCTGAAGCG
ACGGAGAAAACGGGTGAGCTACTTCTTTACC
GACGTGTCCGTCGCCGCT
MRSKRSTKRTKRASATQLYQTCKAA
GTCPSDVIPKIEHTTIADQILRYGSMG
VFFGGLGIGSGSGTGGRTGYVPLSTR
PSTVSEASIPIRPPVSIDPVGPLDPSIV
SLVEESGIVDVGAPAPIPHPPTTSGF
DIATTADTTPAILDVTSVSTHENPTFT
DPSVLQPPTPAETSGHLLLSSSSISTH
NYEEIPMDTFIVSTNNENITSSTPIPG
VRRPARLGLYSKATQQVKVIDPTFLS
APKQLITYENPAYETVNAEESLYFSNT
SHNIAPDPDFLDIIALHRPALTSRRNT
VRYSRLGNKQTLRTRSGATIGARVHY
YYDISSINPAGESIEMQPLGASATTTS
TLNDGLYDIYADTDFTVDTPATHNVS
PSTAVQSTSAVSAYVPTNTTVPLSTG
FDIPIFSGPDVPIEHAPTQVFPFPLAP
TTPQVSIFVDGGDFYLHPSYYMLKRR
RKRVSYFFTDVSVAA
Page 20 of 48
% Nucleotide
Change
28.3%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV33 E1
ATGGCCGATCCTGAAGGTACAAATGGGGCTG
GGATGGGGTGTACTGGTTGGTTTGAGGTAG
AAGCAGTCATAGAGAGAAGAACAGGAGATA
ATATTTCAGAAGATGAGGATGAAACAGCAGA
TGACAGTGGCACGGATTTACTAGAGTTTATA
GATGATTCTATGGAAAATAGTATACAGGCAG
ACACAGAGGCAGCCCGGGCATTGTTTAATAT
ACAGGAAGGGGAGGATGATTTAAATGCTGT
GTGTGCACTAAAACGAAAGTTTGCCGCATGT
TCACAAAGTGCTGCGGAGGACGTTGTTGATC
GTGCTGCAAACCCGTGTAGAACGTCTATTAAT
AAAAATAAAGAATGCACATACAGAAAACGAA
AAATAGATGAGCTAGAAGACAGCGGATATG
GCAATACTGAAGTGGAAACTCAGCAGATGGT
ACAACAGGTAGAAAGTCAAAATGGCGACAC
AAACTTAAATGACTTAGAATCTAGTGGGGTG
GGGGATGATTCAGAAGTAAGCTGTGAGACA
AATGTAGATAGCTGTGAAAATGTTACGTTGC
AGGAAATTAGTAATGTTCTACATAGTAGTAAT
ACAAAAGCAAATATATTATATAAATTTAAAGA
GGCCTATGGAATAAGTTTTATGGAATTAGTA
AGACCATTTAAAAGTGATAAAACAAGCTGTA
CAGATTGGTGTATAACAGGATATGGAATTAG
TCCATCAGTAGCAGAAAGTTTAAAAGTATTA
ATTAAACAGCATAGTTTGTATACTCATTTACA
ATGTTTAACTTGCGATAGAGGAATAATAATAT
TATTGTTAATTAGATTTAGGTGTAGCAAAAAC
AGGTTAACAGTAGCAAAACTAATGAGTAATT
TATTATCAATACCTGAAACATGTATGGTTATA
GAGCCACCAAAATTACGGAGCCAAACATGTG
CATTGTATTGGTTTAGAACAGCAATGTCAAAC
ATTAGTGATGTACAAGGTACAACACCTGAAT
GGATAGATAGACTAACTGTTTTACAACATAG
CTTTAATGATAATATATTTGATTTAAGTGAAA
TGGTACAGTGGGCATATGATAACGAGTTAAC
GGACGATAGTGACATTGCATATTATTATGCAC
AACTTGCAGATTCAAATAGTAATGCTGCTGCA
TTTTTAAAAAGTAACTCACAAGCAAAAATAGT
AAAGGACTGTGGAATAATGTGTAGACATTAT
AAAAAAGCAGAAAAACGTAAAATGTCAATAG
GACAATGGATACAAAGTAGATGTGAAAAAAC
AAATGATGGAGGAAATTGGAGACCAATAGT
ACAGTTGTTAAGATATCAAAACATTGAATTTA
CAGCATTTTTAGGTGCATTTAAAAAGTTTTTA
AAAGGTATACCAAAAAAAAGCTGTATGCTAA
TTTGTGGACCAGCAAATACAGGAAAGTCATA
TTTTGGAATGAGTTTAATACAGTTTTTAAAAG
GGTGTGTTATATCATGTGTAAATTCTAAAAGT
CACTTTTGGTTGCAGCCATTATCAGATGCAAA
AATAGGAATGATAGATGATGTAACGCCAATA
AGTTGGACATATATAGATGATTACATGAGAA
ATGCGTTAGATGGAAATGAAATTTCAATAGA
TGTGAAACATAGGGCATTAGTGCAATTAAAA
TGTCCACCACTGCTTCTTACCTCAAATACAAA
TGCAGGCACAGACTCTAGATGGCCATATTTA
CATAGTAGATTAACAGTATTTGAATTTAAAAA
TCCATTCCCATTTGATGAAAATGGTAACCCAG
TGTATGCAATAAATGATGAAAATTGGAAATC
CTTTTTCTCAAGGACGTGGTGCAAATTAGATT
TAATAGAGGAAGAGGACAAGGAAAACCATG
GAGGAAATATCAGCACGTTTAAATGCAGTGC
AGGAGAAAATACTAGATCTTTACGAAGCTGA
ATGGCTGACCCTGAAGGCACTAACGGGGCTG
GGATGGGCTGTACTGGCTGGTTTGAGGTGG
AGGCTGTGATTGAAAGACGGACTGGGGACA
ACATCAGCGAAGACGAGGATGAAACAGCCG
ACGATTCCGGGACTGATCTGCTGGAGTTCAT
TGACGATTCTATGGAAAATAGTATCCAGGCA
GACACCGAGGCAGCTCGAGCACTGTTCAACA
TCCAGGAGGGAGAAGACGATCTGAATGCAG
TGTGCGCCCTGAAGAGAAAATTTGCAGCCTG
TTCACAGAGCGCTGCAGAGGACGTGGTCGAT
AGAGCCGCTAACCCTTGCAGGACATCCATTA
ACAAGAACAAGGAATGTACTTATCGGAAGAG
AAAAATCGACGAGCTGGAAGATAGTGGGTA
CGGAAACACCGAGGTGGAAACACAGCAGAT
GGTGCAGCAGGTCGAGAGCCAGAATGGCGA
CACAAACCTGAATGATCTGGAAAGCTCCGGC
GTGGGGGACGATTCAGAGGTCAGCTGCGAA
ACCAACGTGGATTCTTGTGAGAATGTCACACT
GCAGGAAATCAGTAATGTGCTGCACTCTAGT
AACACCAAGGCCAACATCCTGTACAAGTTTA
AAGAGGCTTACGGCATCAGCTTCATGGAACT
GGTGCGGCCTTTTAAGTCAGACAAAACTAGC
TGCACCGATTGGTGTATTACAGGATATGGCA
TCTCCCCATCTGTGGCCGAGTCCCTGAAGGTC
CTGATCAAGCAGCACTCTCTGTACACCCATCT
GCAGTGCCTGACATGTGACCGCGGGATCATT
ATCCTGCTGCTGATCAGGTTCCGCTGCAGCA
AGAACCGACTGACCGTGGCCAAACTGATGTC
CAATCTGCTGTCTATTCCAGAGACATGCATGG
TCATCGAACCCCCTAAGCTGCGATCCCAGACT
TGTGCTCTGTATTGGTTTCGGACCGCAATGTC
CAACATTTCTGACGTGCAGGGCACCACACCC
GAGTGGATCGATAGGCTGACAGTCCTGCAGC
ACAGTTTCAACGACAATATTTTTGATCTGTCA
GAGATGGTGCAGTGGGCATACGACAACGAA
CTGACTGACGATTCTGATATCGCCTACTATTA
CGCTCAGCTGGCAGATAGTAACTCAAATGCA
GCCGCTTTCCTGAAAAGCAATTCCCAGGCCA
AGATTGTGAAAGACTGCGGCATCATGTGTAG
GCATTATAAGAAAGCTGAGAAGCGCAAAATG
TCTATTGGGCAGTGGATCCAGAGTCGCTGCG
AAAAGACTAACGACGGCGGGAATTGGCGCC
CCATTGTGCAGCTGCTGCGATACCAGAACAT
CGAGTTCACCGCCTTTCTGGGGGCTTTCAAG
AAATTTCTGAAAGGAATTCCCAAGAAAAGCT
GCATGCTGATCTGTGGGCCTGCTAACACCGG
AAAGAGTTACTTCGGCATGTCACTGATTCAGT
TTCTGAAAGGATGCGTGATCTCATGTGTCAAT
TCTAAGAGTCACTTTTGGCTGCAGCCACTGTC
CGATGCCAAGATTGGCATGATCGACGATGTG
ACTCCCATTTCTTGGACCTATATCGACGATTA
CATGAGAAACGCTCTGGACGGGAATGAGATT
TCCATCGATGTGAAGCACAGGGCACTGGTCC
AGCTGAAATGTCCACCCCTGCTGCTGACATCT
AACACTAATGCAGGAACAGACTCACGGTGGC
CCTATCTGCATAGCAGACTGACTGTGTTCGA
GTTTAAGAACCCTTTCCCATTTGACGAAAACG
GCAATCCTGTCTACGCCATCAACGATGAGAA
TTGGAAGAGTTTCTTTTCAAGAACCTGGTGCA
AACTGGACCTGATTGAGGAAGAGGATAAGG
AGAACCATGGAGGCAATATCAGCACTTTCAA
GTGTTCCGCCGGCGAAAATACCCGAAGCCTG
CGGTCC
MADPEGTNGAGMGCTGWFEVEAV
IERRTGDNISEDEDETADDSGTDLLE
FIDDSMENSIQADTEAARALFNIQEG
EDDLNAVCALKRKFAACSQSAAEDV
VDRAANPCRTSINKNKECTYRKRKID
ELEDSGYGNTEVETQQMVQQVESQ
NGDTNLNDLESSGVGDDSEVSCETN
VDSCENVTLQEISNVLHSSNTKANILY
KFKEAYGISFMELVRPFKSDKTSCTD
WCITGYGISPSVAESLKVLIKQHSLYT
HLQCLTCDRGIIILLLIRFRCSKNRLTV
AKLMSNLLSIPETCMVIEPPKLRSQT
CALYWFRTAMSNISDVQGTTPEWID
RLTVLQHSFNDNIFDLSEMVQWAY
DNELTDDSDIAYYYAQLADSNSNAA
AFLKSNSQAKIVKDCGIMCRHYKKAE
KRKMSIGQWIQSRCEKTNDGGNW
RPIVQLLRYQNIEFTAFLGAFKKFLKGI
PKKSCMLICGPANTGKSYFGMSLIQF
LKGCVISCVNSKSHFWLQPLSDAKIG
MIDDVTPISWTYIDDYMRNALDGNE
ISIDVKHRALVQLKCPPLLLTSNTNAG
TDSRWPYLHSRLTVFEFKNPFPFDEN
GNPVYAINDENWKSFFSRTWCKLDL
IEEEDKENHGGNISTFKCSAGENTRS
LRS
Page 21 of 48
% Nucleotide
Change
27.0%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV33 E2
ATGGAGGAAATATCAGCACGTTTAAATGCAG
TGCAGGAGAAAATACTAGATCTTTACGAAGC
TGATAAAACTGATTTACCATCACAAATTGAAC
ATTGGAAACTGATACGCATGGAGTGTGCTTT
ATTGTATACAGCCAAACAAATGGGATTTTCAC
ATTTATGCCACCAGGTGGTGCCTTCTTTGTTA
GCATCAAAGACCAAAGCATTTCAAGTAATTG
AACTACAAATGGCATTAGAGACATTAAGTAA
ATCACAGTATAGTACAAGCCAATGGACATTG
CAACAAACAAGCTTAGAGGTGTGGCTTTGTG
AACCACCAAAATGTTTTAAAAAACAAGGAGA
AACAGTAACTGTGCAATATGACAATGACAAA
AAAAATACAATGGATTATACAAACTGGGGTG
AAATATATATTATAGAGGAAGATACATGTAC
TATGGTTACAGGGAAAGTAGATTATATAGGT
ATGTATTATATACATAACTGTGAAAAGGTATA
TTTTAAATATTTTAAAGAGGATGCTGCAAAGT
ATTCTAAAACACAAATGTGGGAAGTACATGT
GGGTGGTCAGGTAATTGTTTGTCCTACGTCTA
TATCTAGCAACCAAATATCCACTACTGAAACT
GCTGACATACAGACAGACAACGATAACCGAC
CACCACAAGCAGCGGCCAAACGACGACGACC
TGCAGACACCACAGACACCGCCCAGCCCCTT
ACAAAGCTGTTCTGTGCAGACCCCGCCTTGG
ACAATAGAACAGCACGTACTGCAACTAACTG
CACAAACAAGCAGCGGACTGTGTGTAGTTCT
AACGTTGCACCTATAGTGCATTTAAAAGGTG
AATCAAATAGTTTAAAATGTTTAAGATACAGA
TTAAAACCTTATAAAGAGTTGTATAGTTCTAT
GTCATCCACCTGGCATTGGACCAGTGACAAC
AAAAATAGTAAAAATGGAATTGTAACTGTAA
CATTTGTAACTGAACAGCAACAACAAATGTTT
TTAGGTACCGTAAAAATACCACCTACTGTGCA
AATAAGTACTGGATTTATGACATTATAA
MEEISARLNAVQEKILDLYEADKTDL
PSQIEHWKLIRMECALLYTAKQMGF
SHLCHQVVPSLLASKTKAFQVIELQM
ALETLSKSQYSTSQWTLQQTSLEVW
LCEPPKCFKKQGETVTVQYDNDKKN
TMDYTNWGEIYIIEEDTCTMVTGKV
DYIGMYYIHNCEKVYFKYFKEDAAKY
SKTQMWEVHVGGQVIVCPTSISSN
QISTTETADIQTDNDNRPPQAAAKR
RRPADTTDTAQPLTKLFCADPALDN
RTARTATNCTNKQRTVCSSNVAPIV
HLKGESNSLKCLRYRLKPYKELYSSMS
STWHWTSDNKNSKNGIVTVTFVTE
QQQQMFLGTVKIPPTVQISTGFMTL
HPV33 E4
TTGTTTGTCCTACGTCTATATCTAGCAACCAA
ATATCCACTACTGAAACTGCTGACATACAGAC
AGACAACGATAACCGACCACCACAAGCAGCG
GCCAAACGACGACGACCTGCAGACACCACAG
ACACCGCCCAGCCCCTTACAAAGCTGTTCTGT
GCAGACCCCGCCTTGGACAATAGAACAGCAC
GTACTGCAACTAACTGCACAAACAAGCAGCG
GACTGTGTGTAGTTCTAACGTTGCACCTATAG
ATGATATTTGTTTTTGTATTATGTTTTATATTG
TTTTTATGCTTATCCTTATTATTACGTCCTTTA
ATACTTTCCATTTCTACCTATGCTTGGTTGCTG
GTGTTGGTATTGCTGCTTTGGGTGTTTGTGG
GATCTCCTTTAAAAATTTTTTTTTGCTATTTGT
TGTTTTTATATTTACCAATGATGTGTATTAATT
TTCATGCACAGCATATGACACAACAAGAGTA
A
ATGTTTCAAGACACTGAGGAAAAACCACGAA
CATTGCATGATTTGTGCCAAGCATTGGAGAC
AACTATACACAACATTGAACTACAGTGCGTG
GAATGCAAAAAACCTTTGCAACGATCTGAGG
TATATGATTTTGCATTTGCAGATTTAACAGTT
GTATATAGAGAGGGAAATCCATTTGGAATAT
GTAAACTGTGTTTGCGGTTCTTATCTAAAATT
AGTGAATATAGACATTATAATTATTCTGTATA
TGGAAATACATTAGAACAAACAGTTAAAAAA
CCTTTAAATGAAATATTAATTAGGTGTATTAT
ATGTCAAAGACCTTTGTGTCCTCAAGAAAAA
AAACGACATGTGGATTTAAACAAACGATTTC
ATAATATTTCGGGTCGTTGGGCAGGGCGCTG
TGCGGCGTGTTGGAGGTCCCGACGTAGAGA
AACTGCACTGTGA
ATGAGAGGACACAAGCCAACGTTAAAGGAA
TATGTTTTAGATTTATATCCTGAACCAACTGA
CCTATACTGCTATGAGCAATTAAGTGACAGCT
CAGATGAGGATGAAGGCTTGGACCGGCCAG
ATGGACAAGCACAACCAGCCACAGCTGATTA
CTACATTGTAACCTGTTGTCACACTTGTAACA
CCACAGTTCGTTTATGTGTCAACAGTACAGCA
ATGGAGGAGATTAGCGCAAGACTGAACGCC
GTGCAGGAAAAGATTCTGGACCTGTATGAAG
CCGACAAGACCGACCTGCCAAGCCAGATCGA
GCACTGGAAGCTGATTCGAATGGAATGCGCC
CTGCTGTACACCGCTAAACAGATGGGGTTCA
GCCACCTGTGTCATCAGGTGGTCCCATCACTG
CTGGCAAGCAAGACTAAAGCCTTTCAGGTCA
TCGAGCTGCAGATGGCCCTGGAAACCCTGAG
TAAGTCACAGTACAGCACCTCCCAGTGGACA
CTGCAGCAGACTTCCCTGGAAGTGTGGCTGT
GCGAACCCCCTAAGTGTTTCAAGAAACAGGG
CGAGACAGTGACTGTCCAGTATGACAACGAT
AAGAAAAATACCATGGACTACACAAACTGGG
GGGAAATCTATATCATTGAGGAAGACACCTG
CACAATGGTGACCGGAAAGGTCGATTACATC
GGCATGTACTATATTCACAACTGTGAGAAGG
TGTACTTCAAGTACTTCAAGGAAGACGCCGC
TAAGTATTCTAAAACACAGATGTGGGAGGTG
CATGTCGGCGGGCAGGTCATCGTCTGCCCCA
CTAGTATCAGCTCCAACCAGATTAGCACCACA
GAAACCGCTGATATTCAGACAGACAACGATA
ATAGGCCACCACAGGCAGCAGCTAAGCGAA
GAAGGCCAGCTGACACTACCGATACTGCACA
GCCTCTGACCAAACTGTTCTGCGCAGACCCA
GCCCTGGATAATCGGACAGCTAGAACTGCAA
CCAACTGCACAAATAAGCAGCGCACTGTGTG
TTCTAGTAACGTGGCCCCCATCGTCCACCTGA
AGGGAGAGTCTAATAGTCTGAAATGTCTGCG
CTATCGACTGAAGCCTTACAAAGAACTGTATT
CAAGCATGTCCTCTACTTGGCATTGGACCTCC
GATAACAAGAATTCTAAAAACGGCATTGTGA
CAGTCACTTTCGTGACCGAGCAGCAGCAGCA
GATGTTTCTGGGGACCGTGAAGATCCCTCCA
ACAGTCCAGATTAGCACAGGCTTCATGACTCT
G
ATGTTCGTGCTGAGGCTGTACCTGGCAACTA
AGTATCCCCTGCTGAAACTGCTGACCTACCGA
CAGACCACCATCACTGACCATCACAAGCAGC
GGCCCAACGACGATGACCTGCAGACCCCTCA
GACACCCCCTTCCCCACTGCAGTCTTGCAGTG
TGCAGACACCACCCTGGACTATCGAGCAGCA
CGTCCTGCAGCTGACTGCCCAGACCAGCTCC
GGCCTGTGTGTGGTCCTGACCCTGCATCTG
ATGATTTTTGTGTTTGTCCTGTGTTTTATCCTG
TTTCTGTGCCTGAGCCTGCTGCTGAGACCACT
GATTCTGTCCATTTCTACTTATGCCTGGCTGCT
GGTGCTGGTCCTGCTGCTGTGGGTGTTCGTC
GGCAGCCCCCTGAAGATCTTCTTTTGCTACCT
GCTGTTCCTGTATCTGCCTATGATGTGTATTA
ACTTTCACGCTCAGCATATGACCCAGCAGGA
G
ATGTTTCAGGACACCGAGGAGAAGCCAAGA
ACTCTGCATGATCTGTGCCAGGCTCTGGAGA
CCACCATTCACAATATCGAACTGCAGTGCGTG
GAGTGTAAGAAACCACTGCAGCGCAGCGAA
GTCTACGACTTCGCATTTGCCGATCTGACTGT
GGTCTATCGGGAGGGCAACCCCTTCGGGATC
TGCAAGCTGTGTCTGCGATTTCTGAGCAAAA
TTTCCGAATACAGGCACTACAACTATTCTGTG
TATGGGAATACCCTGGAGCAGACAGTCAAGA
AACCCCTGAATGAAATCCTGATTCGGTGCATC
ATTTGTCAGAGACCCCTGTGCCCTCAGGAGA
AGAAAAGGCACGTGGACCTGAACAAGCGCTT
CCATAATATCTCTGGACGATGGGCTGGACGA
TGCGCAGCTTGTTGGAGAAGTCGGAGAAGG
GAAACCGCCCTG
ATGCGAGGCCACAAGCCCACCCTGAAAGAGT
ATGTCCTGGACCTGTATCCCGAGCCCACCGA
CCTGTATTGTTATGAGCAGCTGTCAGACAGCT
CCGACGAGGATGAAGGACTGGACAGGCCAG
ATGGACAGGCTCAGCCTGCAACCGCTGATTA
CTATATCGTGACTTGCTGTCACACCTGCAACA
CCACAGTGCGGCTGTGTGTCAATTCTACAGC
HPV33 E5
HPV33 E6
HPV33 E7
Page 22 of 48
% Nucleotide
Change
26.8%
LFVLRLYLATKYPLLKLLTYRQTTITDH
HKQRPNDDDLQTPQTPPSPLQSCSV
QTPPWTIEQHVLQLTAQTSSGLCVV
LTLHL
27.2%
MIFVFVLCFILFLCLSLLLRPLILSISTYA
WLLVLVLLLWVFVGSPLKIFFCYLLFL
YLPMMCINFHAQHMTQQE
26.1%
MFQDTEEKPRTLHDLCQALETTIHNI
ELQCVECKKPLQRSEVYDFAFADLTV
VYREGNPFGICKLCLRFLSKISEYRHY
NYSVYGNTLEQTVKKPLNEILIRCIICQ
RPLCPQEKKRHVDLNKRFHNISGRW
AGRCAACWRSRRRETAL
26.9%
MRGHKPTLKEYVLDLYPEPTDLYCYE
QLSDSSDEDEGLDRPDGQAQPATA
DYYIVTCCHTCNTTVRLCVNSTASDL
RTIQQLLMGTVNIVCPTCAQQ
24.0%
Gene
HPV33 L1
PaVE DNA Sequence
Optimized DNA Sequence
AGTGACCTACGAACCATACAGCAACTACTTAT
GGGCACAGTGAATATTGTGTGCCCTACCTGT
GCACAACAATAA
AAGCGACCTGAGAACTATCCAGCAGCTGCTG
ATGGGCACCGTGAACATTGTCTGCCCCACAT
GTGCCCAGCAG
ATGTCCGTGTGGCGGCCTAGTGAGGCCACAG
TGTACCTGCCTCCTGTACCTGTATCTAAAGTT
GTCAGCACTGATGAATATGTGTCTCGCACAA
GCATTTATTATTATGCTGGTAGTTCCAGACTT
CTTGCTGTTGGCCATCCATATTTTTCTATTAAA
AATCCTACTAACGCTAAAAAATTATTGGTACC
CAAAGTATCAGGCTTGCAATATAGGGTTTTTA
GGGTCCGTTTACCAGATCCTAATAAATTTGGA
TTTCCTGACACCTCCTTTTATAACCCTGATACA
CAACGATTAGTATGGGCATGTGTAGGCCTTG
AAATAGGTAGAGGGCAGCCATTAGGCGTTG
GCATAAGTGGTCATCCTTTATTAAACAAATTT
GATGACACTGAAACCGGTAACAAGTATCCTG
GACAACCGGGTGCTGATAATAGGGAATGTTT
ATCCATGGATTATAAACAAACACAGTTATGTT
TACTTGGATGTAAGCCTCCAACAGGGGAACA
TTGGGGTAAAGGTGTTGCTTGTACTAATGCA
GCACCTGCCAATGATTGTCCACCTTTAGAACT
TATAAATACTATTATTGAGGATGGTGATATG
GTGGACACAGGATTTGGTTGCATGGATTTTA
AAACATTGCAGGCTAATAAAAGTGATGTTCC
TATTGATATTTGTGGCAGTACATGCAAATATC
CAGATTATTTAAAAATGACTAGTGAGCCTTAT
GGTGATAGTTTATTTTTCTTTCTTCGACGTGA
ACAAATGTTTGTAAGACACTTTTTTAATAGGG
CTGGTACATTAGGAGAGGCTGTTCCCGATGA
CCTGTACATTAAAGGTTCAGGAACTACTGCCT
CTATTCAAAGCAGTGCTTTTTTTCCCACTCCTA
GTGGATCAATGGTTACTTCCGAATCTCAGTTA
TTTAATAAGCCATATTGGCTACAACGTGCACA
AGGTCATAATAATGGTATTTGTTGGGGCAAT
CAGGTATTTGTTACTGTGGTAGATACCACTCG
CAGTACTAATATGACTTTATGCACACAAGTAA
CTAGTGACAGTACATATAAAAATGAAAATTTT
AAAGAATATATAAGACATGTTGAAGAATATG
ATCTACAGTTTGTTTTTCAACTATGCAAAGTT
ACCTTAACTGCAGAAGTTATGACATATATTCA
TGCTATGAATCCAGATATTTTAGAAGATTGGC
AATTTGGTTTAACACCTCCTCCATCTGCTAGTT
TACAGGATACCTATAGGTTTGTTACCTCTCAG
GCTATTACGTGTCAAAAAACAGTACCTCCAAA
GGAAAAGGAAGACCCCTTAGGTAAATATACA
TTTTGGGAAGTGGATTTAAAGGAAAAATTTT
CAGCAGATTTAGATCAGTTTCCTTTGGGACGC
AAGTTTTTATTACAGGCAGGTCTTAAAGCAAA
ACCTAAACTTAAACGTGCAGCCCCCACATCCA
CCCGCACATCGTCTGCAAAACGCAAAAAGGT
TAAAAAATAA
ATGTCAGTGTGGAGACCCAGCGAGGCTACCG
TGTATCTGCCCCCAGTCCCCGTGAGCAAAGT
GGTGTCAACCGATGAGTATGTGAGCCGCACC
TCCATCTACTATTACGCTGGAAGCTCCCGACT
GCTGGCAGTGGGACACCCCTATTTTAGCATT
AAGAACCCTACAAATGCCAAGAAACTGCTGG
TGCCTAAAGTCTCCGGGCTGCAGTATAGGGT
GTTTAGGGTCCGCCTGCCCGACCCTAACAAG
TTTGGATTCCCAGACACATCTTTCTACAATCC
CGATACTCAGCGACTGGTGTGGGCATGCGTC
GGACTGGAGATCGGAAGAGGACAGCCACTG
GGAGTGGGCATTAGTGGACACCCTCTGCTGA
ACAAGTTCGACGATACAGAGACTGGCAACAA
GTATCCTGGGCAGCCAGGAGCTGACAACCGC
GAATGTCTGAGCATGGATTACAAGCAGACCC
AGCTGTGCCTGCTGGGCTGTAAGCCCCCTAC
AGGCGAGCATTGGGGGAAAGGAGTGGCCTG
CACTAACGCCGCTCCAGCTAATGACTGTCCAC
CCCTGGAGCTGATCAACACCATCATTGAAGA
CGGCGATATGGTCGACACTGGCTTTGGGTGC
ATGGATTTCAAGACCCTGCAGGCCAACAAGA
GTGACGTGCCCATCGATATTTGCGGCTCAAC
CTGTAAGTATCCAGACTACCTGAAAATGACTT
CCGAGCCCTATGGGGATTCTCTGTTCTTTTTC
CTGCGGAGAGAACAGATGTTTGTCCGACACT
TTTTCAACCGAGCAGGAACCCTGGGAGAGGC
TGTGCCCGACGATCTGTACATCAAGGGATCA
GGCACCACAGCAAGCATTCAGTCTAGTGCCT
TTTTCCCAACCCCCTCCGGCTCTATGGTGACA
AGTGAATCACAGCTGTTTAATAAGCCTTACTG
GCTGCAGCGAGCCCAGGGACATAACAATGG
CATCTGCTGGGGGAACCAGGTGTTCGTCACT
GTGGTCGACACTACCCGCTCTACTAATATGAC
CCTGTGTACACAGGTCACTAGCGATTCCACAT
ACAAGAACGAGAACTTCAAGGAATACATTCG
GCACGTGGAGGAATACGACCTGCAGTTTGTG
TTCCAGCTGTGCAAGGTCACCCTGACAGCAG
AAGTGATGACCTACATCCATGCCATGAATCCC
GACATTCTGGAAGATTGGCAGTTTGGACTGA
CACCTCCACCCTCTGCTAGTCTGCAGGATACT
TATAGATTCGTCACCAGCCAGGCAATCACCTG
TCAGAAGACAGTGCCTCCAAAGGAGAAAGA
AGACCCTCTGGGCAAATACACCTTTTGGGAG
GTGGATCTGAAGGAAAAATTCAGCGCCGACC
TGGATCAGTTTCCACTGGGCAGGAAGTTCCT
GCTGCAGGCTGGGCTGAAGGCAAAACCTAA
GCTGAAACGCGCAGCCCCAACTTCCACCAGA
ACATCAAGCGCTAAAAGGAAGAAAGTGAAG
AAA
Page 23 of 48
Protein Sequence
MSVWRPSEATVYLPPVPVSKVVSTD
EYVSRTSIYYYAGSSRLLAVGHPYFSIK
NPTNAKKLLVPKVSGLQYRVFRVRLP
DPNKFGFPDTSFYNPDTQRLVWAC
VGLEIGRGQPLGVGISGHPLLNKFDD
TETGNKYPGQPGADNRECLSMDYK
QTQLCLLGCKPPTGEHWGKGVACT
NAAPANDCPPLELINTIIEDGDMVDT
GFGCMDFKTLQANKSDVPIDICGST
CKYPDYLKMTSEPYGDSLFFFLRREQ
MFVRHFFNRAGTLGEAVPDDLYIKG
SGTTASIQSSAFFPTPSGSMVTSESQ
LFNKPYWLQRAQGHNNGICWGNQ
VFVTVVDTTRSTNMTLCTQVTSDST
YKNENFKEYIRHVEEYDLQFVFQLCK
VTLTAEVMTYIHAMNPDILEDWQF
GLTPPPSASLQDTYRFVTSQAITCQK
TVPPKEKEDPLGKYTFWEVDLKEKFS
ADLDQFPLGRKFLLQAGLKAKPKLKR
AAPTSTRTSSAKRKKVKK
% Nucleotide
Change
26.4%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV33 L2
ATGAGACACAAACGATCTACAAGGCGCAAGC
GTGCATCTGCAACACAACTATACCAAACATGC
AAGGCCACAGGCACCTGCCCACCCGATGTTA
TTCCTAAAGTGGAAGGAAGTACCATAGCAGA
TCAAATTCTTAAATATGGCAGTTTAGGGGTTT
TTTTTGGTGGTTTAGGTATTGGCACAGGCTCT
GGTTCAGGTGGAAGGACTGGCTATGTACCTA
TTGGTACTGACCCACCTACAGCTGCAATCCCC
TTGCAGCCTATACGTCCTCCGGTTACTGTAGA
CACTGTTGGACCTTTAGACTCGTCTATAGTGT
CATTAATAGAAGAAACAAGTTTTATAGAGGC
AGGTGCACCAGCCCCATCTATTCCTACACCAT
CAGGTTTTGATGTTACTACATCTGCAGATACT
ACACCTGCAATTATTAATGTTTCATCTGTTGG
GGAGTCATCTATTCAAACTATTTCTACACATT
TAAATCCCACATTTACTGAACCATCTGTACTA
CACCCTCCAGCGCCTGCAGAAGCCTCTGGAC
ATTTTATATTTTCTTCCCCTACTGTTAGCACAC
AAAGTTATGAAAACATACCAATGGATACCTTT
GTTGTTTCCACAGACAGTAGTAATGTAACATC
AAGCACGCCCATTCCAGGGTCTCGCCCTGTG
GCACGCCTTGGTTTATATAGTCGCAATACCCA
ACAGGTTAAGGTTGTTGACCCTGCTTTTTTAA
CATCGCCTCATAAACTTATAACATATGATAAT
CCTGCATTTGAAAGCTTTGACCCTGAAGACAC
ATTACAATTTCAACATAGTGATATATCACCTG
CTCCTGATCCTGACTTTCTAGATATTATTGCAT
TACATAGGCCTGCTATTACATCTCGTAGACAT
ACTGTGCGTTTTAGTAGAGTAGGTCAAAAAG
CCACACTTAAAACTCGCAGTGGTAAACAAATT
GGAGCTAGAATACATTATTATCAGGATTTAA
GTCCTATTGTGCCTTTAGACCACACCGTGCCA
AATGAACAATATGAATTACAGCCTTTACATGA
TACTTCTACATCGTCTTATAGTATTAATGATG
GTTTGTATGATGTTTATGCTGACGATGTGGAT
AATGTACACACCCCAATGCAACACTCATACAG
TACGTTTGCAACAACACGTACCAGCAATGTGT
CTATACCTTTAAATACAGGATTTGATACTCCT
GTTATGTCTGGCCCTGATATACCTTCCCCTTTA
TTTCCCACATCTAGCCCATTTGTTCCTATTTCG
CCTTTTTTTCCTTTTGACACCATTGTTGTAGAC
GGTGCTGACTTTGTTTTACATCCTAGTTATTTT
ATTTTACGTCGCAGGCGTAAACGTTTTCCATA
TTTTTTTACAGATGTCCGTGTGGCGGCCTAG
ATGCGACACAAGAGAAGCACCAGAAGGAAG
AGAGCAAGCGCCACCCAGCTGTATCAGACCT
GTAAAGCCACCGGGACCTGCCCTCCAGACGT
GATCCCCAAGGTCGAGGGCAGTACCATCGCC
GATCAGATTCTGAAATACGGATCACTGGGCG
TGTTCTTTGGAGGACTGGGAATCGGAACTGG
ATCCGGATCTGGAGGACGAACCGGATATGTG
CCAATTGGGACTGACCCACCTACCGCAGCTAT
CCCACTGCAGCCCATTCGCCCACCCGTGACAG
TCGACACTGTGGGCCCCCTGGATAGCTCCAT
CGTCAGCCTGATTGAGGAAACATCCTTCATCG
AGGCTGGAGCACCAGCACCTTCCATTCCAACT
CCCTCTGGGTTTGACGTGACCACATCCGCTGA
TACTACCCCTGCAATCATTAACGTGTCTAGTG
TCGGAGAATCAAGCATCCAGACCATTAGCAC
ACATCTGAATCCTACCTTCACAGAGCCATCCG
TGCTGCACCCTCCAGCTCCAGCAGAAGCCTCT
GGCCATTTCATCTTTTCCTCTCCAACTGTGAG
CACCCAGTCCTACGAGAACATTCCCATGGAC
ACCTTTGTGGTCAGCACAGATAGTTCAAATGT
GACAAGCTCCACTCCTATCCCAGGATCCCGAC
CAGTCGCACGACTGGGACTGTACTCTAGAAA
CACTCAGCAGGTGAAGGTGGTCGACCCAGCT
TTCCTGACCAGTCCCCATAAACTGATCACATA
TGATAATCCTGCATTCGAGTCTTTTGACCCAG
AAGATACACTGCAGTTCCAGCACTCTGACATC
AGTCCCGCTCCTGACCCAGATTTTCTGGATAT
CATTGCCCTGCACAGGCCTGCTATTACTTCAC
GGAGACATACCGTGAGATTCAGCAGGGTCG
GGCAGAAGGCAACCCTGAAAACACGGTCCG
GGAAGCAGATCGGAGCCAGAATTCACTACTA
TCAGGACCTGAGCCCTATCGTGCCACTGGAT
CATACCGTCCCCAACGAGCAGTACGAACTGC
AGCCTCTGCACGACACTAGTACCTCTAGTTAT
TCAATTAACGACGGCCTGTACGATGTGTATG
CCGACGATGTGGATAATGTCCACACCCCCAT
GCAGCATAGTTATTCAACATTCGCTACAACTC
GGACTTCAAACGTGAGCATCCCTCTGAATAC
AGGGTTTGACACTCCCGTGATGTCTGGACCT
GATATTCCCAGTCCTCTGTTCCCCACCTCAAG
CCCCTTTGTGCCTATCTCCCCATTCTTTCCCTT
CGACACAATTGTGGTCGACGGCGCCGATTTC
GTGCTGCACCCTAGCTACTTTATCCTGAGGCG
CCGACGGAAAAGGTTTCCATATTTCTTTACCG
ATGTGCGCGTCGCAGCC
MRHKRSTRRKRASATQLYQTCKATG
TCPPDVIPKVEGSTIADQILKYGSLGV
FFGGLGIGTGSGSGGRTGYVPIGTDP
PTAAIPLQPIRPPVTVDTVGPLDSSIV
SLIEETSFIEAGAPAPSIPTPSGFDVTT
SADTTPAIINVSSVGESSIQTISTHLNP
TFTEPSVLHPPAPAEASGHFIFSSPTV
STQSYENIPMDTFVVSTDSSNVTSST
PIPGSRPVARLGLYSRNTQQVKVVD
PAFLTSPHKLITYDNPAFESFDPEDTL
QFQHSDISPAPDPDFLDIIALHRPAIT
SRRHTVRFSRVGQKATLKTRSGKQIG
ARIHYYQDLSPIVPLDHTVPNEQYEL
QPLHDTSTSSYSINDGLYDVYADDVD
NVHTPMQHSYSTFATTRTSNVSIPL
NTGFDTPVMSGPDIPSPLFPTSSPFV
PISPFFPFDTIVVDGADFVLHPSYFILR
RRRKRFPYFFTDVRVAA
Page 24 of 48
% Nucleotide
Change
28.4%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV35 E1
ATGGCTGATCCTGCAGGTACAGATGAAGGG
GAGGGGACGGGATGTAATGGATGGTTTTTTG
TAGAAGCAGTAGTTAGTAGACGTACGGGGG
ATCCAGTGTCAGAGGACGAAAATGAAGATG
ACTGTGACAGGGGGGAGGATATGGTGGACT
TTATAAATGATACAGATATATTAAACATACAG
GCAGAAACAGAGACAGCACAAGCATTATTTC
ATGCACAGGAGGAGCAAACACACAAAGAGG
CTGTACAGGTCCTAAAACGAAAGTATGCTAG
TAGTCCACTTAGCAGCGTGAGCTTATGTGTTA
ATAATAACATAAGTCCACGTTTAAAAGCTATT
TGCATTGAAAATAAAAATACAGCAGCAAAGC
GACGATTATTTGAACTACCAGACAGCGGTTA
TGGCAATTCTGAAGTGGAAATACAGCAGATA
CAACAGGTAGAGGGGCATGATACAGTTGAA
CAATGTAGTATGGGCAGTGGGGATAGTATAA
CCTCTAGTAGCGATGAAAGACATGATGAGAC
TCCAACGCGAGACATAATACAAATACTAAAA
TGTAGTAATGCAAACGCAGCTATGTTGGCTA
AATTTAAAGAACTATTTGGTATTAGTTTTACA
GAACTTATTAGACCATTTAAGAGTGATAAATC
CACATGTACAGATTGGTGTGTGGCCGCATTT
GGAATAGCCCCAAGTGTGGCGGAAAGTTTAA
AAACATTAATTAAACCATATTGTTTATATATA
CATATACAATGTTTATCGTGTTCATGGGGTAT
GGTAATTCTAGCATTATTACGATTTAAATGTG
CAAAAAACAGAACAACAATTGAAAAACTATT
ATCAAAATTGCTATGTATTTCAGCTGCAAGTA
TGCTAATACAACCACCAAAATTACGTAGTACC
CCAGCTGCGTTATATTGGTTTAAAACAGCAAT
GTCAAATATTAGTGAGGTTGATGGAGAAACA
CCAGAATGGATTCAAAGACAAACAGTATTAC
AGCATAGTTTTAATGATGCAATATTTGACCTA
TCTGAAATGGTACAATGGGCATATGACAATG
ATTTTATAGATGATAGTGATATAGCATATAAA
TATGCACAATTGGCAGAAACTAATAGTAATG
CATGTGCTTTTTTAAAAAGTAATTCGCAAGCT
AAAATTGTAAAAGATTGTGCAACAATGTGTA
GACATTATAAACGAGCTGAAAAAAGAGAAAT
GACAATGTCACAGTGGATTAAAAGGCGATGT
GAAAAGGTGGACGATGACGGTGACTGGAGG
GACATAGTACGATTTTTAAGATATCAACAAGT
AGATTTTGTGGCATTTTTATCTGCACTAAAAA
ATTTTTTACATGGTGTGCCTAAAAAAAATTGC
ATACTTATATATGGAGCACCAAACACAGGTA
AATCATTATTTGGAATGAGTCTAATGCATTTC
TTACAAGGAGCTATTATATCCTATGTAAATTC
TAAAAGCCATTTTTGGTTGCAGCCATTATATG
ATGCCAAAATAGCTATGTTAGATGATGCTAC
ATCGCCATGTTGGGCATATATAGACCAATATT
TAAGAAATGCACTAGATGGAAATCCTATTTCA
TTAGATGTAAAGCATAAAGCATTAGTGCAAT
TAAAATGCCCACCTTTACTTATTACATCAAAT
ATAAATGCAGGCAAAGATGACAGGTGGCCAT
ACTTACATAGCAGGGTAGTGGTCTTTACATTT
CACAATGAATTCCCATTTGATAAAAATGGAA
ACCCAGTGTATGGGCTTAATGATAAAAACTG
GAAATCCTTTTTCTCAAGGACGTGGTGCAGA
TTAAATTTGCACGAGGAAGAGGACAAAGAA
AATGATGGAGACGCTTTCCCAGCGTTTAAGT
GTGTGTCAGGACAAAATACTAGAACATTACG
AGACTGA
ATGGCTGACCCCGCAGGGACCGATGAGGGA
GAAGGCACAGGATGTAATGGATGGTTTTTCG
TGGAGGCAGTCGTGAGCAGGAGAACAGGAA
GCTCCGTGGAAGATGAGAACGAAGACGATT
GCGACCGGGGCGAGGATATGGTGGACTTTA
TCAATGATACTGACATCCTGAACATTCAGGCC
GAGACCGAAACAGCTCAGGCACTGTTCCACG
CCCAGGAGGAACAGACCCATAAGGAGGCTG
TCCAGGTGCTGAAGAGGAAATATGCATCTAG
TCCACTGTCAAGCGTCAGCCTGTGCGTGAAC
AATAACATCTCCCCCCGCCTGAAGGCCATCTG
TATTGAGAATAAGAACACTGCCGCTAAACGG
AGACTGTTCGAACTGCCTGATTCCGGCTACG
GGAATTCTGAGGTGGAAATCCACGAGATTCA
GCAGGTCGAAGGGCATGATACTGTGGAGCA
GTGCAGTATGGGATCAGGCGACAGCATTACC
TCCTCTAGTGATGAGCGGCACGACGAAACTC
CAACCAGAGACATCATTCAGATCCTGAAGTG
TTCCAATGCAAACGCAGCCATGCTGGCCAAG
TTCAAAGAGCTGTTTGGCATCTCTTTCACAGA
ACTGATTAGGCCCTTCAAGTCCGATAAATCTA
CATGCACTGACTGGTGTGTCGCTGCATTTGG
GATTGCACCTAGCGTGGCCAACTTCAAGCAC
ATCACCTACGTCTATATCTACAATGTCTATCG
CGTGCATGGAGCCATGGTCATCCTGGCTCTG
CTGCGCTTTAAGGTCGAGAAACGAGAACAGC
AGCTGAAGACAATCGATGCCAAACTGCTGTG
TATTAGTGCCGCTTCAATGCTGATCCAGCCAC
CTAAGCTGCGATCCACCCCAGCAGCCCTGTAT
TGGTTCAAAACAGCCATGAGCAACATTTCCG
AGGTGGACGGCGAGACACCCGAATGGATCC
AGAGACAGACTGTCCTGCAGCACTCTTTTAAC
GATGCCATCTTCGACCTGAGTGAGATGGTGC
AGTGGGCTTACGATAATGACTTTATCGACGA
TTCAGATATTGCCTATAAGTACGCACAGCTGG
CCGAAACCAATAGCAACGCCTGCGCTTTCCTG
AAATCAAATAGCCAGGCTAAGATCGTGAAAG
ACTGCGCAACAATGTGTCGCCACTACAAGCG
AGCAGAGAAACGGGAAATGACTATGTCTCAG
TGGATTAAGAGGCGATGTGCACAGGTGGAC
GATGACGGCGATTGGAGGGACATCGTCCGA
TTTCTGCGGTATCAGCAGGTCGATTTCGTGGC
TTTTCTGAGCGCACTGAAGAATTTCCTGCATG
GGGTGCCCAAGAAAAACTGCATCCTGATCTA
CGGGGCTCCTAATACCGGAAAAAGTCTGTTT
GGCATGTCACTGATGCACTTCCTGCAGGGAG
CCATCATTAGTTATGTGAACTCCAAGTCTCAT
TTTTGGCTGCAGCCCCTGTACGACGCTAAAAT
TGCAATGCTGGATGACGCCACCAGCCCATGC
GGCATCTATCGACCCATTTTTAAGAAATGTAC
ACGGTGGAAGTCTTACATCAGTTTCAGATGT
AAAGCTCTGAGCATCGTGCACATTATGCCTAC
CTTCACATACTATATCAATATTAACGCCGGCA
AGGATGACAGGTGGCCATATCTGCACTCCCG
CGTGGTCGTGTTCACATTCCATAACGAGTTCC
CTTTCGATAAGAATGGGAACCCAGAATACGG
ACTGAATGACAAGAACTGGAAATCATTCTTTA
GCAGAACTTGGTGCAGGCTGAACCTGCATGA
GGAAGAGGTGAAGGAGAATGATGGCGACGC
CTTCCCTGCTTTTAAATGTGTGTCTGGGCAGA
ATACTAGAACCCTGAGGGAC
MADPAGTDEGEGTGCNGWFFVEA
VVSRRTGDPVSEDENEDDCDRGED
MVDFINDTDILNIQAETETAQALFHA
QEEQTHKEAVQVLKRKYASSPLSSVS
LCVNNNISPRLKAICIENKNTAAKRRL
FELPDSGYGNSEVEIQQIQQVEGHD
TVEQCSMGSGDSITSSSDERHDETPT
RDIIQILKCSNANAAMLAKFKELFGIS
FTELIRPFKSDKSTCTDWCVAAFGIA
PSVAESLKTLIKPYCLYIHIQCLSCSWG
MVILALLRFKCAKNRTTIEKLLSKLLCI
SAASMLIQPPKLRSTPAALYWFKTA
MSNISEVDGETPEWIQRQTVLQHSF
NDAIFDLSEMVQWAYDNDFIDDSDI
AYKYAQLAETNSNACAFLKSNSQAKI
VKDCATMCRHYKRAEKREMTMSQ
WIKRRCEKVDDDGDWRDIVRFLRY
QQVDFVAFLSALKNFLHGVPKKNCIL
IYGAPNTGKSLFGMSLMHFLQGAIIS
YVNSKSHFWLQPLYDAKIAMLDDAT
SPCWAYIDQYLRNALDGNPISLDVK
HKALVQLKCPPLLITSNINAGKDDRW
PYLHSRVVVFTFHNEFPFDKNGNPV
YGLNDKNWKSFFSRTWCRLNLHEEE
DKENDGDAFPAFKCVSGQNTRTLRD
Page 25 of 48
% Nucleotide
Change
27.6%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV35 E2
ATGATGGAGACGCTTTCCCAGCGTTTAAGTG
TGTGTCAGGACAAAATACTAGAACATTACGA
GACTGATAGCACATGTTTGTCTGATCACATAC
AGTATTGGAAACTGATTCGTCTTGAATGTGC
AGTATTTTATAAAGCAAGAGAAATGGGAATT
AAAACTCTTAACCACCAAGTGGTTCCAACGCA
GGCCATTTCAAAAGCCAAAGCAATGCAAGCA
ATTGAACTGCAATTAATGTTAGAGACATTAAA
TACAACTGAGTATAGCACAGAAACATGGACA
CTGCAAGAAACAAGTATTGAATTATATACAA
CAGTTCCACAAGGATGTTTTAAAAAACATGG
GGTTACAGTGGAAGTACAATTTGATGGTGAT
AAACAAAATACTATGCATTATACTAATTGGAC
ACATATATATATATTAGAGGACAGTATATGTA
CTGTTGTAAAGGGACTGGTAAATTATAAAGG
TATTTATTATGTGCATCAGGGTGTAGAAACAT
ATTATGTTACTTTTAGGGAAGAGGCTAAAAA
GTATGGAAAAAAAAATATATGGGAAGTGCAT
GTGGGTGGTCAGGTAATTGTTTGTCCTGAAT
CTGTATTTAGCAGCACAGAACTATCCACTGCT
GAAATTGCTACACAGCTACACGCCTACAACA
CCACCGAGACCCATACCAAAGCCTGCTCCGT
GGGCACCACAGAAACCCAGAAGACAAATCAC
AAACGACTTCGAGGGGGTACCGAGCTCCCCT
ACAACCCCACCAAGCGAGTGCGACTCAGTGC
CGTGGACAGTGTTGACAGAGGGGTCTACTCT
ACATCTGACTGCACAAACAAAGACCGGTGTG
GTAGTTGTAGTACAACTACACCTATAGTACAT
TTAAAAGGTGATGCAAATACATTAAAGTGTTT
AAGATATAGATTGGGTAAATATAAAGCATTG
TATCAAGATGCTTCATCTACATGGAGATGGA
CATGTACAAACGATAAAAAACAAATAGCAAT
TGTAACATTAACTTACACAACAGAATATCAAA
GGGATAAATTTTTAACTACAGTAAAAATACCT
AACACAGTTACAGTGTCTAAAGGATATATGT
CTATATGA
TTGTTTGTCCTGAATCTGTATTTAGCAGCACA
GAACTATCCACTGCTGAAATTGCTACACAGCT
ACACGCCTACAACACCACCGAGACCCATACC
AAAGCCTGCTCCGTGGGCACCACAGAAACCC
AGAAGACAAATCACAAACGACTTCGAGGGG
GTACCGAGCTCCCCTACAACCCCACCAAGCG
AGTGCGACTCAGTGCCGTGGACAGTGTTGAC
AGAGGGGTCTACTCTACATCTGACTGCACAA
ACAAAGACCGGTGTGGTAGTTGTAGTACAAC
TACACCTATAG
ATGATAGACCTTACAGCTTCCAGTACTGTGTT
GCTGTGCTTTTTGTTGTGCTTTTGTGTGCTTTT
GTGCTTGTGTCTGCTTGTACGTTCGCTATTGC
TATCTGTGTCATTATACTCAGCATTAATATTAC
TGGTTTTAATACTGTGGGTTACTGTAGCAACA
CCACTACGTTGCTTTTGTTGTTTTCTTTGCTTT
TTGTATATACCTATGGGAATGATTAACGCTCA
TGCACAATATTTGGCAGTACAGTAA
ATGTTTCAGGACCCAGCTGAACGACCTTACA
AACTGCATGATTTGTGCAACGAGGTAGAAGA
AAGCATCCATGAAATTTGTTTGAATTGTGTAT
ACTGCAAACAAGAATTACAGCGGAGTGAGGT
ATATGACTTTGCATGCTATGATTTGTGTATAG
TATATAGAGAAGGCCAGCCATATGGAGTATG
CATGAAATGTTTAAAATTTTATTCAAAAATAA
GTGAATATAGATGGTATAGATATAGTGTGTA
TGGAGAAACGTTAGAAAAACAATGCAACAAA
CAGTTATGTCATTTATTAATTAGGTGTATTAC
ATGTCAAAAACCGCTGTGTCCAGTTGAAAAG
CAAAGACATTTAGAAGAAAAAAAACGATTCC
ATAACATCGGTGGACGGTGGACAGGTCGGT
GTATGTCCTGTTGGAAACCAACACGTAGAGA
AACCGAGGTGTAA
ATGATGGAAACTCTGTCACAGCGACTGAGCG
TCTGCCAGGATAAGATTCTGGAGCATTACGA
AACCGATAGCACTTGTCTGAGCGACCACATC
CAGTACTGGAAGCTGATTAGACTGGAGTGCG
CTGTGTTCTATAAGGCAAGGGAAATGGGGAT
CAAAACTCTGAACCATCAGGTGGTCCCAACC
CAGGCAATCAGCAAGGCCAAAGCTATGCAG
GCCATTGAGCTGCAGCTGATGCTGGAAACCC
TGAATACCACAGAGTACTCAACCGAAGACTG
GACACTGCAGGAGACTAGCATTGAACTGTAC
ACTACCGTGCCCACACGGTGCCTGAAGAAAG
ATGTGTATACTGTCGAGGCTCAGTTTGACGG
CGATAAGCAGAACACCATGCACTACACTAAT
TGGACCCATATCTATATTCTGGAAGACAGTAT
CTGTACAGTGGTCAAGGGGCTGGTGAACTAC
AAAGGAATCTACTATGTGCACCAGGGCGTCG
AGACCTACTATGTCACATTCAGAGAGGAAGC
CAAGAAATATGGGAAGAAAAATATCTGGGA
GGTGCATGTCGGCGGGCAGGTCATCGTCTGC
CCCGAATCTGTGTTTAGCTCCACTGAGCTGAG
TACCGCAGAAATCGCCACCCAGCTGCACGCT
TACAACACAACTGAGACCCATACAAAGGCAT
GTTCCGTGGGAACCACAGAAACACAGAAGAC
TAACCACAAACGGCTGAGAGGAGGCACCGA
GCTGCCCTACAATCCTACAAAGAGGGTGCGC
CTGAGTGCCGTGGACTCAGTCGATCGCGGCG
TCTATTCAACAAGCGACTGTACTAACAAAGAT
CGATGCGGCTCCTGTTCTACTACCACACCTAT
CGTGCATCTGAAGGGGGACGCTAATACCCTG
AAATGCTCTCGATACCGGCTGGGAAAGTACA
AAGCCCTGTATCAGGACGCTTCTAGTACATG
GAGGTGGACTTGTACCAACGATAAGAAACAG
ATCGCAATTGTGACACTGACTTACACTACCGA
GTATCAGCGCGATAAGTTCCTGACAACTGTG
AAAATCCCAAATACCGTGACAGTCAGCAAGG
GCTATATGTCCATT
ATGTTTGTCCTGAACCTGTACCTGGCTGCCCA
GAACTATCCCCTGCTGAAGCTGCTGCATTCCT
ATACCCCTACCACTCCTCCACGGCCAATCCCC
AAGCCTGCCCCATGGGCTCCCCAGAAACCTC
GGAGACAGATTACCAACGACTTCGAGGGAGT
GCCAAGCTCCCCAACCACACCACCTTCTGAGT
GCGATAGTGTCCCTTGGACAGTGCTGACTGA
AGGCTCCACCCTGCACCTGACAGCCCAGACT
AAGACCGGGGTGGTCGTGGTCGTGCAGCTG
CATCTG
ATGATTGACCTGACTGCTTCCTCCACTGTGCT
GCTGTGTTTTCTGCTGTGTTTCTGCGTCCTGCT
GTGCCTGTGTCTGCTGGTGCGGTCTCTGCTGC
TGAGCGTGTCCCTGTACAGTGCTCTGATCCTG
CTGGTCCTGATTCTGTGGGTGACCGTCGCAA
CACCCCTGCTGGCCTTCGTGGTCTCCTGCTTT
TGTATCTACCTGTGGATGATTAACGCCCACGC
TCAGTATCTGGCCGTGCAG
ATGTTCCAGGACCCCGCCGAAAGGCCCTATA
AACTGCACGATCTGTGTAACGAAGTCGAAGA
GAGCATCCACGAAATCTGTCTGAATTGCGTG
TACTGTAAGCAGGAGCTGCAGCGCTCTGAAG
TCTACGACTTCGCCTGCTATGATCTGTGTATC
GTGTACCGAGAGGGACAGCCATATGGCGTCT
GCATGAAGTGTCTGAAGTTCTACAGCAAGAT
CTCCGAATACAGGTGGTACCGCTATAGTGTG
TATGGGGAGACTCTGGAAAAGCAGTGCAAC
AAACAGCTGTGTCACCTGCTGATCAGGTGCA
TTACCTGTCAGAAGCCCCTGTGCCCTGTCGAG
AAACAGAGACACCTGGAGGAAAAGAAAAGG
TTCCATAATATCGGAGGACGATGGACAGGAC
GATGCATGTCCTGTTGGAAGCCCACCCGGAG
AGAGACAGAAGTG
MMETLSQRLSVCQDKILEHYETDST
CLSDHIQYWKLIRLECAVFYKAREMG
IKTLNHQVVPTQAISKAKAMQAIELQ
LMLETLNTTEYSTETWTLQETSIELYT
TVPQGCFKKHGVTVEVQFDGDKQN
TMHYTNWTHIYILEDSICTVVKGLVN
YKGIYYVHQGVETYYVTFREEAKKYG
KKNIWEVHVGGQVIVCPESVFSSTEL
STAEIATQLHAYNTTETHTKACSVGT
TETQKTNHKRLRGGTELPYNPTKRV
RLSAVDSVDRGVYSTSDCTNKDRCG
SCSTTTPIVHLKGDANTLKCLRYRLGK
YKALYQDASSTWRWTCTNDKKQIAI
VTLTYTTEYQRDKFLTTVKIPNTVTVS
KGYMSI
HPV35 E4
HPV35 E5
HPV35 E6
Page 26 of 48
% Nucleotide
Change
25.7%
LFVLNLYLAAQNYPLLKLLHSYTPTTP
PRPIPKPAPWAPQKPRRQITNDFEG
VPSSPTTPPSECDSVPWTVLTEGSTL
HLTAQTKTGVVVVVQLHL
25.6%
MIDLTASSTVLLCFLLCFCVLLCLCLLV
RSLLLSVSLYSALILLVLILWVTVATPL
RCFCCFLCFLYIPMGMINAHAQYLA
VQ
28.1%
MFQDPAERPYKLHDLCNEVEESIHEI
CLNCVYCKQELQRSEVYDFACYDLCI
VYREGQPYGVCMKCLKFYSKISEYR
WYRYSVYGETLEKQCNKQLCHLLIRC
ITCQKPLCPVEKQRHLEEKKRFHNIG
GRWTGRCMSCWKPTRRETEV
24.1%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV35 E7
ATGCATGGAGAAATAACTACATTGCAAGACT
ATGTTTTAGATTTGGAACCCGAGGCAACTGA
CCTATACTGTTATGAGCAATTGTGTGACAGCT
CAGAGGAGGAGGAAGATACTATTGACGGTC
CAGCTGGACAAGCAAAACCAGACACCTCCAA
TTATAATATTGTAACGTCCTGTTGTAAATGTG
AGGCGACACTACGTCTGTGTGTACAGAGCAC
ACACATTGACATACGTAAATTGGAAGATTTAT
TAATGGGCACATTTGGAATAGTGTGCCCCGG
CTGTTCACAGAGAGCATAA
ATGTCTCTGTGGCGGTCTAACGAAGCCACTG
TCTACCTGCCTCCAGTGTCAGTGTCTAAGGTT
GTTAGCACTGATGAATATGTAACACGCACAA
ACATCTACTATCATGCAGGCAGTTCTAGGCTA
TTAGCTGTGGGTCACCCATACTATGCTATTAA
AAAACAAGATTCTAATAAAATAGCAGTACCC
AAGGTATCTGGTTTGCAATACAGAGTATTTA
GAGTAAAATTACCAGATCCTAATAAGTTTGG
ATTTCCAGACACATCATTTTATGATCCTGCCTC
CCAGCGTTTGGTTTGGGCCTGTACAGGAGTT
GAAGTAGGTCGTGGTCAGCCATTGGGTGTAG
GTATTAGTGGTCATCCTTTATTAAATAAATTG
GATGATACTGAAAATTCTAATAAATATGTTGG
TAACTCTGGTACAGATAACAGGGAATGCATT
TCTATGGATTATAAACAAACACAATTGTGTTT
AATAGGTTGTAGGCCTCCTATAGGTGAACAT
TGGGGAAAAGGCACACCTTGTAATGCTAACC
AGGTAAAAGCAGGAGAATGTCCTCCTTTGGA
GTTACTAAACACTGTACTACAAGACGGGGAC
ATGGTAGACACAGGATTTGGTGCAATGGATT
TTACTACATTACAAGCTAATAAAAGTGATGTT
CCCCTAGATATATGCAGTTCCATTTGCAAATA
TCCTGATTATCTAAAAATGGTTTCTGAGCCAT
ATGGAGATATGTTATTTTTTTATTTACGTAGG
GAGCAAATGTTTGTTAGACATTTATTTAATAG
GGCTGGAACTGTAGGTGAAACAGTACCTGCA
GACCTATATATTAAGGGTACCACTGGCACATT
GCCTAGTACTAGTTATTTTCCTACTCCTAGTG
GCTCTATGGTAACCTCCGATGCACAAATATTT
AATAAACCATATTGGTTGCAACGTGCACAAG
GCCATAATAATGGTATTTGTTGGAGTAACCA
ATTGTTTGTTACTGTAGTTGATACAACCCGTA
GTACAAATATGTCTGTGTGTTCTGCTGTGTCT
TCTAGTGACAGTACATATAAAAATGACAATTT
TAAGGAATATTTAAGGCATGGTGAAGAATAT
GATTTACAGTTTATTTTTCAGTTATGTAAAAT
AACACTAACAGCAGATGTTATGACATATATTC
ATAGTATGAACCCGTCCATTTTAGAGGATTG
GAATTTTGGCCTTACACCACCGCCTTCTGGTA
CCTTAGAGGACACATATCGCTATGTAACATCA
CAGGCTGTAACTTGTCAAAAACCCAGTGCAC
CAAAACCTAAAGATGATCCATTAAAAAATTAT
ACTTTTTGGGAGGTTGATTTAAAGGAAAAGT
TTTCTGCAGACTTAGATCAATTTCCGTTGGGC
CGTAAATTTTTGTTACAAGCAGGACTAAAGG
CCAGGCCTAATTTTAGATTAGGCAAGCGTGC
AGCTCCAGCATCTACATCTAAAAAATCTTCTA
CTAAACGTAGAAAAGTAAAAAGTTAA
ATGCACGGCGAAATTACCACTCTGCAGGATT
ATGTCCTGGATCTGGAGCCCGAAGCCACTGA
CCTGTATTGTTATGAGCAGCTGTGTGACAGCT
CCGAGGAAGAGGAAGACACAATCGATGGAC
CAGCAGGACAGGCTAAGCCTGATACCTCTAA
CTACAATATTGTGACAAGTTGCTGTAAATGCG
AGGCAACTCTGCGGCTGTGTGTCCAGAGCAC
CCACATCGACATTAGAAAGCTGGAAGATCTG
CTGATGGGAACCTTCGGCATCGTGTGCCCCG
GGTGTTCTCAGAGGGCC
ATGAGCCTGTGGAGGAGCAATGAAGCAACC
GTCTATCTGCCCCCTGTGAGCGTGAGCAAAG
TCGTGAGCACTGATGAATACGTGACAAGGAC
CAACATCTACTATCACGCAGGAAGCTCCCGA
CTGCTGGCTGTGGGGCATCCTTACTATGCAAT
CAAGAAACAGGACTCCAACAAGATTGCCGTG
CCAAAAGTCTCTGGACTGCAGTACAGAGTGT
TCAGGGTCAAGCTGCCTGATCCAAACAAGTT
CGGCTTTCCTGACACTTCCTTTTATGATCCATG
CCTGCAGCGACTGGTGTGGGCCTGTACCGGA
GTGGAGGTCGGACGAGGACAGCCACTGGGA
GTCGGCATCTCTGGACACCCTCTGCTGAACAA
GCTGGACGATACCGAGAACCTGAACAAGTAC
GTGGGAAACAGCGGCAATTCCGGGACCGAC
AATCGGGAATGCATTAGCATGGATTATAAGC
AGACACAGCTGTGCCTGATCGGATGTAGACC
CCCTATTGGCGAACATTGGGGGAAGGGAAC
ACCCTGCAACGCTAATCAGGTGAAAGCAGGC
GAGTGTCCACCCCTGGAACTGCTGAACACAG
TGCTGCAGGACGGGGATATGGTCGACACTG
GCTTCGGGGCCATGGATTTTACCACACTGCA
GGCTAATAAGTCTGACGTGCCTCTGGATATCT
GCTCTAGTATTTGTAAGTACCCAGACTATCTG
AAAATGGTCAGTGAGCCCTACGGGGATATGC
TGTTCTTTTATCTGCGGAGAGAACAGATGTTC
GTGCGGCACCTGTTTAACAGAGCTGGAACTG
TGGGCGAGACCGTCCCAGCAGACCTGTACAT
CAAGGGGACTACCGGAACACTGCCCTCAACT
AGCTATTTCCCCACCCCTTCCGGCTCTATGGT
GACATCCGATGCCCAGATCTTCAACAAGCCTT
ACTGGCTGCAGAGGGCTCAGGGCCATAACAA
TGGGATTTGCTGGAGCAACCAGCTGTTCGTG
ACTGTGGTCGACACAACTCGCTCCACCAATAT
GTCTGTGTGTAGTGCTGTCTCAAGCTCCGACT
CTACCTACAAGAACGATAACTTCAAGGAGTA
CCTGAGACACGGCGAGGAATATGACCTGCAG
TTCATCTTTCAGCTGTGCAAGATTACCCTGAC
AGCCGATGTGATGACATATATCCATTCAATGA
ACCCAAGCATTCTGGAGGACTGGAATTTCGG
GCTGACTCCTCCACCCAGCGGAACCCTGGAA
GATACATACAGATATGTGACTAGTCAGGCAG
TCACCTGTCAGAAGCCTTCAGCCCCAAAGCCC
AAAGACGATCCACTGAAAAACTACACATTCT
GGGAGGTGGACCTGAAGGAAAAGTTCAGCG
CAGACCTGGATCAGTTCCCCCTGGGACGGAA
GTTTCTGCTGCAGGCAGGCCTGAAAGCACGA
CCAAATTTCCGACTGGGAAGGCGAGCAGCTC
CTGCAAGTACATCAAAGAAATCTAGTACTAA
GCGACGGAAGGTGAAAAGC
MHGEITTLQDYVLDLEPEATDLYCYE
QLCDSSEEEEDTIDGPAGQAKPDTS
NYNIVTSCCKCEATLRLCVQSTHIDIR
KLEDLLMGTFGIVCPGCSQRA
HPV35 L1
Page 27 of 48
MSLWRSNEATVYLPPVSVSKVVSTD
EYVTRTNIYYHAGSSRLLAVGHPYYAI
KKQDSNKIAVPKVSGLQYRVFRVKLP
DPNKFGFPDTSFYDPASQRLVWACT
GVEVGRGQPLGVGISGHPLLNKLDD
TENSNKYVGNSGTDNRECISMDYKQ
TQLCLIGCRPPIGEHWGKGTPCNAN
QVKAGECPPLELLNTVLQDGDMVD
TGFGAMDFTTLQANKSDVPLDICSSI
CKYPDYLKMVSEPYGDMLFFYLRRE
QMFVRHLFNRAGTVGETVPADLYIK
GTTGTLPSTSYFPTPSGSMVTSDAQI
FNKPYWLQRAQGHNNGICWSNQL
FVTVVDTTRSTNMSVCSAVSSSDSTY
KNDNFKEYLRHGEEYDLQFIFQLCKIT
LTADVMTYIHSMNPSILEDWNFGLT
PPPSGTLEDTYRYVTSQAVTCQKPSA
PKPKDDPLKNYTFWEVDLKEKFSADL
DQFPLGRKFLLQAGLKARPNFRLGK
RAAPASTSKKSSTKRRKVKS
% Nucleotide
Change
23.2%
27.7%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV35 L2
ATGCGACACAAAAGGTCTACAAAACGTGTTA
AACGTGCATCTGCAACACAACTATATCGTACT
TGCAAAGCTGCAGGAACTTGTCCACCAGATG
TTATACCTAAGGTTGAGGGTAATACTGTTGCT
GATCAAATTTTAAAATATGGCAGCATGGCTG
TGTTTTTTGGGGGGTTAGGAATTGGTTCTGG
ATCTGGCACAGGTGGAAGATCTGGATATGTT
CCACTGGGTACAACACCTCCAACGGCTGCCA
CAAACATTCCTATACGACCCCCTGTAACTGTG
GAAAGTATACCATTAGACACAATTGGCCCTTT
AGATTCTTCTATAGTGTCATTAGTAGAGGAAA
CTAGTTTTATTGAGTCTGGTGCCCCTGTTGTT
ACACCAAGGGTCCCACCTACAACAGGTTTTAC
AATAACCACATCTACAGATACCACACCTGCTA
TTTTAGATGTGACATCCATAAGTACACATGAT
AATCCTACTTTCACTGATCCTTCTGTTTTACAC
CCACCCACGCCTGCAGAAACTTCAGGTCATTT
TGTACTTTCATCATCTTCTATTAGTACACATAA
TTATGAAGAAATCCCTATGGATACTTTTATTG
TTTCCACAGACAGCAATAATATAACTAATAGC
ACGCCTATTCCAGGGTCTCGCCCTACGACAC
GCCTAGGATTATATAGTAAAGGTACCCAGCA
GGTTAAGGTTGTTGACCCTGCCTTTATGACTT
CTCCTGCAAAACTTATTACATATGATAATCCT
GCATATGAAGGCCTTAACCCTGATACAACCTT
ACAATTTGAGCATGAGGATATTAGCTTAGCTC
CGGATCCTGACTTTATGGACATTATAGCTTTA
CATAGGCCTGCACTAACATCTAGGAAAGGCA
CTATTAGATATAGTAGAGTAGGTAATAAACG
TACTATGCATACACGAAGTGGAAAAGCTATA
GGGGCACGGGTACATTATTATCAGGATTTAA
GTAGTATTACTGAAGATATAGAATTACAACCC
TTACAACATGTACCATCCTCTTTACCACATACC
ACTGTTTCAACATCATTAAATGATGGTATGTT
TGATATTTATGCTCCTATAGATACTGAGGAAG
ATATTATATTTTCAGCATCTTCTAACAATACTT
TATATACTACATCTAACACTGCATATGTTCCTA
GCAATACTACTATACCATTAAGTAGTGGCTAT
GATATTCCTATAACAGCAGGGCCAGACATTG
TATTTAACTCTAATACTATTACTAACACTGTAC
TACCGGTACCCACAGGTCCTATATATTCTATT
ATTGCAGATGGGGGTGACTTTTATTTACACCC
TAGTTATTATTTATTAAAACGACGTCGTAAAC
GTATCCCATATTTTTTTGCAGATGTCTCTGTG
GCGGTCTAA
ATGAGACATAAAAGAAGCACAAAGAGAGTC
AAGAGAGCAAGCGCAACACAGCTGTACCGA
ACCTGCAAAGCCGCCGGAACATGCCCTCCAG
ACGTCATCCCCAAGGTGGAGGGAAACACCGT
CGCTGATCAGATTCTGAAATACGGCTCCATG
GCAGTGTTCTTTGGAGGACTGGGAATCGGAT
CAGGAAGCGGAACAGGAGGACGATCTGGCT
ATGTGCCACTGGGAACCACACCACCTACAGC
AGCTACTAATATCCCCATTCGGCCACCCGTGA
CCGTCGAGTCTATCCCCCTGGACACAATTGGC
CCTCTGGATAGCTCCATCGTCAGTCTGGTGG
AGGAAACTTCTTTCATTGAAAGTGGGGCCCC
TGTGGTCACCCCAAGAGTGCCTCCAACTACC
GGCTTCACCATCACAACTAGCACCGACACCAC
ACCCGCCATCCTGGATGTGACATCCATTTCTA
CTCACGACAACCCAACCTTCACAGATCCATCT
GTCCTGCACCCACCTACCCCAGCAGAGACAA
GTGGCCATTTTGTGCTGTCTAGTTCAAGCATC
TCAACCCATAACTACGAGGAAATCCCTATGG
ACACATTCATTGTGAGCACTGATTCCAACAAT
ATCACCAATTCAACACCAATTCCCGGGAGCC
GGCCTACTACCAGACTGGGACTGTATAGCAA
GGGCACCCAGCAGGTGAAAGTGGTCGACCC
AGCCTTCATGACTAGCCCCGCCAAGCTGATCA
CCTACGATAACCCCGCATATGAAGGCCTGAA
TCCTGACACAACTCTGCAGTTCGAGCACGAA
GATATTAGCCTGGCCCCTGACCCAGATTTTAT
GGACATCATTGCTCTGCATCGACCAGCACTG
ACCAGCAGGAAAGGGACAATCCGCTACTCCC
GAGTCGGAAACAAGAGGACTATGCACACCC
GCAGCGGGAAAGCAATTGGAGCCAGGGTGC
ATTACTATCAGGACCTGTCCTCTATCACCGAG
GATATTGAACTGCAGCCACTGCAGCACGTCC
CCAGTTCACTGCCTCATACCACAGTGAGTACA
TCACTGAATGACGGCATGTTCGATATCTACGC
CCCCATTGACACTGAGGAAGATATCATCTTCA
GCGCTAGCTCCAACAATACACTGTACACTACC
AGTAACACTGCTTATGTGCCTTCAAATACAAC
TATCCCACTGTCTAGTGGCTATGACATCCCTA
TTACCGCAGGGCCAGATATCGTGTTCAACTCC
AATACTATTACCAATTCTGTCCTGCCCGTGCC
TACAGGCCCTATCTACAGCATCATTGCCGACG
GGGGAGATTTTTATCTGCACCCTTCCTACTAT
CTGCTGAAGCGGAGAAGGAAAGCTATTCCAT
ACTTCTTTGCCGACGTGTCTGTCGCTGTG
MRHKRSTKRVKRASATQLYRTCKAA
GTCPPDVIPKVEGNTVADQILKYGS
MAVFFGGLGIGSGSGTGGRSGYVPL
GTTPPTAATNIPIRPPVTVESIPLDTIG
PLDSSIVSLVEETSFIESGAPVVTPRVP
PTTGFTITTSTDTTPAILDVTSISTHDN
PTFTDPSVLHPPTPAETSGHFVLSSSS
ISTHNYEEIPMDTFIVSTDSNNITNST
PIPGSRPTTRLGLYSKGTQQVKVVDP
AFMTSPAKLITYDNPAYEGLNPDTTL
QFEHEDISLAPDPDFMDIIALHRPAL
TSRKGTIRYSRVGNKRTMHTRSGKAI
GARVHYYQDLSSITEDIELQPLQHVP
SSLPHTTVSTSLNDGMFDIYAPIDTEE
DIIFSASSNNTLYTTSNTAYVPSNTTIP
LSSGYDIPITAGPDIVFNSNTITNTVLP
VPTGPIYSIIADGGDFYLHPSYYLLKRR
RKRIPYFFADVSVAV
Page 28 of 48
% Nucleotide
Change
28.9%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV39 E1
ATGGCCAATCGTGAAGGTACAGACGGGGAT
GGGTCGGGATGTAACGGATGGTTTCTAGTAC
AGGCAATAGTAGATAAACAAACAGGCGACA
CAGTGTCGGAGGATGAGGATGAAAATGCAA
CAGATACAGGTTCAGACCTGGCAGACTTTATT
GATGATTCCACAGATATTTGTGTACAGGCAG
AGCGTGAGACAGCACAGGTACTTTTACATAT
GCAAGAGGCCCAAAGGGATGCACAAGCAGT
GCGTGCCTTAAAACGAAAGTATACAGACAGC
AGTGGCGACACTAGACCGTATGGAAAAAAA
GTAGGCAGGAATACCAGGGGAACACTACAG
GAAATTTCATTAAATGTAAGCAGTACGCAGG
CAACACAAACGGTGTATTCCGTGCCAGACAG
CGGATATGGCAATATGGAAGTGGAAACAGCT
GAAGTGGAGGAGGTAACTGTAGCAACTAAT
ACAAATGGGGATGCTGAAGGGGAACATGGC
GGCAGTGTACGGGAGGAGTGCAGTAGTGTG
GATAGTGCTATAGATAGTGAAAACCAGGATC
CCAAATCTCCAACTGCACAAATTAAATTATTG
TTACAATCCAATAACAAAAAGGCTGCAATGC
TAACACAATTTAAAGAAACATATGGACTATCC
TTTACTGACCTGGTACGTACGTTTAAAAGTGA
TAAAACAACATGTACAGACTGGGTGGCAGCC
ATATTTGGAGTACATCCAACTATTGCAGAAG
GATTTAAAACATTAATCAACAAATATGCCTTA
TATACACATATACAAAGCTTAGACACAAAAC
AAGGAGTACTAATTTTAATGCTAATAAGATAT
ACATGTGGAAAAAATAGGGTTACTGTAGGAA
AGGGATTAAGTACATTGTTACATGTTCCAGA
AAGTTGTATGCTTCTGGAGCCTCCTAAACTGC
GCAGCCCTGTAGCAGCACTATATTGGTATCG
CACAGGTATATCCAATATTAGTGTGGTAACA
GGGGATACGCCAGAATGGATACAACGATTAA
CTGTTATACAACATGGAATAGATGATAGTGT
ATTTGACCTATCGGACATGGTACAATGGGCA
TTTGACAATGAATATACTGATGAAAGTGACA
TAGCATTTAATTATGCAATGTTAGCAGATTGT
AACAGTAATGCTGCAGCCTTTTTAAAAAGTAA
CTGCCAGGCAAAATATGTAAAAGATTGTGCA
ACAATGTGTAAACATTACAAGCGAGCACAAA
AAAGGCAAATGTCCATGTCTCAATGGATAAA
ATTTAGGTGTAGTAAATGTGATGAAGGCGGG
GACTGGAGACCCATAGTACAATTCTTAAGAT
ATCAAGGAATAGAATTTATATCCTTTTTATGT
GCATTAAAGGAATTTTTAAAGGGTACTCCCA
AAAAAAACTGTATAGTTATATATGGACCTGC
GAATACAGGAAAGTCACATTTTTGTATGAGC
CTTATGCATTTTTTACAGGGCACAGTTATTTC
ATATGTAAACTCCACCAGCCACTTTTGGCTAG
AACCACTTGCAGATGCAAAACTAGCAATGTT
AGATGATGCAACCGGTACCTGCTGGTCATAT
TTCGATAATTATATGAGAAATGCATTAGATG
GGTATGCAATAAGTTTAGATAGGAAATATAA
AAGTTTACTACAAATGAAATGTCCACCATTAT
TAATAACCTCCAATACCAATCCTGTGGAAGAC
GATAGGTGGCCATATTTACGTAGTAGGCTAA
CAGTGTTTAAATTTCCTAATGCATTTCCATTTG
ACCAAAACAGGAATCCAGTGTACACAATCAA
TGATAAAAACTGGAAATGTTTTTTTGAAAAG
ACTTGGTGCAGATTAGACTTGCAGCAGGACG
AGGATGAAGGAGACAATGATGAAAACACTTT
CACAACGTTTAAATGTGTTACAGGACAAAAT
ACTAGAATACTATGA
ATGGCTAATCGGGAGGGGACTGATGGGGAT
GGGAGCGGCTGTAATGGCTGGTTCCTGGTGC
AGGCAATCGTGGATAAGCAGACTGGCGACA
CCGTGAGCGAGGACGAAGATGAGAACGCTA
CAGATACTGGCTCCGACCTGGCAGATTTCATC
GACGATTCTACAGACATTTGCGTGCAGGCCG
AAAGAGAGACTGCTCAGGTCCTGCTGCACAT
GCAGGAGGCACAGAGGGATGCACAGGCTGT
GCGAGCTCTGAAGCGGAAATACACCGACAGC
TCCGGAGATACAAGGCCATATGGAAAGAAA
GTGGGCAGGAACACCCGCGGCACACTGCAG
GAAATCTCCCTGAATGTCTCTAGTACCCAGGC
TACCCAGACAGTGTACTCTGTCCCCGACAGTG
GGTATGGAAACATGGAAGTGGAGACTGCCG
AGGTCGAGGAAGTGACCGTCGCCACTAACAC
CAATGGGGATGCTGAAGGAGAGCATGGCGG
GAGCGTGCGGGAGGAATGCTCAAGCGTCGA
CTCAGCTATCGATAGCGAAAATCAGGACCCT
AAGAGCCCAACAGCCCAGATCAAGCTGCTGC
TGCAGTCCAACAATAAGAAAGCCGCTATGCT
GACTCAGTTTAAGGAGACCTACGGGCTGAGT
TTCACAGATCTGGTGAGAACTTTTAAGTCAGA
CAAAACCACATGTACCGATTGGGTGGCAGCC
ATCTTCGGAGTCCACCCCACAATTGCAGAGG
GCTTTAAGACTCTGATCAACAAATACGCCCTG
TATACCCATATTCAGTCTCTGGACACAAAGCA
GGGCGTGCTGATCCTGATGCTGATTCGCTAC
ACTTGCGGGAAGAATCGAGTGACCGTCGGCA
AAGGGCTGTCTACACTGCTGCACGTGCCCGA
AAGTTGTATGCTGCTGGAGCCCCCTAAGCTG
AGGAGCCCTGTCGCTGCACTGTACTGGTATC
GCACAGGGATCAGCAACATTTCCGTGGTCAC
TGGAGATACCCCTGAGTGGATCCAGCGGCTG
ACTGTGATCCAGCATGGCATTGACGATTCCG
TGTTCGACCTGTCTGATATGGTCCAGTGGGCT
TTTGACAACGAATACACAGACGAGTCCGATA
TTGCATTCAATTATGCTATGCTGGCAGATTGC
AACTCAAATGCCGCTGCATTTCTGAAGAGCA
ATTGTCAGGCAAAGTACGTGAAAGACTGCGC
CACCATGTGTAAGCACTATAAAAGAGCCCAG
AAAAGGCAGATGTCCATGTCTCAGTGGATCA
AGTTCAGATGCTCTAAATGTGACGAGGGAGG
CGATTGGCGGCCTATTGTGCAGTTTCTGAGA
TACCAGGGCATCGAATTCATTAGCTTTCTGTG
CGCCCTGAAGGAGTTCCTGAAAGGGACTCCT
AAGAAAAACTGTATCGTGATCTACGGCCCAG
CCAATACCGGGAAGAGTCACTTCTGCATGTC
ACTGATGCATTTTCTGCAGGGCACTGTGATCA
GCTATGTCAACAGTACCTCACATTTCTGGCTG
GAACCACTGGCAGACGCCAAGCTGGCAATGC
TGGACGATGCCACAGGAACTTGCTGGTCCTA
CTTTGATAACTATATGCGAAATGCTCTGGACG
GCTACGCAATCAGCCTGGATCGGAAGTATAA
ATCCCTGCTGCAGATGAAGTGTCCACCCCTGC
TGATTACATCTAACACTAATCCAGTGGAGGA
CGATAGGTGGCCCTACCTGCGGAGTAGACTG
ACCGTCTTCAAATTTCCCAACGCCTTCCCTTTT
GACCAGAACCGCAATCCCGTGTATACCATCA
ACGATAAGAACTGGAAGTGCTTCTTTGAAAA
GACATGGTGTCGACTGGACCTGCAGCAGGAC
GAAGATGAGGGGGACAACGATGAGAATACC
TTCACTACCTTTAAATGTGTGACCGGACAGAA
TACACGCATCCTG
MANREGTDGDGSGCNGWFLVQAI
VDKQTGDTVSEDEDENATDTGSDLA
DFIDDSTDICVQAERETAQVLLHMQ
EAQRDAQAVRALKRKYTDSSGDTRP
YGKKVGRNTRGTLQEISLNVSSTQAT
QTVYSVPDSGYGNMEVETAEVEEVT
VATNTNGDAEGEHGGSVREECSSV
DSAIDSENQDPKSPTAQIKLLLQSNN
KKAAMLTQFKETYGLSFTDLVRTFKS
DKTTCTDWVAAIFGVHPTIAEGFKTL
INKYALYTHIQSLDTKQGVLILMLIRY
TCGKNRVTVGKGLSTLLHVPESCMLL
EPPKLRSPVAALYWYRTGISNISVVT
GDTPEWIQRLTVIQHGIDDSVFDLSD
MVQWAFDNEYTDESDIAFNYAMLA
DCNSNAAAFLKSNCQAKYVKDCAT
MCKHYKRAQKRQMSMSQWIKFRC
SKCDEGGDWRPIVQFLRYQGIEFISF
LCALKEFLKGTPKKNCIVIYGPANTGK
SHFCMSLMHFLQGTVISYVNSTSHF
WLEPLADAKLAMLDDATGTCWSYF
DNYMRNALDGYAISLDRKYKSLLQM
KCPPLLITSNTNPVEDDRWPYLRSRL
TVFKFPNAFPFDQNRNPVYTINDKN
WKCFFEKTWCRLDLQQDEDEGDND
ENTFTTFKCVTGQNTRIL
Page 29 of 48
% Nucleotide
Change
25.8%
% Nucleotide
Change
35.4%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV39 E2
ATGAAGGAGACAATGATGAAAACACTTTCAC
AACGTTTAAATGTGTTACAGGACAAAATACT
AGAATACTATGAACAAGACAGTAAATCAATA
TATGATCAAATTAATTATTGGAAATGTGTGCG
AATGGAAAATGCAATATTTTATGCAGCACGA
GAACGTGGCATGCATACTATTGACCACCAGG
TGGTGCCAACCATAAACATTTCAAAATGTAAA
GCATATCAAGCTATTGAACTGCAGATGGCAC
TAGAAAGTGTTGCACAAACTGAATACAATAC
AGAGGAGTGGACATTAAAAGACACTAGTAAT
GAACTGTGGCATACACAGCCAAAACAATGTT
TTAAAAAACAAGGAACTACAGTGGAGGTGTG
GTATGATGGGGACAAATGTAATGCTATGAAC
TATGTATTATGGGGTGCTATATATTATAAAAA
TAATATAGACATATGGTGTAAAACAGAAGGG
TGTGTGGACTATTGGGGTATATATTATATGA
ACGAGCACCTAAAAGTATACTATGAAGTGTT
TATTCAAGATGCGGAAAGGTATGGGACTAGT
GGCAAATGGGAAGTGCATTATAATGGCAACA
TAATTCATTGTCCTGACTCTATGTGCAGTACC
AGTGACGGATCGGTACCCACTACTGAACTTA
CTACCGAATTATCAAACACCACCGCGACCCAT
TCCACCGCAACAACCCCATGCACCCAAAAAA
CAATCCCGCCGCCGTCTCGAAAGCGACCTCG
ACAGTGTGCAGTCACAGAGCCCACTGAGCCC
GACGGAGTGTCCCTGGACCATCTTAACAACC
CACTCCACAGTAACAGTACAGGCCACAACAC
AAGACGGTACCTCAGTTGTGGTAACACTACG
CCTATAATACATTTAAAAGGTGACAAAAATG
GTTTAAAATGTTTAAGATATAGACTACAAAAA
TATGACACATTGTTTGAAAATATTTCATGTAC
CTGGCATTGGATACGGGGTAAGGGAACCAA
AAACGCTGGCATATTAACTGTTACATATGCCA
CAGAGTCACAACGCCAAAAATTTTTGGACAC
TGTTAAAATACCTTCTAGTGTACATGTTTCATT
GGGTTACATGACATTGTAA
TTCATTGTCCTGACTCTATGTGCAGTACCAGT
GACGGATCGGTACCCACTACTGAACTTACTAC
CGAATTATCAAACACCACCGCGACCCATTCCA
CCGCAACAACCCCATGCACCCAAAAAACAAT
CCCGCCGCCGTCTCGAAAGCGACCTCGACAG
TGTGCAGTCACAGAGCCCACTGAGCCCGACG
GAGTGTCCCTGGACCATCTTAACAACCCACTC
CACAGTAACAGTACAGGCCACAACACAAGAC
GGTACCTCAGTTGTGGTAACACTACGCCTATA
A
ATGATATTATTGGTATTTTTGGTGTGGTTTGG
TGTGTGTATATATATATGTTGCAATGTCCCGC
TTTTGCCGTCTGTGCATGTGTGTGCGTATGTG
TGGATAATTGTGTTTGTGTTTATTCTTATACGT
ACCACACCATTGGAGGTGTTTTTTGTATATTT
ACTATTTTTTGTATTGCCCATGTGGTTGTTGC
ATAGATGGCAATGGATATGATATAG
ATGGCGCGATTTCACAATCCTGCAGAACGGC
CATACAAATTGCCAGACCTGTGCACAACGCT
GGACACCACCTTGCAGGACATTACAATAGCC
TGTGTCTATTGCAGACGACCACTACAGCAAA
CCGAGGTATATGAATTTGCATTTAGTGATTTA
TATGTAGTATATAGGGACGGGGAACCACTAG
CTGCATGCCAATCATGTATAAAATTTTATGCT
AAAATACGGGAGCTACGATATTACTCGGACT
CGGTGTATGCAACTACATTAGAAAATATAACT
AATACAAAGTTATATAATTTATTAATAAGGTG
CATGTGTTGTCTGAAACCGCTGTGTCCAGCA
GAAAAATTAAGACACCTAAATAGCAAACGAA
GATTTCATAAAATAGCAGGAAGCTATACAGG
ACAGTGTCGACGGTGCTGGACCACAAAACGG
GAGGACCGCAGACTAACACGAAGAGAAACC
CAAGTATAA
ATGAAGGAAACTATGATGAAAACACTGAGCC
AGAGACTGAACGTGCTGCAGGATAAGATTCT
GGAGTATTACGAACAGGATTCTAAAAGTATC
TACGACCAGATTAACTATTGGAAGTGCGTGC
GGATGGAGAATGCCATCTTCTACGCCGCTAG
GGAACGCGGGATGCACACTATTGATCATCAG
GTGGTCCCAACTTCAAGAAACAGGGAACCAC
AGTGGAAGTCTGGTACGACGGCGATAAGTG
TAACGCAATGAATTATGTGCTGTGGGGCGCC
ATCTACTATAAGAACAATATCGACATTTGGTG
CAAAACCGAGGGCTGTGTCGATTATTGGGGG
ATCTACTATATGAACGAACATCTGAAGGTGT
ACTATGAGGTCTTTATTCAGGATGCCGAACG
GTACGGAACTAGCGGCAAATGGGAGGTGCA
TTATAACGGCAATATCATTCACTGCCCAGACA
GTATGTGTTCTACAAGTGATGGCTCAGTGCC
CACTACCGAGCTGACAACTGAACTGTCCAAT
ACCACAGCCACTCACTCTACCGCTACTACCCC
TTGCACACAGAAGACTATCCCCCCTCCATCAC
GAAAACGGCCAAGACAGTGTGCTGTGACCG
AGCCCACAGAACCTGACGGCGTCAGCCTGGA
TCACCTGAACAATCCCCTGCATTCAAACAGCA
CCGGGCACAATACACGGAGATACCTGTCCTG
CGGAAACACAACTCCTATCATTCATCTGAAGG
GGGACAAAAATGGACTGAAGTGCCTGAGGT
ACCGCCTGCAGAAATATGATACACTGTTCGA
GAACATCTCTTGTACTTGGCACTGGATTCGAG
GCAAGGGGACCAAAAATGCTGGAATCCTGAC
CGTGACATACGCAACCGAATCTCAGAGGCAG
AAGTTTCTGGACACAGTCAAAATTCCTAGCTC
CGTGCACGTCAGTCTGGGCTATATGACACTG
MKETMMKTLSQRLNVLQDKILEYYE
QDSKSIYDQINYWKCVRMENAIFYA
ARERGMHTIDHQVVPTINISKCKAY
QAIELQMALESVAQTEYNTEEWTLK
DTSNELWHTQPKQCFKKQGTTVEV
WYDGDKCNAMNYVLWGAIYYKNNI
DIWCKTEGCVDYWGIYYMNEHLKV
YYEVFIQDAERYGTSGKWEVHYNGN
IIHCPDSMCSTSDGSVPTTELTTELSN
TTATHSTATTPCTQKTIPPPSRKRPR
QCAVTEPTEPDGVSLDHLNNPLHSN
STGHNTRRYLSCGNTTPIIHLKGDKN
GLKCLRYRLQKYDTLFENISCTWHWI
RGKGTKNAGILTVTYATESQRQKFLD
TVKIPSSVHVSLGYMTL
ATGATTGTGCTGACACTGTGCGCCGTGCCTG
TGACCGACCGCTACCCCCTGCTGAACCTGCTG
CCCAACTACCAGACCCCTCCCCGCCCAATCCC
ACCTCAGCAGCCACACGCACCCAAGAAACAG
AGTCGGAGAAGGCTGGAGTCTGACCTGGAT
AGTGTCCAGAGCCAGTCCCCTCTGTCACCAAC
CGAATGCCCTTGGACAATTCTGACCACACATA
GCACAGTCACTGTGCAGGCTACTACCCAGGA
CGGCACTTCCGTGGTCGTGACACTGCGGCTG
FIVLTLCAVPVTDRYPLLNLLPNYQTP
PRPIPPQQPHAPKKQSRRRLESDLDS
VQSQSPLSPTECPWTILTTHSTVTVQ
ATTQDGTSVVVTLRL
34.8%
ATGATTCTGCTGGTCTTTCTGGTCTGGTTCGG
CGTGTGTATCTATATCTGTTGTAATGTCCCTCT
GCTGCCATCCGTCCATGTGTGTGCCTACGTGT
GGATCATTGTGTTCGTCTTTATCCTGATTCGG
ACCACACCCCTGGAGGTGTTCTTTGTCTATCT
GCTGTTCTTTGTCCTGCCTATGTGGCTGCTGC
ACAGACTGGCTATGGACATGATC
ATGGCTAGATTTCATAACCCCGCCGAACGAC
CTTACAAACTGCCCGACCTGTGCACTACACTG
GATACAACTCTGCAGGACATCACCATCGCCT
GCGTGTACTGTCGGAGACCACTGCAGCAGAC
TGAGGTCTATGAATTCGCTTTTAGCGACCTGT
ACGTGGTCTATAGGGATGGAGAGCCACTGG
CAGCTTGCCAGTCTTGTATCAAGTTCTACGCA
AAAATTAGGGAGCTGCGCTACTATAGCGACT
CCGTGTACGCCACCACACTGGAAAACATCAC
AAATACTAAGCTGTATAACCTGCTGATTCGGT
GCATGTGCTGTCTGAAGCCCCTGTGTCCTGCT
GAAAAACTGAGACACCTGAATTCTAAGAGGC
GCTTTCATAAAATTGCAGGCAGTTATACAGG
GCAGTGCCGACGGTGTTGGACTACCAAACGG
GAGGATAGAAGGCTGACCCGCCGAGAAACA
CAGGTC
MILLVFLVWFGVCIYICCNVPLLPSVH
VCAYVWIIVFVFILIRTTPLEVFFVYLL
FFVLPMWLLHRLAMDMI
21.9%
MARFHNPAERPYKLPDLCTTLDTTL
QDITIACVYCRRPLQQTEVYEFAFSDL
YVVYRDGEPLAACQSCIKFYAKIRELR
YYSDSVYATTLENITNTKLYNLLIRCM
CCLKPLCPAEKLRHLNSKRRFHKIAGS
YTGQCRRCWTTKREDRRLTRRETQV
24.8%
HPV39 E4
HPV39 E5
HPV39 E6
Page 30 of 48
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV39 E7
ATGCGTGGACCAAAGCCCACCTTGCAGGAAA
TTGTATTAGATTTATGTCCTTACAATGAAATA
CAGCCGGTTGACCTTGTATGTCACGAGCAAT
TAGGAGAGTCAGAGGATGAAATAGATGAAC
CCGACCATGCAGTTAATCACCAACATCAACTA
CTAGCCAGACGGGATGAACCACAGCGTCACA
CAATACAGTGTTCGTGTTGTAAGTGTAACAAC
ACACTGCAGCTGGTAGTAGAAGCCTCACGGG
ATACTCTGCGACAACTACAGCAGCTGTTTATG
GACTCACTAGGATTTGTGTGTCCGTGTGTGC
AACTGCAAACCAGTAA
ATGGCTATGTGGCGGTCTAGTGACAGCATGG
TGTATTTGCCTCCACCTTCTGTGGCGAAGGTT
GTCAATACTGATGATTATGTTACACGCACAG
GCATATATTATTATGCTGGCAGCTCTAGATTA
TTAACAGTAGGACATCCATATTTTAAAGTGG
GTATGAATGGTGGTCGCAAGCAGGACATTCC
AAAGGTGTCTGCATATCAATATAGGGTATTTC
GCGTGACATTGCCCGATCCTAATAAATTCAGT
ATTCCAGATGCATCCTTATATAATCCAGAAAC
ACAACGTTTAGTATGGGCTTGTGTAGGGGTG
GAGGTGGGCAGGGGCCAGCCATTGGGTGTT
GGTATTAGTGGACACCCATTATATAATAGAC
AGGATGATACTGAAAACTCACCATTTTCATCA
ACCACCAATAAGGACAGTAGGGATAATGTGT
CTGTGGATTATAAACAGACACAGTTGTGCATT
ATAGGCTGTGTTCCCGCCATTGGGGAGCACT
GGGGTAAGGGAAAGGCATGCAAGCCCAATA
ATGTATCTACGGGGGACTGTCCTCCTTTGGA
ACTAGTAAACACCCCTATTGAGGATGGTGAT
ATGATTGATACTGGCTATGGAGCTATGGACT
TTGGTGCATTGCAGGAAACCAAAAGTGAGGT
GCCTTTAGATATTTGTCAATCCATTTGTAAAT
ATCCTGATTATTTGCAAATGTCTGCAGATGTG
TATGGGGACAGTATGTTCTTCTGTTTACGTAG
GGAACAACTGTTTGCAAGACATTTTTGGAATC
GTGGTGGTATGGTGGGTGACGCCATTCCTGC
CCAATTGTATATTAAGGGCACAGATATACGT
GCAAACCCCGGTAGTTCTGTATACTGCCCCTC
TCCCAGCGGTTCCATGGTAACCTCTGATTCCC
AGTTATTTAATAAGCCTTATTGGCTACATAAG
GCCCAGGGCCACAACAATGGTATATGTTGGC
ATAATCAATTATTTCTTACTGTTGTGGACACT
ACCCGTAGTACCAACTTTACATTATCTACCTCT
ATAGAGTCTTCCATACCTTCTACATATGATCC
TTCTAAGTTTAAGGAATATACCAGGCACGTG
GAGGAGTATGATTTACAATTTATATTTCAACT
GTGTACTGTCACATTAACAACTGATGTTATGT
CTTATATTCACACTATGAATTCCTCTATATTGG
ACAATTGGAATTTTGCTGTAGCTCCTCCACCA
TCTGCCAGTTTGGTAGACACTTACAGATACCT
ACAGTCTGCAGCCATTACATGTCAAAAGGAT
GCTCCAGCACCTGAAAAGAAAGATCCATATG
ACGGTCTAAAGTTTTGGAATGTTGACTTAAG
GGAAAAGTTTAGTTTGGAACTTGATCAATTCC
CTTTGGGACGTAAATTTTTGTTGCAGGCCAG
GGTCCGCAGGCGCCCTACTATAGGTCCCCGA
AAGCGGCCTGCTGCATCCACTTCCTCGTCCTC
AGCTACTAAACACAAACGTAAACGTGTGTCT
AAATAA
ATGCGAGGCCCAAAGCCAACTCTGCAGGAAA
TTGTGCTGGACCTGTGCCCCTATAATGAGATT
CAGCCCGTGGACCTGGTGTGTCACGAACAGC
TGGGCGAGTCTGAAGACGAGATCGATGAGC
CCGACCACGCAGTGAACCACCAGCATCAGCT
GCTGGCCCGGAGAGATGAACCTCAGCGACAT
ACCATTCAGTGCAGCTGCTGTAAGTGTAACA
ATACACTGCAGCTGGTGGTCGAGGCTAGCAG
GGATACTCTGCGCCAGCTGCAGCAGCTGTTC
ATGGACTCCCTGGGGTTTGTCTGCCCCTGGT
GTGCCACCGCTAATCAG
ATGGCTATGTGGCGAAGCTCCGATTCTATGG
TCTATCTGCCCCCCCCCTCCGTGGCTAAAGTC
GTGAACACCGATGATTATGTGACACGGACAG
GAATCTACTATTACGCAGGCAGCTCCAGACT
GCTGACTGTCGGCCACCCATATTTCAAAGTG
GGGATGAATGGCGGGAGAAAACAGGACATC
CCCAAGGTGAGCGCTTATCAGTACCGAGTCT
TCCGGGTGACCCTGCCAGACCCCAACAAGTT
TAGTATTCCTGATGCATCACTGTACAATCCAG
AGACACAGAGGCTGGTCTGGGCATGCGTGG
GAGTCGAAGTGGGACGAGGACAGCCACTGG
GAGTGGGAATCAGTGGACACCCTCTGTATAA
CCGACAGGACGATACAGAGAATTCTCCCTTCT
CTAGTACCACAAACAAAGACAGCCGGGATAA
TGTCTCCGTGGACTACAAGCAGACTCAGCTG
TGCATCATTGGGTGTGTGCCTGCCATTGGAG
AACATTGGGGAAAGGGCAAAGCTTGCAAGC
CAAACAATGTCAGCACAGGCGATTGTCCCCC
TCTGGAGCTGGTGAACACACCTATCGAAGAC
GGGGATATGATTGACACTGGGTATGGAGCTA
TGGATTTTGGAGCACTGCAGGAGACCAAATC
CGAAGTCCCACTGGACATCTGCCAGTCTATTT
GTAAGTATCCCGATTACCTGCAGATGTCAGCT
GACGTGTACGGGGATAGCATGTTCTTTTGTCT
GCGGAGAGAGCAGCTGTTCGCAAGACACTTT
TGGAACAGGGGAGGAATGGTGGGCGACGCA
ATCCCAGCACAGCTGTATATCAAGGGAACTG
ATATTAGGGCCAATCCTGGCTCAAGCGTCTAC
TGCCCTTCACCAAGCGGCTCCATGGTGACCTC
TGACAGTCAGCTGTTTAACAAACCCTACTGGC
TGCACAAGGCCCAGGGCCATAACAATGGGAT
TTGTTGGCATAACCAGCTGTTCCTGACAGTG
GTCGATACTACCAGAAGCACTAATTTTACCCT
GTCAACAAGCATCGAGTCCTCTATTCCATCTA
CCTATGACCCCAGTAAGTTCAAAGAATATACA
AGGCACGTGGAGGAATACGATCTGCAGTTCA
TCTTTCAGCTGTGCACCGTCACACTGACAACT
GACGTGATGTCCTACATCCATACCATGAACA
GTTCAATCCTGGATAACTGGAACTTCGCTGTC
GCACCACCCCCTTCCGCCTCTCTGGTGGACAC
TTATCGCTACCTGCAGTCCGCCGCTATTACCT
GTCAGAAAGATGCCCCCGCTCCTGAGAAGAA
AGACCCTTACGATGGCCTGAAATTCTGGAAC
GTGGACCTGCGGGAGAAGTTTTCTCTGGAAC
TGGATCAGTTCCCACTGGGACGCAAATTTCT
GCTGCAGGCACGAGTGAGGCGACGACCAAC
CATCGGACCACGAAAGAGACCTGCAGCAAGC
ACTAGCTCCTCTAGTGCTACCAAGCATAAAAG
GAAGCGCGTGAGCAAG
MRGPKPTLQEIVLDLCPYNEIQPVDL
VCHEQLGESEDEIDEPDHAVNHQH
QLLARRDEPQRHTIQCSCCKCNNTL
QLVVEASRDTLRQLQQLFMDSLGFV
CPWCATANQ
HPV39 L1
Page 31 of 48
MAMWRSSDSMVYLPPPSVAKVVN
TDDYVTRTGIYYYAGSSRLLTVGHPY
FKVGMNGGRKQDIPKVSAYQYRVF
RVTLPDPNKFSIPDASLYNPETQRLV
WACVGVEVGRGQPLGVGISGHPLY
NRQDDTENSPFSSTTNKDSRDNVSV
DYKQTQLCIIGCVPAIGEHWGKGKA
CKPNNVSTGDCPPLELVNTPIEDGD
MIDTGYGAMDFGALQETKSEVPLDI
CQSICKYPDYLQMSADVYGDSMFFC
LRREQLFARHFWNRGGMVGDAIPA
QLYIKGTDIRANPGSSVYCPSPSGSM
VTSDSQLFNKPYWLHKAQGHNNGI
CWHNQLFLTVVDTTRSTNFTLSTSIE
SSIPSTYDPSKFKEYTRHVEEYDLQFIF
QLCTVTLTTDVMSYIHTMNSSILDN
WNFAVAPPPSASLVDTYRYLQSAAIT
CQKDAPAPEKKDPYDGLKFWNVDL
REKFSLELDQFPLGRKFLLQARVRRR
PTIGPRKRPAASTSSSSATKHKRKRVS
K
% Nucleotide
Change
23.5%
26.6%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV39 L2
ATGGTTTCCCACCGTGCTGCCAGGCGTAAGC
GTGCATCTGCAACTGACCTATATAGAACCTGT
AAACAATCGGGTACCTGTCCACCAGACGTTG
TTGATAAAGTTGAGGGTACTACACTTGCTGA
CAAAATTTTACAGTGGACTAGTTTAGGTATAT
TTTTGGGTGGGTTAGGCATAGGCACAGGTAC
TGGTACTGGGGGACGCACAGGATATATACCC
CTGGGGGGTAGGCCTAATACTGTTGTAGATG
TGTCTCCTGCACGTCCACCTGTAGTTATTGAA
CCTGTTGGTCCTTCTGAGCCATCTATTGTGCA
ATTGGTGGAGGACTCAAGTGTTATAACCTCT
GGAACACCAGTACCAACATTTACAGGCACCT
CTGGATTTGAAATTACTTCTTCTTCTACTACTA
CGCCTGCGGTATTGGATATTACACCCTCCTCT
GGGTCTGTACAAATAACCTCTACTAGTTATAC
TAACCCTGCCTTTACGGATCCTTCCTTAATTG
AGGTTCCCCAAACAGGTGAAACCTCGGGTAA
TATATTTGTCAGTACCCCTACATCAGGTACAC
ATGGCTATGAGGAAATACCTATGGAAGTGTT
TGCCACACATGGCACAGGTACCGAACCTATT
AGCAGCACACCTACACCTGGAATCAGTCGTG
TGGCAGGACCACGTTTATATAGTAGAGCACA
TCAGCAGGTTCGTGTTAGTAATTTTGATTTTG
TAACTCACCCTTCATCATTTGTAACATTTGATA
ATCCTGCTTTTGAGCCTGTTGATACTACATTA
ACATATGAAGCTGCTGACATAGCTCCAGATC
CGGATTTTCTGGACATTGTTCGTTTACATAGG
CCTGCCTTAACCTCGCGTAAAGGAACAGTAA
GGTTTAGTAGGCTTGGCAAAAAGGCTACCAT
GGTTACCCGGCGTGGCACACAAATTGGAGCG
CAAGTACATTATTACCATGACATTAGTAGTAT
TGCTCCTGCTGAAAGCATTGAATTACAGCCCC
TAGTTCACGCTGAGCCCTCTGATGCTTCAGAT
GCATTATTTGATATATATGCTGATGTGGACAA
TAACACATATTTAGATACTGCATTTAATAATA
CAAGGGATTCGGGCACTACATATAACACAGG
CTCACTACCTTCTGTGGCTTCTTCAGCATCTAC
TAAATATGCCAATACAACTATTCCTTTTAGTA
CCTCATGGAATATGCCTGTAAATACTGGTCCT
GATATTGCTTTACCAAGTACTACTCCACAGTT
GCCATTGGTGCCTTCTGGACCAATAGACACA
ACATATGCAATAACCATTCAGGGTTCCAATTA
TTATTTGTTGCCATTATTGTATTTTTTCCTAAA
AAAACGTAAACGTATTCCCTATTTTTTTTCAG
ATGGCTATGTGGCGGTCTAG
ATGGTCTCCCACCGGGCAGCAAGAAGGAAAC
GGGCATCAGCAACCGACCTGTATCGCACCTG
TAAGCAGAGCGGAACCTGTCCCCCCGACGTG
GTCGATAAGGTGGAGGGGACCACACTGGCT
GACAAAATTCTGCAGTGGACCTCTCTGGGAA
TCTTCCTGGGAGGACTGGGAATTGGAACCGG
AACAGGCACTGGAGGCCGGACTGGCTATATC
CCACTGGGGGGAAGACCCAACACCGTGGTC
GACGTGTCCCCAGCAAGACCACCTGTGGTCA
TCGAGCCAGTCGGACCATCAGAACCTAGCAT
CGTGCAGCTGGTCGAGGATAGCTCCGTGATC
ACTTCAGGGACCCCAGTCCCCACCTTCACAGG
CACTAGCGGGTTTGAAATCACATCTAGTTCAA
CTACCACACCAGCTGTGCTGGACATCACCCCC
AGCTCCGGCAGCGTCCAGATTACCAGCACAT
CCTACACAAACCCTGCATTCACTGATCCATCC
CTGATTGAGGTGCCTCAGACAGGGGAAACTT
CTGGAAATATCTTTGTCAGCACTCCTACCTCC
GGAACACACGGCTATGAGGAAATCCCAATGG
AGGTGTTCGCCACCCATGGGACAGGAACTGA
ACCAATCTCTAGTACCCCTACACCAGGAATTT
CAAGGGTGGCAGGACCACGACTGTACAGCC
GAGCACACCAGCAGGTGCGCGTCTCCAACTT
CGATTTTGTGACTCATCCCTCAAGCTTCGTCA
CCTTTGACAATCCCGCCTTTGAGCCTGTGGAT
ACTACCCTGACATACGAAGCCGCTGACATCG
CACCCGACCCTGATTTCCTGGATATTGTGCGA
CTGCACCGACCTGCACTGACCTCTCGAAAGG
GCACCGTGCGGTTCAGCAGGCTGGGGAAGA
AAGCCACCATGGTGACACGGAGAGGCACCC
AGATTGGGGCTCAGGTCCACTACTATCATGA
CATCTCCTCTATTGCACCCGCCGAGAGCATCG
AACTGCAGCCACTGGTGCATGCAGAGCCTTC
CGACGCTTCTGATGCACTGTTCGATATCTACG
CTGACGTGGATAACAATACTTATCTGGACACC
GCCTTCAACAATACCCGCGATAGTGGGACAA
CTTACAACACAGGAAGTCTGCCTTCAGTGGC
CAGTTCAGCTAGCACCAAGTATGCTAATACCA
CAATTCCATTTTCTACAAGTTGGAACATGCCC
GTGAATACTGGCCCTGACATCGCACTGCCATC
TACTACCCCACAGCTGCCCCTGGTGCCTAGTG
GACCAATTGATACAACTTACGCCATCACCATT
CAGGGCAGCAATTACTATCTGCTGCCCCTGCT
GTATTTCTTTCTGAAGAAAAGGAAACGCATCC
CTTACTTCTTTTCCGACGGCTATGTGGCCGTC
MVSHRAARRKRASATDLYRTCKQSG
TCPPDVVDKVEGTTLADKILQWTSL
GIFLGGLGIGTGTGTGGRTGYIPLGG
RPNTVVDVSPARPPVVIEPVGPSEPS
IVQLVEDSSVITSGTPVPTFTGTSGFEI
TSSSTTTPAVLDITPSSGSVQITSTSYT
NPAFTDPSLIEVPQTGETSGNIFVSTP
TSGTHGYEEIPMEVFATHGTGTEPIS
STPTPGISRVAGPRLYSRAHQQVRVS
NFDFVTHPSSFVTFDNPAFEPVDTTL
TYEAADIAPDPDFLDIVRLHRPALTSR
KGTVRFSRLGKKATMVTRRGTQIGA
QVHYYHDISSIAPAESIELQPLVHAEP
SDASDALFDIYADVDNNTYLDTAFN
NTRDSGTTYNTGSLPSVASSASTKYA
NTTIPFSTSWNMPVNTGPDIALPSTT
PQLPLVPSGPIDTTYAITIQGSNYYLLP
LLYFFLKKRKRIPYFFSDGYVAV
Page 32 of 48
% Nucleotide
Change
27.6%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV45 E1
ATGGCGGATCCAGAAGGTACCGACGGGGAG
GGAACGGGGTGTAATGGCTGGTTCTTTGTAG
AAACAATTGTAGAGAAAAAAACAGGGGATG
TAATATCAGATGATGAGGATGAAACTGCAAC
AGATACAGGGTCGGATATGGTAGATTTTATT
GACACACAATTATCCATTTGTGAACAGGCAG
AGCAAGAGACAGCACAGGCATTGTTCCATGC
GCAGGAAGTTCAGAATGATGCACAGGTGTTG
CATCTTTTAAAACGAAAGTTTGCAGGAGGCA
GCAAGGAAAACAGTCCATTAGGGGAGCAGC
TAAGTGTGGATACGGATCTAAGTCCACGGTT
ACAAGAAATTTCATTAAATAGTGGGCACAAA
AAAGCAAAACGACGGTTGTTTACAATATCAG
ATAGTGGCTATGGCTGTTCTGAAGTGGAAGC
TGCAGAGACTCAGGTAACTGTAAACACTAAT
GCGGAAAATGGCGGCAGTGTACATAGTACAC
AAAGTAGTGGTGGGGATAGTAGTGACAATG
CAGAAAATGTAGATCCGCATTGCAGTATTAC
AGAACTAAAGGAGCTATTACAAGCAAGTAAC
AAAAAGGCTGCAATGCTGGCAGTATTTAAAG
ACATATATGGGCTGTCATTTACGGATTTGGTT
AGAAATTTTAAAAGTGATAAAACAACATGTA
CAGATTGGGTAATGGCTATATTTGGAGTTAA
TCCAACGGTAGCAGAAGGCTTTAAAACATTA
ATTAAACCAGCAACGTTATACGCCCATATCCA
ATGTTTAGATTGTAAATGGGGAGTATTAATA
TTAGCTTTATTAAGATATAAATGTGGCAAAAA
TAGACTAACTGTTGCAAAAGGCTTAAGCACA
TTGTTGCACGTACCTGAAACATGTATGTTAAT
TGAACCACCAAAATTGCGAAGTAGTGTTGCA
GCATTATACTGGTATAGAACAGGTATATCCA
ATATTAGTGAAGTAAGTGGAGACACACCTGA
GTGGATACAAAGACTGACAATTATTCAACAT
GGTATTGACGATAGTAATTTTGATTTGTCAGA
CATGGTGCAATGGGCATTTGATAATGACCTT
ACAGATGAAAGTGATATGGCATTTCAATATG
CCCAATTAGCAGACTGCAACAGTAATGCAGC
TGCATTTTTAAAAAGTAACTGCCAAGCCAAAT
ATTTAAAAGATTGTGCTGTAATGTGTAGACAT
TATAAAAGAGCACAAAAACGCCAAATGAATA
TGTCTCAATGGATTAAATATAGATGTTCCAAA
ATAGATGAAGGTGGGGATTGGAGACCCATA
GTACAATTCCTAAGATATCAGGGAGTAGAAT
TTATTAGCTTTTTAAGGGCACTAAAGGAATTT
CTTAAAGGAACACCAAAAAAAAATTGTATAC
TGTTATATGGACCTGCAAATACAGGAAAATC
GTATTTTGGAATGAGTTTTATACATTTCCTAC
AAGGTGCAATAATATCATTTGTAAATTCAAAC
AGCCATTTTTGGTTAGAACCGTTAGCAGATAC
TAAGGTAGCCATGTTGGATGATGCCACACAC
ACGTGTTGGACATATTTTGATAATTATATGAG
AAATGCATTAGATGGTAATCCTATAAGTATA
GACAGAAAGCATAAACCATTATTACAGCTAA
AATGTCCTCCAATCCTATTAACATCCAATATT
GATCCAGCAAAAGATAATAAATGGCCATATT
TAGAAAGTAGGGTGACGGTATTTACATTTCC
ACATGCATTTCCATTTGATAAAAATGGTAATC
CAGTATATGAAATAAATGATAAAAATTGGAA
ATGTTTTTTTGAAAGGACATGGTCCAGATTAG
ATTTGCACGAGGACGATGAAGATGCAGACAC
CGAAGGAATCCCTTTCGGAACGTTTAAGTGC
GTTACAGGACAAAATACTAGACCACTATGA
ATGGCAGATCCTGAGGGGACTGACGGGGAG
GGGACTGGGTGTAATGGGTGGTTTTTCGTGG
AGACTATCGTGGAGAAGAAGACTGGGGATG
TGATCTCCGACGATGAGGATGAAACTGCCAC
CGACACAGGCTCCGACATGGTCGATTTCATC
GACACTCAGCTGTCTATTTGCGAGCAGGCTG
AGCAGGAAACCGCCCAGGCTCTGTTCCACGC
CCAGGAAGTGCAGAACGACGCTCAGGTCCTG
CATCTGCTGAAGCGCAAATTTGCCGGCGGGT
CTAAGGAGAATAGTCCACTGGGAGAACAGCT
GAGTGTGGACACAGATCTGTCACCCCGGCTG
CAGGAGATCAGCCTGAACTCCGGCCACAAGA
AAGCCAAACGGAGACTGTTCACCATTTCCGA
TTCTGGATACGGCTGCAGCGAGGTGGAAGCC
GCTGAGACACAAGTGACTGTCAACACCAATG
CAGAAAATGGAGGCTCTGTGCACAGTACACA
GAGCTCCGGGGGAGATTCTAGTGACAACGCC
GAAAATGTCGACCCTCATTGTAGTATCACTGA
GCTGAAGGAACTGCTGCAGGCTTCAAACAAG
AAAGCAGCCATGCTGGCAGTGTTTAAAGATA
TCTACGGCCTGTCATTCACCGACCTGGTCCGG
AACTTTAAGAGCGATAAAACCACATGTACAG
ACTGGGTCATGGCCATCTTCGGCGTGAATCC
CACTGTCGCTGAGGGGTTTAAGACTCTGATC
AAACCTGCTACCCTGTACGCACACATTCAGTG
CCTGGATTGTAAGTGGGGCGTGCTGATCCTG
GCTCTGCTGAGGTATAAGTGCGGAAAAAATC
GCCTGACAGTGGCAAAGGGCCTGTCCACTCT
GCTGCATGTCCCCGAGACCTGTATGCTGATC
GAACCCCCTAAACTGAGATCAAGCGTGGCTG
CACTGTACTGGTATAGGACAGGGATCTCAAA
CATTAGCGAGGTCTCCGGAGACACCCCTGAA
TGGATCCAGAGACTGACAATCATTCAGCACG
GCATTGACGATTCAAACTTCGATCTGAGCGA
CATGGTGCAGTGGGCATTTGACAATGATCTG
ACCGATGAGTCTGACATGGCCTTCCAGTACG
CACAGCTGGCCGATTGCAACAGCAATGCCGC
TGCATTTCTGAAGTCCAACTGTCAGGCAAAGT
ACCTGAAAGACTGCGCCGTGATGTGTAGGCA
TTATAAGCGCGCCCAGAAACGACAGATGAAT
ATGTCTCAGTGGATCAAGTACAGATGCAGTA
AAATTGATGAGGGCGGGGACTGGCGACCAA
TCGTGCAGTTCCTGCGGTATCAGGGCGTCGA
GTTCATTTCTTTTCTGAGGGCCCTGAAGGAAT
TTCTGAAAGGGACCCCTAAGAAAAACTGTAT
CCTGCTGTACGGCCCAGCCAATACCGGGAAG
TCCTATTTCGGAATGTCTTTCATTCACTTTCTG
CAGGGGGCTATCATCAGTTTCGTGAACAGTA
ACTCACATTTCTGGCTGGAGCCCCTGGCCGAT
ACTAAAGTCGCTATGCTGGACGATGCAACTC
ACACCTGCTGGACCTACTTTGACAACTATATG
CGCAATGCCCTGGATGGAAATCCAATCAGCA
TTGACCGAAAGCATAAACCCCTGCTGCAGCT
GAAGTGTCCACCCATCCTGCTGACCAGCAAC
ATTGATCCCGCTAAGGACAACAAGTGGCCTT
ACCTGGAGTCCCGCGTGACAGTCTTCACTTTT
CCTCACGCATTCCCATTTGATAAGAACGGCAA
TCCCGTGTATGAGATCAACGACAAGAATTGG
AAATGCTTCTTTGAACGGACCTGGAGCAGAC
TGGACCTGCATGAGGACGATGAAGACGCCG
ATACAGAAGGGATTCCTTTCGGAACTTTTAAG
TGTGTGACCGGGCAGAACACAAGGCCACTG
MADPEGTDGEGTGCNGWFFVETIV
EKKTGDVISDDEDETATDTGSDMVD
FIDTQLSICEQAEQETAQALFHAQEV
QNDAQVLHLLKRKFAGGSKENSPLG
EQLSVDTDLSPRLQEISLNSGHKKAK
RRLFTISDSGYGCSEVEAAETQVTVN
TNAENGGSVHSTQSSGGDSSDNAE
NVDPHCSITELKELLQASNKKAAMLA
VFKDIYGLSFTDLVRNFKSDKTTCTD
WVMAIFGVNPTVAEGFKTLIKPATLY
AHIQCLDCKWGVLILALLRYKCGKNR
LTVAKGLSTLLHVPETCMLIEPPKLRS
SVAALYWYRTGISNISEVSGDTPEWI
QRLTIIQHGIDDSNFDLSDMVQWAF
DNDLTDESDMAFQYAQLADCNSNA
AAFLKSNCQAKYLKDCAVMCRHYKR
AQKRQMNMSQWIKYRCSKIDEGG
DWRPIVQFLRYQGVEFISFLRALKEFL
KGTPKKNCILLYGPANTGKSYFGMSF
IHFLQGAIISFVNSNSHFWLEPLADTK
VAMLDDATHTCWTYFDNYMRNAL
DGNPISIDRKHKPLLQLKCPPILLTSNI
DPAKDNKWPYLESRVTVFTFPHAFP
FDKNGNPVYEINDKNWKCFFERTW
SRLDLHEDDEDADTEGIPFGTFKCVT
GQNTRPL
Page 33 of 48
% Nucleotide
Change
26.4%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV45 E2
ATGAAGATGCAGACACCGAAGGAATCCCTTT
CGGAACGTTTAAGTGCGTTACAGGACAAAAT
ACTAGACCACTATGAAAATGACAGTAAAGAC
ATAAACAGCCAAATAAGTTATTGGCAACTTAT
ACGTTTGGAAAATGCAATACTATTTACAGCAA
GGGAACATGGTATTACCAAACTAAACCACCA
GGTGGTGCCTCCTATTAACATTTCAAAAAGCA
AAGCACATAAAGCTATTGAACTGCAAATGGC
CTTAAAGGGCCTTGCACAAAGCAAGTATAAC
AATGAGGAATGGACACTGCAAGATACATGCG
AGGAACTATGGAATACAGAACCGTCGCAGTG
TTTTAAAAAAGGCGGTAAAACCGTGCACGTA
TACTTTGATGGCAACAAGGACAACTGTATGA
ACTATGTAGTATGGGACAGTATATATTATATA
ACTGAGACAGGGATATGGGACAAAACAGCA
GCATGTGTTAGCTATTGGGGTGTATATTATAT
AAAAGATGGAGATACCACATATTATGTACAA
TTTAAAAGCGAATGTGAGAAATATGGAAATA
GTAATACGTGGGAAGTACAATATGGGGGCA
ATGTAATTGATTGTAATGACTCTATGTGCAGT
ACCAGTGACGACACGGTATCCGCTACTCAGA
TTGTTAGACAGCTACAACACGCCTCCACGTCG
ACCCCCAAAACCGCATCCGTGGGCACCCCAA
AACCCCACATCCAGACGCCGGCTACTAAGCG
ACCTAGACAGTGTGGACTCACAGAGCAGCAC
CACGGACGTGTCAACACCCACGTGCACAACC
CGCTCCTGTGTTCAAGTACAAGTAACAACAA
AAGAAGGAAAGTGTGTAGTGGTAACACTAC
GCCTATAATACACTTAAAAGGTGACAAAAAC
AGTTTGAAATGTTTAAGATATAGGCTACGCA
AATATGCAGACCATTACTCAGAAATATCCTCC
ACCTGGCATTGGACAGGTTGTAATAAAAACA
CTGGTATATTAACTGTAACATATAATAGTGAG
GTACAAAGAAATACCTTTTTGGATGTAGTTAC
TATTCCTAACAGTGTACAAATCTCGGTGGGAT
ACATGACTATATGA
ATGACTCTATGTGCAGTACCAGTGACGACAC
GGTATCCGCTACTCAGATTGTTAGACAGCTAC
AACACGCCTCCACGTCGACCCCCAAAACCGC
ATCCGTGGGCACCCCAAAACCCCACATCCAG
ACGCCGGCTACTAAGCGACCTAGACAGTGTG
GACTCACAGAGCAGCACCACGGACGTGTCAA
CACCCACGTGCACAACCCGCTCCTGTGTTCAA
GTACAAGTAACAACAAAAGAAGGAAAGTGT
GTAGTGGTAACACTACGCCTATAA
ATGCTATCTTTAGTGTTTTTATTGTGCTTTTCT
GTGTGCCTTTATGTGTGCTGCAATGTCCCGCT
TGTGCAGTCTGTCTATGTGTGTGCTTTTGCTT
GGTTGTTGGTGTTTCTTTTTATAGTTGTTATTA
CATCCCCATTAACAGCATTTGCTGTATACATT
TGTTGCTATTTACTACCTATGTTTGTATTACAT
ATGCATGCTTTACACACCATACAATAA
ATGGCGCGCTTTGACGATCCAAAGCAACGAC
CCTACAAGCTACCAGATTTGTGCACAGAATTG
AATACATCACTACAAGACGTATCTATTGCCTG
TGTATATTGCAAAGCAACATTGGAACGCACA
GAGGTATATCAATTTGCTTTTAAAGATTTATG
TATAGTGTATAGAGACTGTATAGCATATGCT
GCATGCCATAAATGTATAGACTTTTATTCCAG
AATTAGAGAATTAAGATATTATTCAAACTCTG
TATATGGAGAGACACTGGAAAAAATAACTAA
TACAGAGTTGTATAATTTGTTAATAAGGTGCC
TGCGGTGCCAGAAACCATTGAACCCAGCAGA
AAAACGTAGACACCTTAAGGACAAACGAAGA
TTTCACAGCATAGCTGGACAGTACCGAGGGC
AGTGTAATACATGTTGTGACCAGGCACGGCA
AGAAAGACTTCGCAGACGTAGGGAAACACA
AGTATAG
ATGAAGATGCAGACTCCCAAGGAAAGCCTGA
GCGAGAGACTGAGCGCACTGCAGGATAAGA
TTCTGGACCATTACGAAAACGACTCCAAGGA
CATCAATAGCCAGATTTCCTACTGGCAGCTGA
TCAGGCTGGAGAACGCTATTCTGTTCACTGC
ACGCGAACACGGAATCACCAAGCTGAATCAT
CAGGTGGTCCCCCCTATCAACATTTCAAAGAG
CAAAGCACACAAAGCCATTGAGCTGCAGATG
GCCCTGAAGGGCCTGGCTCAGAGCAAATATA
ACAATGAGGAATGGACCCTGCAGGATACATG
CGAGGAACTGTGGAATACCGAACCATCCCAG
TGTTTCAAGAAAGGCGGGAAGACAGTGCAT
GTCTACTTTGACGGCAACAAAGATAATTGCAT
GAACTATGTGGTCTGGGACTCTATCTACTATA
TTACAGAGACTGGGATCTGGGATAAGACAGC
CGCTTGTGTGAGTTACTGGGGCGTCTACTAT
ATCAAGGACGGGGATACCACATACTATGTGC
AGTTTAAGTCAGAGTGCGAAAAATACGGGAA
CAGCAATACCTGGGAAGTGCAGTATGGAGG
CAATGTCATCGACTGCAACGATAGTATGTGTT
CCACTTCTGACGATACCGTGTCAGCTACACAG
ATTGTGCGACAGCTGCAGCACGCAAGTACCT
CAACACCCAAGACAGCCTCTGTGGGCACTCC
AAAACCCCATATCCAGACACCTGCCACTAAG
AGGCCACGCCAGTGTGGACTGACAGAGCAG
CACCATGGCAGGGTGAATACTCACGTCCATA
ACCCCCTGCTGTGCAGCTCCACCTCCAACAAT
AAGCGGAGAAAAGTGTGTTCTGGGAATACTA
CCCCTATCATTCACCTGAAGGGAGACAAAAA
CAGCCTGAAGTGTCTGCGATACCGGCTGAGA
AAATACGCCGATCACTATTCCGAGATCTCTAG
TACTTGGCATTGGACCGGGTGCAACAAGAAT
ACTGGAATTCTGACTGTGACCTACAATAGCG
AAGTGCAGCGGAACACCTTCCTGGACGTGGT
CACAATCCCTAACTCTGTGCAGATTAGTGTCG
GCTATATGACCATC
ATGACCCTGTGTGCCGTCCCCGTGACCACCC
GCTACCCACTGCTGCGACTGCTGGATAGTTA
CAATACCCCACCCCGAAGACCCCCTAAGCCAC
ACCCTTGGGCACCACAGAACCCAACTTCACG
GAGAAGGCTGCTGAGCGACCTGGATTCTGTG
GACAGTCAGAGCTCCACCACAGATGTGTCCA
CCCCCACATGCACTACCCGCAGTTGTGTCCAG
GTGCAGGTCACAACTAAGGAGGGCAAATGC
GTGGTCGTGACACTGCGACTG
ATGCTGAGTCTGGTCTTTCTGCTGTGTTTTTCC
GTCTGTCTGTATGTGTGTTGTAATGTCCCCCT
GGTGCAGTCCGTCTATGTCTGTGCCTTCGCTT
GGCTGCTGGTGTTCCTGTTTATCGTGGTCATT
ACCAGCCCCCTGACAGCATTCGCCGTGTACAT
CTGCTGTTATCTGCTGCCTATGTTTGTCCTGC
ACATGCATGCTCTGCACACCATTCAG
ATGGCCCGATTTGATGACCCTAAACAGCGCC
CCTATAAACTGCCCGATCTGTGTACCGAACTG
AATACTTCTCTGCAGGATGTGAGCATCGCAT
GCGTGTACTGTAAGGCCACTCTGGAGCGAAC
CGAAGTCTATCAGTTCGCTTTTAAAGACCTGT
GCATCGTGTACCGCGATTGTATTGCATATGCC
GCTTGCCACAAGTGTATCGACTTCTACTCTCG
CATTCGAGAGCTGAGATACTATAGCAACTCC
GTCTACGGCGAGACCCTGGAAAAAATCACCA
ACACAGAGCTGTATAATCTGCTGATTCGGTG
CCTGAGATGTCAGAAGCCCCTGAACCCTGCC
GAAAAACGGAGACACCTGAAGGACAAAAGG
CGCTTTCATAGCATTGCCGGCCAGTATCGGG
GGCAGTGCAATACATGCTGTGATCAGGCTAG
GCAGGAGCGCCTGCGACGGAGAAGGGAAAC
TCAGGTG
MKMQTPKESLSERLSALQDKILDHYE
NDSKDINSQISYWQLIRLENAILFTAR
EHGITKLNHQVVPPINISKSKAHKAIE
LQMALKGLAQSKYNNEEWTLQDTC
EELWNTEPSQCFKKGGKTVHVYFDG
NKDNCMNYVVWDSIYYITETGIWD
KTAACVSYWGVYYIKDGDTTYYVQF
KSECEKYGNSNTWEVQYGGNVIDC
NDSMCSTSDDTVSATQIVRQLQHAS
TSTPKTASVGTPKPHIQTPATKRPRQ
CGLTEQHHGRVNTHVHNPLLCSSTS
NNKRRKVCSGNTTPIIHLKGDKNSLK
CLRYRLRKYADHYSEISSTWHWTGC
NKNTGILTVTYNSEVQRNTFLDVVTI
PNSVQISVGYMTI
HPV45 E4
HPV45 E5
HPV45 E6
Page 34 of 48
% Nucleotide
Change
25.4%
MTLCAVPVTTRYPLLRLLDSYNTPPR
RPPKPHPWAPQNPTSRRRLLSDLDS
VDSQSSTTDVSTPTCTTRSCVQVQV
TTKEGKCVVVTLRL
28.5%
MLSLVFLLCFSVCLYVCCNVPLVQSV
YVCAFAWLLVFLFIVVITSPLTAFAVYI
CCYLLPMFVLHMHALHTIQ
24.2%
MARFDDPKQRPYKLPDLCTELNTSL
QDVSIACVYCKATLERTEVYQFAFKD
LCIVYRDCIAYAACHKCIDFYSRIRELR
YYSNSVYGETLEKITNTELYNLLIRCLR
CQKPLNPAEKRRHLKDKRRFHSIAG
QYRGQCNTCCDQARQERLRRRRET
QV
25.7%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV45 E7
ATGCATGGACCCCGGGAAACACTGCAAGAAA
TTGTATTGCATTTGGAACCTCAGAATGAATTA
GATCCTGTTGACCTGTTGTGTTACGAGCAATT
AAGCGAGTCAGAGGAGGAAAACGATGAAGC
AGATGGAGTTAGTCATGCACAACTACCAGCC
CGACGAGCCGAACCACAGCGTCACAAAATTT
TGTGTGTATGTTGTAAGTGTGACGGCAGAAT
TGAGCTTACAGTAGAGAGCTCGGCAGAGGA
CCTTAGAACACTACAGCAGCTGTTTTTGAGCA
CCTTGTCCTTTGTGTGTCCGTGGTGTGCACTA
ACCAATAA
ATGGCTTTGTGGCGGCCTAGTGACAGTACGG
TATATCTTCCACCACCTTCTGTGGCCAGAGTT
GTCAGCACTGATGATTATGTGTCTCGCACAA
GCATATTTTATCATGCAGGCAGTTCCCGATTA
TTAACTGTAGGCAATCCATATTTTAGGGTTGT
ACCTAATGGTGCAGGTAATAAACAGGCTGTT
CCTAAGGTATCCGCATATCAGTATAGGGTGT
TTAGAGTAGCTTTACCCGATCCTAATAAATTT
GGATTACCTGATTCTACTATATATAATCCTGA
AACACAACGTTTGGTTTGGGCATGTGTAGGT
ATGGAAATTGGTCGTGGGCAGCCTTTAGGTA
TTGGCCTAAGTGGCCATCCATTTTATAATAAA
TTGGATGATACAGAAAGTGCTCATGCAGCTA
CAGCTGTTATTACGCAGGATGTTAGGGATAA
TGTGTCAGTTGATTATAAGCAAACACAGCTGT
GTATTTTAGGTTGTGTACCTGCTATTGGTGAG
CACTGGGCCAAGGGCACACTTTGTAAACCTG
CACAATTGCAACCTGGTGACTGTCCTCCTTTG
GAACTTAAAAACACCATTATTGAGGATGGTG
ATATGGTGGATACAGGTTATGGGGCAATGGA
TTTTAGTACATTGCAGGATACAAAGTGCGAG
GTTCCATTAGACATTTGTCAATCCATCTGTAA
ATATCCAGATTATTTGCAAATGTCTGCTGATC
CCTATGGGGATTCTATGTTTTTTTGCCTACGC
CGTGAACAACTGTTTGCAAGACATTTTTGGAA
TAGGGCAGGTGTTATGGGTGACACAGTACCT
ACGGACCTATATATTAAAGGCACTAGCGCTA
ATATGCGTGAAACCCCTGGCAGTTGTGTGTA
TTCCCCTTCTCCCAGTGGCTCTATTATTACTTC
TGATTCTCAATTATTTAATAAGCCATATTGGT
TACATAAGGCCCAGGGCCATAACAATGGTAT
TTGTTGGCATAATCAGTTGTTTGTTACTGTAG
TGGACACTACCCGCAGTACTAATTTAACATTA
TGTGCCTCTACACAAAATCCTGTGCCAAGTAC
ATATGACCCTACTAAGTTTAAGCAGTATAGTA
GACATGTGGAGGAATATGATTTACAGTTTAT
TTTTCAGTTGTGCACTATTACTTTAACTGCAG
AGGTTATGTCATATATCCATAGTATGAATAGT
AGTATATTAGAAAATTGGAATTTTGGTGTCCC
TCCACCACCTACTACAAGTTTGGTGGATACAT
ATCGTTTTGTGCAATCAGTTGCTGTTACCTGT
CAAAAGGATACTACACCTCCAGAAAAGCAGG
ATCCATATGATAAATTAAAGTTTTGGACTGTT
GACCTAAAGGAAAAATTTTCCTCCGATTTGGA
TCAATATCCCCTTGGTCGAAAGTTTTTAGTTC
AGGCTGGGTTACGTCGTAGGCCTACCATAGG
ACCTCGTAAGCGTCCTGCTGCTTCCACGTCTA
CTGCATCTACTGCATCTAGGCCTGCCAAACGT
GTACGTATACGTAGTAAGAAATAA
ATGCATGGACCAAGAGAAACCCTGCAGGAA
ATCGTGCTGCATCTGGAGCCTCAGAATGAAC
TGGACCCTGTGGACCTGCTGTGCTATGAACA
GCTGTCTGAGAGTGAGGAAGAGAACGACGA
GGCAGATGGCGTGTCTCACGCACAGCTGCCA
GCACGAAGAGCTGAACCTCAGAGGCATAAG
ATCCTGTGCGTGTGCTGTAAATGTGACGGGC
GCATTGAACTGACTGTCGAGAGCTCCGCTGA
AGATCTGCGAACCCTGCAGCAGCTGTTCCTG
TCAACACTGAGCTTTGTCTGCCCCTGGTGTGC
CACCAATCAG
ATGGCTCACAATATTATCTACGGGCACGGGA
TTATCATCTTCCTGAAAAATGTCAATGTCTTCC
CTATCTTCCTGCAGATGGCACTGTGGAGGCC
CTCCGACTCTACAGTGTACCTGCCCCCTCCAT
CTGTCGCTCGAGTGGTCAGTACTGACGATTA
CGTGTCCCGGACCTCTATCTTCTATCACGCAG
GCAGCTCCAGACTGCTGACCGTGGGGAACCC
TTATTTTAGGGTGGTCCCAAACGGCGCCGGG
AATAAGCAGGCTGTGCCTAAAGTCAGCGCAT
ACCAGTATCGGGTGTTCAGAGTCGCCCTGCC
AGACCCCAACAAGTTTGGCCTGCCCGATTCCA
CCATCTACAATCCTGAGACACAGCGACTGGT
GTGGGCATGCGTCGGAATGGAAATCGGACG
AGGCCAGCCACTGGGGATTGGACTGTCTGGC
CACCCCTTCTACAACAAACTGGACGATACCGA
GAGTGCACATGCCGCTACCGCCGTGATCACA
CAGGACGTCCGCGATAATGTGTCCGTCGATT
ATAAGCAGACTCAGCTGTGCATCCTGGGCTG
TGTGCCAGCTATTGGGGAACATTGGGCAAAG
GGAACCCTGTGCAAACCAGCACAGCTGCAGC
CAGGGGACTGTCCACCTCTGGAGCTGAAGAA
TACCATCATTGAAGACGGCGATATGGTGGAT
ACAGGCTACGGGGCCATGGACTTTTCTACTCT
GCAGGATACCAAGTGCGAGGTGCCTCTGGAC
ATCTGCCAGAGCATTTGTAAATACCCTGATTA
TCTGCAGATGTCAGCCGACCCATATGGCGAT
AGCATGTTCTTTTGTCTGCGGAGAGAGCAGC
TGTTCGCCAGGCACTTTTGGAACCGCGCTGG
AGTGATGGGCGACACTGTCCCCACCGATCTG
TACATCAAGGGAACAAGTGCCAATATGCGGG
AAACTCCTGGCTCATGTGTGTATAGTCCTTCA
CCAAGCGGGTCCATCATTACATCTGACAGTCA
GCTGTTCAACAAGCCTTACTGGCTGCACAAA
GCTCAGGGGCATAACAATGGAATTTGCTGGC
ATAATCAGCTGTTTGTGACAGTGGTCGATAC
CACACGGTCTACAAACCTGACTCTGTGTGCAT
CCACTCAGAATCCCGTGCCTTCTACATACGAC
CCAACTAAGTTCAAACAGTACAGCAGACACG
TCGAGGAATATGATCTGCAGTTCATCTTTCAG
CTGTGCACCATTACACTGACTGCCGAAGTGAT
GAGCTACATCCATTCCATGAACTCTAGTATTC
TGGAAAACTGGAATTTCGGCGTGCCACCCCC
TCCAACTACCAGCCTGGTGGACACTTATAGAT
TTGTCCAGTCCGTGGCTGTCACCTGTCAGAA
GGATACAACTCCCCCTGAGAAACAGGACCCA
TACGATAAGCTGAAATTCTGGACCGTGGACC
TGAAGGAAAAATTTTCAAGCGACCTGGATCA
GTATCCCCTGGGAAGGAAGTTTCTGGTGCAG
GCAGGACTGAGGCGACGACCAACCATCGGG
CCCCGGAAAAGACCTGCAGCCTCAACCAGCA
CAGCTAGTACAGCATCACGCCCCGCTAAGAG
GGTGCGCATTCGAAGCAAGAAA
MHGPRETLQEIVLHLEPQNELDPVD
LLCYEQLSESEEENDEADGVSHAQLP
ARRAEPQRHKILCVCCKCDGRIELTV
ESSAEDLRTLQQLFLSTLSFVCPWCA
TNQ
HPV45 L1
Page 35 of 48
MALWRPSDSTVYLPPPSVARVVSTD
DYVSRTSIFYHAGSSRLLTVGNPYFRV
VPNGAGNKQAVPKVSAYQYRVFRV
ALPDPNKFGLPDSTIYNPETQRLVW
ACVGMEIGRGQPLGIGLSGHPFYNK
LDDTESAHAATAVITQDVRDNVSVD
YKQTQLCILGCVPAIGEHWAKGTLCK
PAQLQPGDCPPLELKNTIIEDGDMV
DTGYGAMDFSTLQDTKCEVPLDICQ
SICKYPDYLQMSADPYGDSMFFCLR
REQLFARHFWNRAGVMGDTVPTDL
YIKGTSANMRETPGSCVYSPSPSGSII
TSDSQLFNKPYWLHKAQGHNNGIC
WHNQLFVTVVDTTRSTNLTLCASTQ
NPVPSTYDPTKFKQYSRHVEEYDLQF
IFQLCTITLTAEVMSYIHSMNSSILEN
WNFGVPPPPTTSLVDTYRFVQSVAV
TCQKDTTPPEKQDPYDKLKFWTVDL
KEKFSSDLDQYPLGRKFLVQAGLRRR
PTIGPRKRPAASTSTASTASRPAKRV
RIRSKK
% Nucleotide
Change
27.3%
29.6%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV45 L2
ATGGTATCCCACCGTGCAGCACGTCGCAAGC
GGGCCTCTGCAACTGACTTATATAGAACATGT
AAGCAATCCGGTACGTGCCCCCCTGATGTTAT
TAACAAAGTGGAAGGCACAACCTTAGCTGAT
AAAATTTTACAGTGGTCTAGCCTTGGGATATT
TTTGGGTGGCCTTGGCATTGGTACCGGCAGT
GGTTCTGGAGGCCGTACGGGCTATGTACCCT
TAGGGGGCAGGTCTAATACTGTTGTGGATGT
TGGCCCCACTAGGCCACCTGTGGTTATTGAA
CCTGTAGGGCCTACTGATCCATCTATTGTTAC
GTTGGTAGAGGATTCCAGTGTTGTTGCCTCT
GGTGCTCCGGTTCCCACATTTACCGGAACCTC
TGGGTTTGAAATTACGTCTTCTGGTACTACCA
CACCAGCTGTGTTGGACATCACACCTACCGTG
GACTCTGTTTCTATTTCGTCAACTAGTTTTACA
AATCCTGCATTTTCTGATCCCTCTATTATTGAG
GTGCCCCAAACAGGGGAGGTATCAGGTAATA
TATTTGTTGGTACACCAACATCGGGCAGCCAT
GGATATGAGGAAATACCTTTACAAACATTTG
CATCTTCTGGGTCAGGTACGGAACCCATTAG
TAGTACCCCCCTCCCTACTGTGCGGCGGGTAC
GGGGTCCCCGCCTGTATAGTAGGGCTAATCA
ACAGGTCCGTGTGTCCACCTCACAGTTTTTAA
CACATCCCTCATCGTTGGTTACATTTGATAAT
CCAGCTTATGAGCCCCTGGACACCACACTATC
CTTTGAGCCTACCAGTAATGTTCCTGATTCCG
ATTTTATGGATATTATTCGTTTGCATAGGCCA
GCATTATCCTCTAGACGTGGCACTGTTAGATT
TAGTAGATTGGGTCAAAGGGCAACCATGTTT
ACACGTAGTGGTAAACAAATAGGGGGTAGG
GTACATTTTTACCATGATATAAGCCCCATTGC
TGCTACAGAGGAAATTGAATTGCAGCCTTTA
ATTAGTGCTACAAATGATAGTGACCTGTTTGA
TGTATATGCAGACTTCCCACCTCCTGCGTCCA
CTACACCTAGCACTATACACAAATCATTTACA
TATCCAAAGTATTCCTTGACCATGCCTTCTACT
GCTGCATCCTCTTACAGTAATGTTACAGTACC
ATTAACATCTGCATGGGATGTACCTATATATA
CTGGCCCGGACATTATATTGCCATCCCATACT
CCTATGTGGCCTAGTACATCTCCTACCAATGC
TTCCACCACCACCTATATAGGTATTCATGGCA
CACAATATTATTTATGGCCATGGTATTATTAT
TTTCCTAAAAAACGTAAACGTATTCCCTATTTT
TTTGCAGATGGCTTTGTGGCGGCCTAG
ATGGTGTCACATAGAGCAGCCAGAAGAAAA
AGGGCATCAGCAACCGACCTGTATAGAACCT
GTAAGCAGAGCGGAACTTGTCCTCCCGACGT
GATCAACAAGGTCGAGGGAACCACACTGGCC
GATAAAATTCTGCAGTGGAGCTCCCTGGGAA
TCTTCCTGGGAGGACTGGGAATTGGAACCGG
AAGTGGATCAGGAGGACGAACAGGATATGT
CCCACTGGGAGGAAGAAGCAATACAGTGGT
CGACGTGGGCCCTACTAGGCCACCTGTGGTC
ATCGAGCCAGTCGGACCTACCGACCCATCCA
TTGTGACACTGGTCGAAGATTCTAGTGTGGT
CGCATCCGGAGCTCCAGTGCCAACCTTCACA
GGAACTTCTGGCTTTGAGATCACCTCAAGCG
GAACTACCACACCTGCCGTGCTGGACATCAC
CCCAACAGTGGATAGCGTCTCCATTTCCTCTA
CAAGCTTCACTAACCCTGCTTTTAGTGATCCA
TCAATCATTGAGGTGCCTCAGACCGGAGAAG
TCAGCGGCAATATCTTCGTGGGCACTCCAACC
TCTGGGAGTCACGGATACGAGGAAATTCCCC
TGCAGACTTTTGCCAGTTCAGGCTCCGGGAC
CGAACCAATCAGCTCCACACCTCTGCCAACTG
TGCGGAGAGTCAGAGGGCCCAGGCTGTATTC
TAGGGCAAACCAGCAGGTGCGCGTCTCAACA
AGCCAGTTCCTGACTCACCCCTCTAGTCTGGT
GACATTTGACAACCCAGCCTACGAGCCCCTG
GATACTACCCTGTCTTTCGAACCCACCAGTAA
TGTGCCTGACTCAGATTTTATGGACATCATTC
GCCTGCATCGACCTGCTCTGTCAAGCAGGCG
AGGAACCGTGCGGTTCAGTAGACTGGGACA
GCGAGCAACCATGTTTACACGAAGCGGCAAG
CAGATCGGAGGACGAGTGCACTTCTATCATG
ATATCAGCCCTATTGCCGCTACAGAGGAAAT
CGAGCTGCAGCCACTGATTAGCGCCACCAAT
GACTCCGATCTGTTCGACGTGTACGCAGATTT
TCCACCCCCTGCCAGCACAACTCCATCCACTA
TTCATAAGAGCTTTACCTATCCCAAATACTCTC
TGACCATGCCTAGTACAGCAGCCTCCTCTTAT
TCAAACGTGACTGTCCCTCTGACCAGCGCTTG
GGACGTGCCAATCTACACAGGCCCCGATATC
ATTCTGCCTTCCCACACTCCCATGTGGCCTTCC
ACTTCTCCAACCAATGCATCTACCACAACTTA
TATCGGCATTCATGGGACCCAGTACTATCTGT
GGCCATGGTACTATTACTTCCCCAAGAAACG
AAAACGGATCCCCTACTTCTTTGCTGACGGCT
TTGTGGCTGCA
MVSHRAARRKRASATDLYRTCKQSG
TCPPDVINKVEGTTLADKILQWSSLG
IFLGGLGIGTGSGSGGRTGYVPLGGR
SNTVVDVGPTRPPVVIEPVGPTDPSI
VTLVEDSSVVASGAPVPTFTGTSGFE
ITSSGTTTPAVLDITPTVDSVSISSTSF
TNPAFSDPSIIEVPQTGEVSGNIFVGT
PTSGSHGYEEIPLQTFASSGSGTEPIS
STPLPTVRRVRGPRLYSRANQQVRV
STSQFLTHPSSLVTFDNPAYEPLDTTL
SFEPTSNVPDSDFMDIIRLHRPALSSR
RGTVRFSRLGQRATMFTRSGKQIGG
RVHFYHDISPIAATEEIELQPLISATND
SDLFDVYADFPPPASTTPSTIHKSFTY
PKYSLTMPSTAASSYSNVTVPLTSAW
DVPIYTGPDIILPSHTPMWPSTSPTN
ASTTTYIGIHGTQYYLWPWYYYFPKK
RKRIPYFFADGFVAA
Page 36 of 48
% Nucleotide
Change
28.0%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV51 E1
ATGGACTGTGAAGGTACAGAGGATGAGGGG
GCGGGGTGTAATGGGTGGTTTTTTGTTGAAG
CAATAGTAGAAAAAAAAACAGGAGATAATGT
TTCGGATGATGAGGATGAAAATGCAGATGAT
ACAGGATCTGATTTAATAAACTTTATAGATAG
TGAAACTAGTATTTGCAGTCAGGCGGAACAG
GAGACAGCACGGGCGTTGTTTCAGGCCCAAG
AATTACAGGCAAACAAAGAGGCTGTGCATCA
GTTAAAACGAAAGTTTCTAGTCAGCCCGCGA
AGCAGCCCATTAGGAGACATTACAAATCAAA
ACAACACACACAGCCATAGTCAGGCAAACGA
GTCACAAGTTAAAAGGAGATTACTGGACAGT
TATCCGGACAGCGGATATGGCAATACACAAG
TGGAAACTGTGGAAGCAACGTTGCAGGTAG
ATGGGCAACATGGCGGTTCACAGAACAGTGT
GTGTAGTAGCGGGGGGGGCAGTGTTATGGA
TGTGGAAACAACAGAAAGCTGTGCAAATGTA
GAACTAAACAGTATATGTGAAGTATTAAAAA
GCAGTAATGCAAAAGCAACGTTAATGGCAAA
ATTTAAAGAGTTGTATGGTATTAGTTATAATG
AGTTGGTACGGGTGTTTAAAAGTGATAAAAC
ATGTTGTATAGATTGGGTTTGTGCATTGTTTG
GCGTTTCCCCAATGGTAGCAGAAAATTTAAA
AACACTAATTAAGCCATTTTGCATGTACTACC
ATATACAATGTTTATCATGTGATTGGGGCACC
ATTGTATTAATGCTAATTAGGTTTTCATGTGC
AAAAAACAGAACAACAATTGCTAAGTGTTTA
AGTACATTAGTAAATATCCCACAATCACAAAT
GTTTATAGAACCACCAAAATTACGTAGTACAC
CTGTGGCATTATATTTTTATAGAACAGGCATA
TCAAACATTAGCAATACATATGGAGAGACAC
CTGAATGGATTACACGACAAACGCAACTACA
ACATAGTTTTGAGGATAGTACCTTTGAATTAT
CACAAATGGTGCAATGGGCATTTGACCATGA
AGTATTAGATGATAGTGAAATAGCATTTCATT
ATGCACAATTAGCAGATATAGATAGTAATGC
TGCAGCGTTTTTAAAGAGTAATTGCCAAGCA
AAATATGTAAAAGATTGTGGGACCATGGCAC
GGCATTACAAACGAGCACAAAGAAAATCATT
ATCTATGTCAGCCTGGATAAGGTATAGATGT
GATAGAGCAAAGGATGGAGGCAACTGGAGA
GAAATTGCTAAATTTTTAAGATATCAAGGTGT
AAACTTTATGTCCTTTATTCAAATGTTTAAACA
GTTTTTAAAAGGAACACCAAAACACAATTGC
ATAGTCATATATGGCCCACCAAACACAGGCA
AGTCATTATTTGCAATGAGCCTAATGAAGTTT
ATGCAAGGGTCCATTATTTCATATGTAAACTC
TGGTAGTCATTTTTGGTTACAGCCACTAGAG
GATGCTAAAATAGCATTGTTAGATGATGCTA
CGTATGGGTGTTGGACATATATTGATCAGTA
TTTAAGAAACTTTTTAGATGGTAATCCATGTA
GTATAGATAGAAAACATAGGAGTTTAATACA
ATTAGTATGTCCACCATTACTAATAACGTCAA
ACATAAATCCACAAGAGGATGCAAACCTAAT
GTATTTACATACAAGGGTAACAGTATTAAAG
TTTTTAAATACATTTCCATTTGATAACAATGG
GAATGCTGTGTATACATTGAATGATGAAAAT
TGGAAAAATTTTTTTTCCACCACATGGTCCAG
ATTAGATTTGGAGGAGGAAGAGGACAAAGA
AAATGGAGACCCTATGCCACCGTTTAAATGT
GTGCCAGGAGAAAATACTAGACTGTTATGA
ATGGACTGCGAGGGAACTGAGGATGAGGGG
GCTGGGTGTAACGGCTGGTTTTTCGTGGAGG
CTATTGTGGAGAAGAAGACTGGGGATAATGT
GAGCGACGATGAGGACGAAAACGCCGACGA
TACCGGCTCCGACCTGATCAATTTCATTGATA
GTGAGACCTCAATCTGCAGCCAGGCAGAGCA
GGAAACAGCCCGGGCTCTGTTTCAGGCTCAG
GAGCTGCAGGCAAACAAGGAAGCCGTGCAC
CAGCTGAAGCGCAAATTCCTGGTCTCTCCACG
AAGCTCCCCCCTGGGAGATATTACCAATCAG
AACAATACACACAGCCATTCTCAGGCAAACG
AGAGCCAGGTGAAGCGGAGACTGCTGGACT
CTTACCCTGATAGTGGGTATGGAAATACACA
GGTGGAGACTGTCGAAGCAACCCTGCAGGT
GGACGGACAGCATGGAGGGTCCCAGAACTC
TGTCTGTTCTAGTGGAGGCGGGAGCGTGATG
GATGTCGAGACCACAGAATCTTGCGCTAATG
TGGAGCTGAACAGTATCTGTGAAGTCCTGAA
ATCAAGCAATGCAAAGGCCACTCTGATGGCA
AAGTTCAAAGAGCTGTACGGCATCAGTTATA
ACGAACTGGTGAGGGTCTTTAAGTCAGACAA
AACCTGCTGTATTGATTGGGTGTGCGCACTG
TTCGGGGTGTCCCCTATGGTGGCCGAGAACC
TGAAGACCCTGATCAAACCTTTCTGTATGTAC
TACCACATCCAGTGCCTGTCTTGTGACTGGG
GGACAATCGTGCTGATGCTGATTCGGTTCAG
TTGCGCCAAAAATAGAACTACCATCGCTAAG
TGTCTGTCAACCCTGGTGAACATCCCACAGA
GCCAGATGTTTATTGAACCCCCTAAGCTGCGA
TCCACACCCGTCGCCCTGTACTTCTATCGGAC
TGGAATCAGTAACATTTCAAATACCTACGGC
GAGACTCCAGAATGGATCACCCGCCAGACAC
AGCTGCAGCACTCATTCGAGGACAGCACATT
TGAACTGAGCCAGATGGTGCAGTGGGCATTC
GATCACGAGGTCCTGGACGATTCCGAAATCG
CCTTTCATTACGCTCAGCTGGCAGACATTGAT
TCCAATGCCGCTGCATTTCTGAAGTCTAACTG
CCAGGCCAAGTACGTGAAAGACTGTGGCACT
ATGGCTAGACATTATAAACGAGCACAGCGGA
AGAGTCTGTCAATGAGCGCTTGGATCAGATA
CCGGTGCGACAGGGCAAAAGATGGAGGCAA
TTGGAGGGAGATCGCCAAGTTCCTGCGCTAT
CAGGGGGTGAACTTCATGTCTTTTATTCAGAT
GTTCAAGCAGTTTCTGAAAGGAACACCCAAG
CACAATTGTATCGTCATCTACGGCCCACCCAA
CACTGGGAAAAGTCTGTTCGCTATGTCACTG
ATGAAGTTTATGCAGGGGAGCATCATTAGCT
ACGTGAACAGCGGATCTCATTTTTGGCTGCA
GCCCCTGGAGGACGCCAAAATCGCTCTGCTG
GACGATGCCACTTACGGATGCTGGACCTACA
TTGATCAGTATCTGCGGAATTTCCTGGACGG
CAACCCTTGCAGCATCGATAGAAAGCACAGG
TCCCTGATTCAGCTGGTGTGTCCTCCACTGCT
GATCACAAGCAACATTAATCCTCAGGAGGAC
GCCAACCTGATGTACCTGCATACTAGAGTGA
CCGTCCTGAAATTTCTGAATACTTTCCCATTTG
ACAACAATGGCAACGCTGTGTATACCCTGAA
CGATGAAAATTGGAAGAACTTCTTTTCCACAA
CTTGGTCTCGCCTGGATCTGGAGGAAGAGGA
AGACAAAGAGAATGGCGATCCTATGCCCCCT
TTCAAGTGCGTGCCAGGGGAAAACACCCGAC
TGCTG
MDCEGTEDEGAGCNGWFFVEAIVE
KKTGDNVSDDEDENADDTGSDLINF
IDSETSICSQAEQETARALFQAQELQ
ANKEAVHQLKRKFLVSPRSSPLGDIT
NQNNTHSHSQANESQVKRRLLDSYP
DSGYGNTQVETVEATLQVDGQHGG
SQNSVCSSGGGSVMDVETTESCAN
VELNSICEVLKSSNAKATLMAKFKELY
GISYNELVRVFKSDKTCCIDWVCALF
GVSPMVAENLKTLIKPFCMYYHIQCL
SCDWGTIVLMLIRFSCAKNRTTIAKC
LSTLVNIPQSQMFIEPPKLRSTPVALY
FYRTGISNISNTYGETPEWITRQTQL
QHSFEDSTFELSQMVQWAFDHEVL
DDSEIAFHYAQLADIDSNAAAFLKSN
CQAKYVKDCGTMARHYKRAQRKSL
SMSAWIRYRCDRAKDGGNWREIAK
FLRYQGVNFMSFIQMFKQFLKGTPK
HNCIVIYGPPNTGKSLFAMSLMKFM
QGSIISYVNSGSHFWLQPLEDAKIALL
DDATYGCWTYIDQYLRNFLDGNPCS
IDRKHRSLIQLVCPPLLITSNINPQED
ANLMYLHTRVTVLKFLNTFPFDNNG
NAVYTLNDENWKNFFSTTWSRLDLE
EEEDKENGDPMPPFKCVPGENTRLL
Page 37 of 48
% Nucleotide
Change
26.9%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV51 E2
ATGGAGACCCTATGCCACCGTTTAAATGTGT
GCCAGGAGAAAATACTAGACTGTTATGAACT
GGACAGTGATAAATTAGTAGATCAAATTAAC
TATTGGACATTGTTACGATATGAAGCTGCTAT
GTTTTATGCAGCACGGGAAAGAAACTTACGA
ACAATCAATCACCAGGTAGTACCAGCAACAA
CAGTATCAAAACAAAAGGCCTGTCAAGCAAT
TGAAATGCACATGGCCTTACAATCGCTTAACA
AATCAGACTATAACATGGAACCATGGACAAT
GCGGGAGACATGTTATGAACTATGGTGTGTG
GCTCCCAAGCAATGTTTCAAAAAGGGGGGCA
TAACTGTAACAGTTATATTTGATGGAAATAAG
GACAATGCAATGGACTATACAAGCTGGAAAT
TTATATATATATATGATAATGATAAGTGGGTA
AAGACAAATGGAAATGTGGACTATACGGGTA
TATATTACACTGTAAATTCAAAAAAAGAATAT
TATGTACAGTTTAAAGATGAAGCCAAAATAT
ATGGGGCACAACAGTGGGAGGTCTATATGTA
TGGTACTGTAATAACATGTCCTGAATATGTAT
CTAGTACCTGCAGCGACGCGTTATCCACTACT
ACAACTGTTGAACAACTATCAAACACCCCAAC
GACCAATCCCCTTACCACCTGCGTGGGCGCC
AAAGAAGCCCAGACACAACAGCGAAAACGA
CAGCGACTTACTGAGCCCGACTCCTCCACAAT
CTCCCCACTGTCCGTGGACAATACAAACAACC
AAATACACTGTGGAAGTGGAAGCACTAACAC
TGGAGGGCACCAAAGTGCAACTCAGACTGCG
TTTATAGTGCATTTAAAAGGTGATACAAATTG
TTTAAAATGTTTTAGATACAGATTTACAAAAC
ACAAAGGGTTATATAAAAACGTATCCTCAAC
CTGGCATTGGACCAGTAATACTAAAACAGGC
ATTGTTACCATTGTGTTTGACAGTGCACATCA
ACGGGAAACATTTATAAAAACCATTAAAGTA
CCCCCAAGTGTAACACTGTCATTGGGAATTAT
GACACTGTAA
ATGTATCTAGTACCTGCAGCGACGCGTTATCC
ACTACTACAACTGTTGAACAACTATCAAACAC
CCCAACGACCAATCCCCTTACCACCTGCGTGG
GCGCCAAAGAAGCCCAGACACAACAGCGAA
AACGACAGCGACTTACTGAGCCCGACTCCTC
CACAATCTCCCCACTGTCCGTGGACAATACAA
ACAACCAAATACACTGTGGAAGTGGAAGCAC
TAACACTGGAGGGCACCAAAGTGCAACTCAG
ACTGCGTTTATAG
ATGTATAGACATATTGTAACCATTGCAGTGTT
TATTATTTTGCTATTTGTGCTTTGCTTGTGTGT
GTGTCTTGTGTTGTGTTGTTTGTTGCCGCTAC
TGCTGTCCCAATACGTGTTTGCAGCTGCCTTA
TTATTAATTTTATGTTTTTGGTTTGTTGTTGCA
ACATCCCAATTAACTACATTTTTTGTATATTTG
ATTTTTTTTTACTTACCTTGTTTACTTTTACATC
TATATACATTTTTACTTTTGCAATAA
ATGTTCGAAGACAAGAGGGAAAGACCACGA
ACGCTGCATGAATTATGTGAAGCTTTGAACG
TTTCTATGCACAATATACAGGTAGTGTGTGTG
TATTGTAAAAAGGAATTATGTAGAGCAGATG
TATATAATGTAGCATTTACTGAAATTAAGATT
GTATATAGGGATAATAATCCATATGCAGTAT
GCAAACAATGTTTACTGTTTTATTCAAAAATT
AGAGAGTATAGACGTTATAGCAGGTCTGTGT
ATGGTACTACATTAGAGGCAATTACTAAAAA
AAGCTTATATGATTTATCGATAAGGTGTCATA
GATGTCAAAGACCACTTGGGCCTGAAGAAAA
GCAAAAATTGGTGGACGAAAAAAAAAGGTT
CCATGAAATAGCGGGACGTTGGACGGGGCA
ATGCGCTAATTGCTGGCAACGTACACGACAA
CGTAACGAAACCCAAGTGTAA
ATGGAAACTCTGTGTCATAGACTGAACGTGT
GTCAGGAAAAGATTCTGGACTGCTACGAACT
GGACTCCGATAAACTGGTGGATCAGATCAAC
TACTGGACACTGCTGAGATATGAGGCCGCTA
TGTTCTACGCAGCCCGGGAAAGAAACCTGCG
CACCATCAATCATCAGGTGGTCCCAGCCACCA
CAGTGTCTAAGCAGAAGGCCTGCCAGGCAAT
TGAGATGCACATGGCTCTGCAGAGCCTGAAC
AAGTCCGATTATAATATGGAACCCTGGACTAT
GCGAGAGACCTGCTACGAACTGTGGTGTGTG
GCACCTAAGCAGTGTTTCAAGAAAGGCGGGA
TCACAGTGACTGTCATTTTTGACGGCAACAAG
GATAATGCTATGGACTATACAAGTTGGAAAT
TCATCTACATCTACGACAACGATAAGTGGGT
GAAAACTAACGGCAATGTCGATTACACTGGG
ATCTACTATACCGTGAATAGCAAGAAAGAGT
ACTATGTCCAGTTTAAGGACGAAGCTAAAAT
CTATGGAGCACAGCAGTGGGAGGTGTACAT
GTATGGCACCGTCATTACATGCCCCGAATAC
GTGAGCTCCACCTGTTCTGACGCCCTGAGTAC
TACCACAACTGTCGAGCAGCTGAGCAACACA
CCAACCACAAATCCCCTGACTACCTGCGTGG
GAGCAAAGGAGGCTCAGACCCAGCAGAGGA
AACGCCAGCGACTGACAGAACCTGATTCTAG
TACTATCAGTCCACTGTCAGTGGACAACACCA
ACAATCAGATTCACTGTGGGTCCGGATCTACC
AATACAGGAGGCCATCAGTCCGCAACTCAGA
CCGCCTTTATCGTGCACCTGAAGGGGGATAC
TAATTGCCTGAAATGTTTCCGGTACAGATTCA
CCAAGCATAAAGGACTGTACAAGAACGTGTC
AAGCACATGGCACTGGACTTCCAATACAAAA
ACTGGCATCGTGACCATTGTCTTCGACTCTGC
CCATCAGAGGGAGACCTTTATCAAGACCATC
AAGGTGCCCCCTTCAGTCACCCTGAGCCTGG
GGATTATGACACTG
ATGTACCTGGTCCCTGCCGCTACCCGCTATCC
CCTGCTGCAGCTGCTGAACAACTATCAGACTC
CACAGAGGCCCATTCCCCTGCCACCCGCCTG
GGCTCCAAAGAAACCCAGGCACAACAGCGA
GAATGACTCCGATCTGCTGTCTCCTACCCCCC
CTCAGAGCCCTCATTGCCCCTGGACAATCCAG
ACCACAAAGTACACTGTGGAGGTCGAAGCCC
TGACTCTGGAAGGCACCAAAGTGCAGCTGCG
GCTGAGACTG
ATGTATCGCCATATTGTGACTATCGCCGTGTT
CATCATTCTGCTGTTCGTGCTGTGCCTGTGTG
TGTGTCTGGTGCTGTGCTGCCTGCTGCCCCTG
CTGCTGAGCCAGTACGTGTTCGCCGCTGCAC
TGCTGCTGATCCTGTGCTTCTGGTTTGTGGTC
GCCACTTCCCAGCTGACCACATTCTTTGTCTA
CCTGATTTTCTTTTATCTGCCTTGTCTGCTGCT
GCACCTGTATACCTTTCTGCTGCTGCAG
ATGTTTGAAGACAAGCGGGAACGCCCAAGG
ACTCTGCACGAACTGTGTGAAGCACTGAATG
TGAGCATGCATAATATCCAGGTGGTGTGCGT
GTACTGTAAGAAAGAGCTGTGCCGGGCCGAC
GTGTATAACGTCGCTTTCACCGAAATCAAGAT
TGTGTACAGAGATAACAATCCCTATGCTGTCT
GCAAGCAGTGTCTGCTGTTTTACTCCAAAATC
CGCGAGTACCGGAGATATAGCCGATCCGTGT
ACGGAACCACACTGGAAGCCATCACAAAGAA
ATCTCTGTATGACCTGAGTATTCGGTGCCACA
GATGTCAGAGGCCCCTGGGCCCTGAGGAAA
AGCAGAAACTGGTGGATGAGAAGAAAAGGT
TCCATGAAATTGCAGGACGATGGACTGGACA
GTGCGCAAACTGTTGGCAGAGGACTCGCCAG
CGAAATGAGACCCAGGTC
METLCHRLNVCQEKILDCYELDSDKL
VDQINYWTLLRYEAAMFYAARERNL
RTINHQVVPATTVSKQKACQAIEMH
MALQSLNKSDYNMEPWTMRETCY
ELWCVAPKQCFKKGGITVTVIFDGN
KDNAMDYTSWKFIYIYDNDKWVKT
NGNVDYTGIYYTVNSKKEYYVQFKD
EAKIYGAQQWEVYMYGTVITCPEYV
SSTCSDALSTTTTVEQLSNTPTTNPLT
TCVGAKEAQTQQRKRQRLTEPDSST
ISPLSVDNTNNQIHCGSGSTNTGGH
QSATQTAFIVHLKGDTNCLKCFRYRF
TKHKGLYKNVSSTWHWTSNTKTGIV
TIVFDSAHQRETFIKTIKVPPSVTLSLG
IMTL
HPV51 E4
HPV51 E5
HPV51 E6
Page 38 of 48
% Nucleotide
Change
25.1%
MYLVPAATRYPLLQLLNNYQTPQRPI
PLPPAWAPKKPRHNSENDSDLLSPT
PPQSPHCPWTIQTTKYTVEVEALTLE
GTKVQLRLRL
27.6%
MYRHIVTIAVFIILLFVLCLCVCLVLCC
LLPLLLSQYVFAAALLLILCFWFVVAT
SQLTTFFVYLIFFYLPCLLLHLYTFLLLQ
*X
27.5%
MFEDKRERPRTLHELCEALNVSMHN
IQVVCVYCKKELCRADVYNVAFTEIKI
VYRDNNPYAVCKQCLLFYSKIREYRR
YSRSVYGTTLEAITKKSLYDLSIRCHRC
QRPLGPEEKQKLVDEKKRFHEIAGR
WTGQCANCWQRTRQRNETQV
26.3%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV51 E7
ATGCGTGGTAATGTACCACAATTAAAAGATG
TAGTATTGCATTTAACACCACAGACTGAAATT
GACTTGCAATGCTACGAGCAATTTGACAGCT
CAGAGGAGGAGGATGAAGTAGATAATATGC
GTGACCAGCTACCAGAAAGACGGGCTGGAC
AGGCTACGTGTTACAGAATTGAAGCTCCGTG
TTGCAGGTGTTCAAGTGTAGTACAACTGGCA
GTGGAAAGCAGTGGAGACACCCTTCGCGTTG
TACAGCAGATGTTAATGGGCGAACTAAGCCT
GGTTTGCCCGTGTTGTGCGAACAACTAG
ATGGCATTGTGGCGCACTAATGACAGCAAGG
TGTATTTGCCACCTGCACCTGTGTCTCGAATT
GTGAATACAGAAGAATATATCACACGCACCG
GCATATATTACTATGCAGGCAGTTCCAGACTA
ATAACATTAGGACATCCCTATTTTCCAATACC
TAAAACCTCAACGCGTGCTGCTATTCCTAAAG
TATCTGCATTTCAATACAGGGTATTTAGGGTA
CAGTTACCAGATCCTAACAAGTTTGGACTCCC
GGATCCAAATTTATATAATCCAGACACAGATA
GGTTGGTGTGGGGTTGTGTGGGCGTTGAGG
TGGGCAGAGGACAGCCCCTTGGTGTTGGCCT
TAGTGGTCATCCCTTATTTAATAAATATGATG
ACACAGAAAATTCACGCATAGCAAATGGCAA
TGCACAACAAGATGTTAGAGATAACACATCT
GTTGACAACAAACAGACTCAGTTATGTATAAT
AGGCTGTGCTCCACCTATTGGGGAACACTGG
GGTATTGGCACTACATGCAAAAACACACCTG
TACCTCCAGGAGACTGCCCCCCCCTGGAACTT
GTATCCTCTGTCATTCAGGATGGCGATATGAT
TGATACAGGGTTTGGAGCTATGGATTTCGCT
GCCCTACAGGCCACCAAATCAGACGTCCCTTT
GGATATTTCACAGTCTGTTTGTAAATATCCTG
ATTATTTAAAAATGTCTGCAGACACATATGGT
AATTCCATGTTTTTTCATTTACGCAGGGAGCA
AATCTTTGCTAGGCACTATTATAATAAACTTG
TAGGTGTTGGGGAAGACATTCCTAACGATTA
TTATATTAAGGGTAGTGGTAATGGCCGTGAC
CCTATAGAAAGTTATATATACTCTGCTACTCC
CAGTGGGTCTATGATAACATCTGATTCTCAAA
TTTTTAATAAGCCTTATTGGCTCCACCGTGCG
CAGGGTCACAATAATGGCATTTGCTGGAACA
ATCAGCTTTTTATTACCTGTGTTGATACTACCA
GAAGTACAAATTTAACTATTAGCACTGCCACT
GCTGCGGTTTCCCCAACATTTACTCCAAGTAA
CTTTAAGCAATATATTAGGCATGGGGAAGAG
TATGAATTGCAATTTATTTTTCAATTATGTAAA
ATTACTTTAACTACAGAGGTAATGGCTTATTT
ACACACAATGGATCCTACCATTCTTGAACAGT
GGAATTTTGGATTAACATTACCTCCGTCTGCT
AGTTTGGAGGATGCATATAGGTTTGTTAGAA
ATGCAGCTACTAGCTGTCAAAAGGACACCCC
TCCACAGGCTAAGCCAGATCCTTTGGCCAAAT
ATAAATTTTGGGATGTTGATTTAAAGGAACG
ATTTTCTTTAGATTTAGACCAATTTGCATTGG
GTCGCAAGTTTTTGTTGCAGGTTGGCGTACA
ACGCAAGCCCAGACCAGGCCTTAAACGCCCG
GCCTCATCGGCATCCTCTTCCTCTTCCTCTTCA
GCCAAACGTAAACGTGTTAAAAAGTAA
ATGAGAGGCAACGTCCCCCAGCTGAAAGATG
TCGTCCTGCACCTGACTCCACAGACCGAGATT
GACCTGCAGTGTTATGAACAGTTTGACAGCT
CCGAGGAAGAGGACGAAGTGGATAACATGC
GAGATCAGCTGCCAGAGCGAAGAGCAGGAC
AGGCTACCTGCTACAGGATCGAAGCACCTTG
CTGTCGCTGTTCTAGTGTGGTCCAGCTGGCC
GTGGAGTCAAGCGGCGACACACTGAGAGTG
GTCCAGCAGATGCTGATGGGGGAACTGTCTC
TGGTCTGCCCCTGCTGTGCTAACAAT
ATGGCTCTGTGGCGAACTAATGACTCTAAGG
TCTATCTGCCCCCTGCCCCCGTCTCCCGAATC
GTGAATACTGAAGAATACATCACCCGGACAG
GAATCTACTATTACGCTGGCAGCTCCAGACTG
ATCACTCTGGGGCACCCTTATTTCCCTATTCC
AAAGACCTCAACACGGGCCGCTATCCCAAAA
GTGTCTGCATTCCAGTACCGAGTCTTTCGGGT
GCAGCTGCCCGACCCTAACAAGTTTGGACTG
CCAGATCCCAACCTGTATAATCCAGACACAG
ATCGACTGGTCTGGGGATGCGTGGGAGTCG
AAGTGGGACGAGGGCAGCCTCTGGGAGTGG
GCCTGAGTGGACACCCACTGTTCAACAAGTA
CGACGATACTGAAAATTCTAGGATCGCTAAC
GGCAATGCACAGCAGGACGTCCGCGATAACA
CCTCCGTGGACAATAAGCAGACACAGCTGTG
CATCATTGGGTGTGCCCCCCCTATCGGAGAG
CATTGGGGGATTGGAACCACATGCAAAAATA
CCCCCGTGCCACCAGGCGATTGTCCTCCACTG
GAACTGGTCTCTAGTGTGATCCAGGACGGGG
ATATGATTGACACCGGCTTCGGGGCTATGGA
TTTTGCAGCCCTGCAGGCAACAAAGTCCGAC
GTCCCACTGGATATTAGTCAGTCAGTGTGTA
AGTATCCCGACTACCTGAAAATGTCTGCCGAT
ACCTACGGCAACAGCATGTTCTTTCACCTGCG
GAGAGAGCAGATTTTCGCTCGGCATTATTAC
AATAAGCTGGTCGGAGTGGGCGAAGACATC
CCCAACGATTATTACATCAAGGGGAGCGGAA
ATGGCAGAGACCCCATCGAGTCCTATATCTAC
TCTGCAACTCCTAGCGGCTCCATGATCACCTC
TGATAGTCAGATTTTCAACAAGCCTTACTGGC
TGCACCGAGCACAGGGACATAACAATGGGAT
CTGCTGGAACAATCAGCTGTTTATTACCTGTG
TGGACACTACCCGAAGTACCAACCTGACAAT
TTCAACTGCCACCGCTGCAGTGAGCCCAACAT
TCACTCCCTCCAATTTTAAGCAGTATATCAGG
CACGGCGAGGAATACGAGCTGCAGTTCATCT
TTCAGCTGTGCAAAATTACTCTGACAACTGAA
GTGATGGCCTACCTGCATACTATGGACCCTAC
CATCCTGGAACAGTGGAACTTCGGACTGACC
CTGCCACCTTCAGCCAGCCTGGAGGATGCTT
ATAGATTTGTGAGGAATGCCGCTACATCCTG
TCAGAAGGACACTCCACCCCAGGCAAAACCT
GATCCACTGGCCAAGTACAAATTCTGGGACG
TGGATCTGAAGGAACGGTTCAGCCTGGACCT
GGACCAGTTCGCCCTGGGCAGGAAATTTCTG
CTGCAGGTCGGAGTGCAGCGAAAGCCACGA
CCTGGACTGAAACGCCCCGCATCAAGCGCCT
CCTCTAGTTCAAGCTCCTCTGCTAAGAGGAAA
CGCGTGAAGAAA
MRGNVPQLKDVVLHLTPQTEIDLQC
YEQFDSSEEEDEVDNMRDQLPERRA
GQATCYRIEAPCCRCSSVVQLAVESS
GDTLRVVQQMLMGELSLVCPCCAN
N*X
HPV51 L1
Page 39 of 48
MALWRTNDSKVYLPPAPVSRIVNTE
EYITRTGIYYYAGSSRLITLGHPYFPIPK
TSTRAAIPKVSAFQYRVFRVQLPDPN
KFGLPDPNLYNPDTDRLVWGCVGV
EVGRGQPLGVGLSGHPLFNKYDDTE
NSRIANGNAQQDVRDNTSVDNKQT
QLCIIGCAPPIGEHWGIGTTCKNTPV
PPGDCPPLELVSSVIQDGDMIDTGF
GAMDFAALQATKSDVPLDISQSVCK
YPDYLKMSADTYGNSMFFHLRREQI
FARHYYNKLVGVGEDIPNDYYIKGSG
NGRDPIESYIYSATPSGSMITSDSQIF
NKPYWLHRAQGHNNGICWNNQLFI
TCVDTTRSTNLTISTATAAVSPTFTPS
NFKQYIRHGEEYELQFIFQLCKITLTTE
VMAYLHTMDPTILEQWNFGLTLPPS
ASLEDAYRFVRNAATSCQKDTPPQA
KPDPLAKYKFWDVDLKERFSLDLDQ
FALGRKFLLQVGVQRKPRPGLKRPAS
SASSSSSSSAKRKRVKK
% Nucleotide
Change
25.9%
24.8%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV51 L2
ATGGTGGCTACACGTGCACGGCGTCGGAAG
CGAGCATCTGTAACACAATTATATTCTACATG
CAAAGCTGCTGGTACATGTCCTCCTGATGTTG
TGAATAAGGTTGAAGGTACTACATTGGCCGA
TAAAATATTACAGTGGAGTGGGTTGGGTATA
TTTTTGGGTGGCCTAGGTATTGGTACTGGGT
CTGGATCTGGGGGGCGTACTGGATATATCCC
TTTAGGTGGTGGGGGTCGCCCAGGCGTGGT
GGATATTGCTCCTGCAAGGCCACCTATTATAA
TTGACCTATGGCACCATACTGAACCTTCTATA
GTAAATTTGGTTGAGGACTCTAGTATTATTCA
GTCTGGGTCTCCTATACCTACCTTTACTGGTA
CCGATGGCTTTGAAATTACTTCATCTTCCACA
ACAACCCCTGCTGTGTTGGACATCACCCCATC
TGCTGGTACTGTACATGTTTCTAGTACTAACA
TTGAAAATCCTTTATATATTGAACCTCCATCCA
TTGAGGCTCCACAATCTGGAGAAGTGTCAGA
TATATATTTACTAGTACACTACTCTGGTACTC
ATGGGTATGAAGAAATACCTATGGAAGTGTT
TGCATCCAATGTCAGTACTGGTACTGAACCTA
TTAGCAGCACACCTACTCCAGGGGTTAGTCG
CATAGCTGCTCCCCGCTTGTATAGTAAGTCCT
ACACACAGGTTAAAGTTACAAATCCTGATTTT
ATTAGTAAGCCATCCACATTTGTTACATTTAA
TAATCCTGCTTTTGAGCCTATTGACACATCCA
TAACTTTTGAGGAACCTGATGCTGTTGCACCT
GATCCTGATTTTCTGGATATTATTACACTGCA
CCGCCCTGCCCTTACATCTCGTAGAGGCACA
GTACGCTTTAGTAGGTTAGGTCAAAAGGCCA
CCATGCGCACTCGTAGTGGCAAACAAATTGG
TGCTCGTGTACATTATTATCATGATATTAGTA
GAATTGCACCAGCTGATGAACTTGAAATGCA
GCCTTTACTTTCACCTTCTAATAATTATAGTTA
TGACATTTATGCTGATTTAGATGAAGCTGAAA
CAGGTTTTATACAGCCCACACACACCACACCT
ATGTCACACTCCTCTTTGTCTAGGCAGTTGCC
CTCCTTATCTTCATCTATGTCTTCATCTTATGC
AAATGTTACTATTCCATTTTCAACTACATATTC
TGTTCCTATTCATACAGGGCCTGATGTGGTAT
TGCCCACATCTCCTACAGTATGGCCTTATGTT
CCCCACACTTCCATTGACACCAAGCATTCTAT
TGTTATACTAGGTGGGGATTACTATTTGTGGC
CCTATACACATTTACTACGCAAACGCCGTAAA
CGTATACCCTATTTTTTTACAGATGGCATTGT
GGCGCACTAA
ATGGTCGCAACTCGGGCAAGAAGAAGGAAA
AGAGCCTCAGTCACTCAGCTGTATAGCACCT
GTAAAGCCGCCGGAACCTGTCCCCCAGACGT
GGTCAACAAGGTGGAGGGCACCACACTGGC
CGATAAAATTCTGCAGTGGAGCGGGCTGGG
AATTTTCCTGGGAGGACTGGGAATCGGGACA
GGATCTGGCAGTGGAGGCCGGACTGGATAC
ATTCCACTGGGAGGAGGCGGGCGACCTGGA
GTGGTGGACATCGCACCAGCCCGCCCCCCTA
TCATTATCGATCTGTGGCACCATACCGAGCCC
AGCATCGTGAATCTGGTCGAAGACAGCTCCA
TTATCCAGTCAGGCAGCCCTATTCCAACCTTC
ACAGGGACTGACGGATTTGAGATCACCTCTA
GTTCAACTACCACACCCGCTGTGCTGGATATT
ACTCCTTCCGCAGGGACCGTGCACGTCAGCT
CCACAAACATCGAGAATCCACTGTACATTGA
GCCACCCTCTATCGAAGCACCCCAGTCAGGA
GAAGTGAGCGATATCTACCTGCTGGTCCACT
ATAGTGGCACCCACGGCTATGAGGAAATCCC
TATGGAGGTGTTCGCCAGCAACGTCTCCACC
GGAACAGAACCTATTTCTAGTACTCCAACCCC
TGGCGTGTCTCGCATCGCAGCTCCTCGACTGT
ACTCCAAGTCTTATACACAGGTGAAAGTCACT
AATCCTGACTTTATCTCTAAGCCAAGTACATT
CGTGACTTTTAACAATCCAGCTTTCGAGCCCA
TTGACACCTCCATCACATTTGAGGAACCTGAT
GCTGTGGCACCAGACCCCGATTTCCTGGATA
TTATCACCCTGCACAGACCAGCACTGACCAGT
CGGAGAGGGACAGTGAGATTTTCAAGGCTG
GGACAGAAGGCCACTATGCGAACCCGGAGC
GGAAAACAGATCGGCGCTAGGGTGCACTACT
ATCATGACATTTCCCGCATCGCCCCCGCTGAT
GAGCTGGAAATGCAGCCTCTGCTGAGTCCAT
CAAACAATTACAGCTATGACATCTACGCAGA
CCTGGATGAGGCCGAAACTGGCTTCATCCAG
CCCACCCACACTACCCCTATGAGTCATTCAAG
CCTGTCAAGGCAGCTGCCAAGCCTGTCCTCTA
GTATGTCAAGCTCCTACGCCAACGTGACCATT
CCCTTTTCCACAACTTATTCTGTCCCAATCCAT
ACAGGGCCCGACGTGGTCCTGCCTACATCCC
CAACTGTGTGGCCCTATGTCCCTCACACCTCC
ATCGACACAAAACATTCTATTGTGATCCTGGG
AGGCGATTACTATCTGTGGCCTTACACTCACC
TGCTGCGGAAGAGGCGCAAAAGAATCCCCTA
TTTCTTTACAGATGGCATCGTGGCTCAT
MVATRARRRKRASVTQLYSTCKAAG
TCPPDVVNKVEGTTLADKILQWSGL
GIFLGGLGIGTGSGSGGRTGYIPLGG
GGRPGVVDIAPARPPIIIDLWHHTEP
SIVNLVEDSSIIQSGSPIPTFTGTDGFE
ITSSSTTTPAVLDITPSAGTVHVSSTNI
ENPLYIEPPSIEAPQSGEVSDIYLLVHY
SGTHGYEEIPMEVFASNVSTGTEPIS
STPTPGVSRIAAPRLYSKSYTQVKVT
NPDFISKPSTFVTFNNPAFEPIDTSITF
EEPDAVAPDPDFLDIITLHRPALTSRR
GTVRFSRLGQKATMRTRSGKQIGAR
VHYYHDISRIAPADELEMQPLLSPSN
NYSYDIYADLDEAETGFIQPTHTTPM
SHSSLSRQLPSLSSSMSSSYANVTIPF
STTYSVPIHTGPDVVLPTSPTVWPYV
PHTSIDTKHSIVILGGDYYLWPYTHLL
RKRRKRIPYFFTDGIVAH
Page 40 of 48
% Nucleotide
Change
28.1%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV52 E1
ATGGAGGACCCTGAAGGTACAGAGGGCGAA
AGGGAGGGATGTACAGGCTGGTTTGAAGTA
GAGGCAATAATAGAAAAACAAACAGGAGAT
AACATTTCAGAGGACGAGGATGAAAATGCAT
ATGATAGTGGAACAGATCTAATAGATTTTATA
GATGATTCAAATATAAATAATGAACAGGCAG
AACATGAGGCAGCCCGGGCATTGTTTAATGC
ACAGGAAGGGGAGGATGATTTACATGCTGT
GTCTGCAGTAAAACGAAAGTTTACAAGCAGT
CCGGAAAGTGCTGGGCAAGATGGTGTAGAA
AAACATGGTAGTCCGCGTGCAAAACACATTT
GTGTAAATACAGAGTGTGTTTTACCAAAACG
CAAACCATGTCACGTAGAAGACAGCGGCTAT
GGCAATAGTGAAGTGGAAGCGCAGCAGATG
GCAGACCAGGTAGACGGGCAAAATGGCGAC
TGGCAAAGTAACAGTAGTCAATCAAGTGGGG
TGGGGGCTAGTAATTCAGATGTAAGTTGTAC
TAGTATAGAGGACAATGAGGAAAATAGTAAT
AGAACGCTAAAAAGCATACAAAATATTATGT
GCGAAAATAGCATAAAAACAACTGTATTATTT
AAATTTAAAGAAACATATGGTGTTAGCTTTAT
GGAATTAGTAAGACCATTTAAAAGTAATAGA
AGTAGTTGTACAGATTGGTGTATTATAGGAA
TGGGAGTAACACCATCAGTTGCAGAAGGATT
AAAAGTATTAATACAGCCCTATAGCATATATG
CCCATTTGCAATGTTTAACATGTGACAGAGGC
GTGCTTATACTGCTGCTAATTAGGTTTAAATG
TGGAAAAAACAGATTAACAGTGTCCAAACTA
ATGTCACAGCTGTTAAATATACCAGAAACAC
ATATGGTAATAGAACCACCAAAATTACGAAG
TGCTACCTGTGCATTATATTGGTATAGAACAG
GTTTGTCTAATATTAGTGAGGTATATGGTACC
ACCCCAGAATGGATAGAACAACAAACAGTAT
TACAGCATAGCTTTGACAATAGCATATTCGAT
TTTGGAGAAATGGTGCAATGGGCATATGATC
ATGATATAACAGATGATAGTGACATAGCATA
TAAATATGCACAGTTAGCAGATGTAAATAGC
AATGCTGCAGCATTCCTAAAAAGCAATTCGC
AAGCAAAAATAGTAAAGGACTGTGCAACCAT
GTGTAGACATTATAAACGGGCAGAAAGAAA
ACATATGAATATTGGACAATGGATACAGTAT
AGATGTGATAGAATAGATGATGGTGGAGATT
GGAGGCCTATAGTAAGATTTTTAAGATATCA
AGACATAGAATTTACAGCCTTTTTAGACGCAT
TTAAAAAATTTTTAAAAGGTATACCTAAAAAA
AATTGTTTAGTATTATATGGACCTGCAAACAC
AGGAAAATCATATTTTGGAATGAGTTTAATTA
GGTTCTTAAGTGGATGTGTAATATCCTATGTA
AACTCAAAAAGCCATTTTTGGCTACAACCATT
AACAGATGCAAAAGTGGGTATGATAGATGAT
GTAACACCTATATGTTGGACATATATAGATGA
TTATATGAGAAATGCACTGGATGGAAATGAT
ATATCAGTAGATGTAAAGCATAGAGCCTTAG
TACAAATAAAATGCCCACCATTAATTTTAACA
ACAAATACAAATGCAGGAACAGATCCTAGGT
GGCCATATTTACATAGTAGATTGGTTGTGTTT
CATTTCAAAAACCCATTTCCATTTGATGAAAA
TGGCAATCCTATATATGAAATTAACAACGAA
AATTGGAAATCCTTTTTCTCAAGGACGTGGTG
CAAATTAGATTTAATACAGGAAGAGGACAAG
GAAAACGATGGAGTCGATACCGGCACGTTTA
AATGCAGTGCAGGAAAAAATACTAGATCTAT
ACGAAGCTGA
ATGGAAGACCCCGAGGGAACTGAAGGAGAA
AGAGAAGGATGCACAGGCTGGTTTGAGGTG
GAGGCTATCATTGAAAAGCAGACCGGAGAC
AACATTTCAGAGGACGAAGATGAGAATGCTT
ACGATAGCGGCACCGACCTGATCGATTTCATT
GACGATAGCAACATCAACAATGAGCAGGCA
GAACACGAGGCAGCTCGGGCACTGTTCAATG
CCCAGGAAGGAGAGGACGATCTGCATGCAG
TGTCCGCCGTCAAGAGAAAGTTCACCAGCAG
CCCAGAAAGTGCAGGACAGGACGGAGTGGA
GAAGCACGGCTCACCCAGAGCTAAACATATC
TGCGTGAACACCGAATGTGTCCTGCCAAAGA
GGAAACCCTGCCACGTGGAGGACTCCGGATA
CGGCAATTCTGAAGTGGAGGCTCAGCAGATG
GCAGACCAGGTCGATGGGCAGAACGGAGAT
TGGCAGAGCAATTCTAGTCAGTCAAGCGGGG
TGGGAGCCAGTAACTCAGACGTCTCATGCAC
AAGCATTGAAGATAATGAGGAAAACTCTAAT
CGGACTCTGAAAAGTATCCAGAACATTATGT
GTGAGAACAGCATCAAGACCACAGTGCTGTT
CAAGTTTAAAGAAACCTACGGCGTGAGCTTC
ATGGAGCTGGTCCGCCCTTTTAAGTCTAACCG
ATCCTCTTGCACTGACTGGTGTATCATTGGAA
TGGGAGTGACCCCAAGCGTCGCAGAGGGAC
TGAAGGTGCTGATCCAGCCTTACTCCATCTAC
GCCCACCTGCAGTGCCTGACTTGTGATAGAG
GGGTGCTGATCCTGCTGCTGATTCGGTTTAA
GTGCGGAAAAAACAGACTGACCGTGTCTAAA
CTGATGAGTCAGCTGCTGAATATCCCCGAAA
CACACATGGTCATCGAGCCCCCTAAGCTGCG
ATCTGCTACCTGTGCACTGTACTGGTATCGGA
CAGGACTGTCCAACATTTCTGAAGTGTACGG
CACTACCCCTGAATGGATCGAGCAGCAGACA
GTCCTGCAGCACTCATTCGACAATAGCATCTT
CGATTTTGGCGAGATGGTGCAGTGGGCTTAT
GACCATGATATCACTGACGATTCTGACATTGC
ATACAAATATGCCCAGCTGGCTGATGTGAAC
AGTAATGCAGCCGCTTTTCTGAAGAGCAACT
CCCAGGCAAAGATCGTCAAAGACTGCGCCAC
CATGTGTAGGCACTACAAGCGGGCCGAGAG
AAAACACATGAATATCGGCCAGTGGATTCAG
TATAGGTGCGACCGAATCGACGATGGAGGG
GATTGGCGACCAATTGTGCGATTCCTGAGAT
ACCAGGACATCGAGTTCACCGCCTTTCTGGAT
GCTTTCAAGAAATTTCTGAAAGGCATCCCCAA
GAAGAACTGCCTGGTGCTGTACGGACCAGCT
AATACAGGCAAGAGTTATTTCGGGATGTCAC
TGATCAGGTTTCTGAGCGGCTGTGTGATTTCC
TATGTCAACTCTAAAAGTCACTTTTGGCTGCA
GCCTCTGACAGACGCCAAAGTGGGGATGATT
GACGATGTCACACCAATCTGCTGGACTTACAT
TGACGATTATATGCGCAACGCTCTGGACGGA
AATGATATCTCTGTGGATGTCAAGCATCGAG
CACTGGTGCAGATCAAATGTCCACCCCTGATT
CTGACAACTAACACTAATGCAGGCACCGACC
CCAGGTGGCCTTACCTGCACAGCCGCCTGGT
GGTCTTCCATTTTAAGAACCCTTTCCCATTTGA
TGAAAACGGGAACCCCATCTATGAAATTAAC
AACGAGAACTGGAAGAGTTTCTTTTCACGCA
CTTGGTGCAAACTGGACCTGATTCAGGAGGA
AGATAAGGAGAACGACGGCGTGGATACCGG
GACATTCAAGTGTAGCGCCGGCAAAAATACC
AGAAGCATCAGGTCC
MEDPEGTEGEREGCTGWFEVEAIIE
KQTGDNISEDEDENAYDSGTDLIDFI
DDSNINNEQAEHEAARALFNAQEGE
DDLHAVSAVKRKFTSSPESAGQDGV
EKHGSPRAKHICVNTECVLPKRKPCH
VEDSGYGNSEVEAQQMADQVDGQ
NGDWQSNSSQSSGVGASNSDVSCT
SIEDNEENSNRTLKSIQNIMCENSIKT
TVLFKFKETYGVSFMELVRPFKSNRS
SCTDWCIIGMGVTPSVAEGLKVLIQP
YSIYAHLQCLTCDRGVLILLLIRFKCGK
NRLTVSKLMSQLLNIPETHMVIEPPK
LRSATCALYWYRTGLSNISEVYGTTP
EWIEQQTVLQHSFDNSIFDFGEMVQ
WAYDHDITDDSDIAYKYAQLADVNS
NAAAFLKSNSQAKIVKDCATMCRHY
KRAERKHMNIGQWIQYRCDRIDDG
GDWRPIVRFLRYQDIEFTAFLDAFKK
FLKGIPKKNCLVLYGPANTGKSYFGM
SLIRFLSGCVISYVNSKSHFWLQPLTD
AKVGMIDDVTPICWTYIDDYMRNAL
DGNDISVDVKHRALVQIKCPPLILTT
NTNAGTDPRWPYLHSRLVVFHFKNP
FPFDENGNPIYEINNENWKSFFSRT
WCKLDLIQEEDKENDGVDTGTFKCS
AGKNTRSIRS
Page 41 of 48
% Nucleotide
Change
25.5%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV52 E2
ATGGAGTCGATACCGGCACGTTTAAATGCAG
TGCAGGAAAAAATACTAGATCTATACGAAGC
TGATAGTAATGACCTAAACGCACAAATTGAA
CATTGGAAATTGACTCGAATGGAATGTGTTTT
GTTTTACAAAGCAAAGGAACTGGGAATAACT
CATATAGGCCACCAGGTGGTGCCACCAATGG
CAGTGTCTAAGGCAAAGGCCTGCCAAGCTAT
TGAACTACAATTGGCATTGGAGGCATTAAAC
AAAACACAATATAGCACAGATGGATGGACAT
TACAACAAACAAGTCTAGAAATGTGGCGTGC
AGAACCACAAAAATACTTTAAAAAACATGGG
TATACAATAACAGTGCAATACGATAATGATA
AAAACAATACTATGGATTATACAAACTGGAA
GGAAATTTATTTACTTGGTGAGTGTGAATGT
ACAATTGTAGAAGGACAAGTAGATTACTATG
GGTTATATTATTGGTGTGATGGAGAAAAAAT
ATATTTTGTAAAATTTAGTAACGATGCAAAGC
AATATTGTGTAACAGGAGTATGGGAAGTACA
TGTGGGTGGTCAGGTAATTGTTTGTCCTGCAT
CTGTATCTAGTAACGAAGTATCCACTACTGAA
ACTGCTGTCCACCTATGCACCGAAACCTCCAA
GACCTCCGCAGTGTCCGTGGGTGCCAAAGAC
ACACACCTACAACCACCACAGAAACGACGAC
GACCAGACGTCACAGACTCCAGAAACACCAA
GTACCCCAACAACCTTTTGCGGGGACAACAA
TCCGTGGACAGTACTACACGGGGACTCGTCA
CTGCAACTGAGTGCACAAACAAAGGACGGGT
TGCACATACAACTTGTACTGCACCTATAATAC
ACCTAAAAGGTGATCCTAATAGTTTAAAATGT
TTAAGATATAGGGTAAAAACACATAAAAGTT
TGTATGTTCAAATTTCATCTACCTGGCATTGG
ACCAGTAATGAATGTACAAATAATAAACTAG
GTATTGTAACAATAACGTACAGTGATGAAAC
ACAACGTCAACAATTTTTAAAAACTGTTAAAA
TACCAAATACTGTGCAAGTTATACAAGGTGTC
ATGTCATTGTGA
TTGTTTGTCCTGCATCTGTATCTAGTAACGAA
GTATCCACTACTGAAACTGCTGTCCACCTATG
CACCGAAACCTCCAAGACCTCCGCAGTGTCC
GTGGGTGCCAAAGACACACACCTACAACCAC
CACAGAAACGACGACGACCAGACGTCACAG
ACTCCAGAAACACCAAGTACCCCAACAACCTT
TTGCGGGGACAACAATCCGTGGACAGTACTA
CACGGGGACTCGTCACTGCAACTGAGTGCAC
AAACAAAGGACGGGTTGCACATACAACTTGT
ACTGCACCTATAA
ATGTTAGGATTATTTGTATTTTGTTTTATTTTG
CTTATGGTGTTTTGTGCAGTGCTTAGGCCGCT
CTTGCTATCTATATCGGTGTATGCGCAGGTGT
TGGTGCTGGTGCTTTTGCTATGGGTATCTATT
GGGTCACCATTTAAAGTGTTTTTTTTGTACCT
ACTGTTTTTATATTTTCCAATGTTTTGTATTCA
CTGTCATGCACAGTATTTGGCACAACTGCAAT
AA
ATGTTTGAGGATCCAGCAACACGACCCCGGA
CCCTGCACGAATTGTGTGAGGTGCTGGAAGA
ATCGGTGCATGAAATAAGGCTGCAGTGTGTG
CAGTGCAAAAAAGAGCTACAACGAAGAGAG
GTATACAAGTTTCTATTTACAGATTTACGAAT
AGTATATAGAGACAATAATCCATATGGCGTG
TGTATTATGTGCCTACGCTTTTTATCTAAGAT
AAGTGAATATAGGCATTATCAATATTCACTGT
ATGGGAAAACATTAGAAGAGAGGGTAAAAA
AACCATTAAGTGAAATAACTATTAGATGTATA
ATTTGTCAAACGCCATTATGTCCTGAAGAAAA
AGAAAGACATGTTAATGCAAACAAGCGATTT
CATAATATTATGGGTCGTTGGACAGGGCGCT
GTTCAGAGTGTTGGAGACCCCGACCTGTGAC
CCAAGTGTAA
ATGGAAAGTATCCCCGCAAGACTGAACGCCG
TGCAGGAAAAAATTCTGGACCTGTATGAAGC
CGACTCAAATGATCTGAACGCCCAGATCGAG
CACTGGAAGCTGACTCGCATGGAATGCGTGC
TGTTCTATAAGGCCAAAGAGCTGGGAATCAC
ACACATTGGCCATCAGGTGGTCCCCCCTATG
GCAGTGAGCAAGGCCAAGGCCTGCCAGGCC
ATTGAGCTGCAGCTGGCACTGGAAGCCCTGA
ACAAAACTCAGTACTCCACCGACGGCTGGAC
ACTGCAGCAGACTTCTCTGGAGATGTGGCGA
GCTGAACCACAGAAGTACTTTAAGAAACACG
GGTATACCATCACAGTGCAGTACGACAACGA
TAAGAACAACACAATGGACTACACAAACTGG
AAGGAAATCTACCTGCTGGGGGAGTGCGAAT
GTACCATTGTGGAAGGGCAGGTGGACTACTA
TGGACTGTACTATTGGTGCGATGGGGAGAAA
ATCTACTTCGTGAAGTTCAGCAACGACGCCA
AGCAGTACTGCGTGACCGGAGTCTGGGAAG
TGCATGTCGGCGGGCAGGTCATCGTCTGTCC
CGCTTCAGTGAGCTCCAATGAGGTCAGCACC
ACAGAAACAGCAGTGCACCTGTGTACTGAGA
CCTCAAAGACTAGCGCTGTGTCCGTCGGCGC
AAAAGATACCCATCTGCAGCCACCCCAGAAG
CGGAGAAGGCCCGACGTGACAGATAGCCGG
AATACTAAATATCCTAACAATCTGCTGAGAG
GCCAGCAGTCTGTGGACAGTACTACCAGGGG
GCTGGTCACAGCCACTGAGTGCACAAACAAG
GGAAGGGTGGCCCACACAACTTGTACTGCTC
CTATCATTCATCTGAAGGGCGATCCAAATAGT
CTGAAATGCCTGCGCTATCGAGTGAAGACCC
ACAAATCACTGTACGTCCAGATCAGCAGCAC
CTGGCATTGGACCAGCAACGAGTGTACCAAC
AATAAGCTGGGAATCGTGACCATTACATACT
CCGACGAAACACAGCGGCAGCAGTTCCTGAA
GACCGTGAAAATCCCTAATACAGTGCAGGTC
ATTCAGGGCGTCATGTCTCTG
ATGTTCGTCCTGCACCTGTACCTGGTCACTAA
ATACCCACTGCTGAAACTGCTGTCAACTTACG
CACCTAAACCTCCTCGCCCACCCCAGTGCCCC
TGGGTGCCTAAGACTCACACCTACAACCACC
ATCGGAATGACGATGACCAGACATCTCAGAC
TCCAGAGACCCCCAGTACACCTACCACATTCT
GTGGCGATAACAATCCCTGGACTGTGCTGCA
TGGGGACAGCTCCCTGCAGCTGTCCGCACAG
ACAAAAGATGGCCTGCACATCCAGCTGGTCC
TGCATCTG
ATGCTGGGACTGTTTGTGTTCTGCTTTATTCT
GCTGATGGTGTTTTGTGCCGTGCTGAGACCC
CTGCTGCTGAGTATTAGCGTGTATGCCCAGG
TGCTGGTCCTGGTGCTGCTGCTGTGGGTCAG
CATCGGCTCCCCCTTTAAGGTGTTCTTTCTGT
ACCTGCTGTTCCTGTATTTTCCTATGTTCTGCA
TTCACTGTCATGCCCAGTACCTGGCTCAGCTG
CAG
ATGTTTGAAGACCCCGCTACAAGACCAAGAA
CCCTGCATGAACTGTGCGAAGTGCTGGAGGA
ATCCGTCCACGAAATCAGACTGCAGTGCGTG
CAGTGTAAGAAAGAGCTGCAGCGGAGAGAA
GTCTACAAGTTCCTGTTTACAGACCTGCGAAT
CGTGTACCGGGATAACAATCCTTATGGAGTC
TGCATCATGTGTCTGAGGTTCCTGAGCAAGA
TTTCCGAGTACCGCCACTACCAGTATTCTCTG
TATGGCAAAACCCTGGAGGAACGGGTGAAG
AAACCCCTGAGTGAGATCACCATTAGATGCA
TCATTTGTCAGACACCACTGTGCCCCGAGGA
AAAGGAACGCCACGTGAACGCCAACAAGCG
ATTTCATAACATTATGGGCAGATGGACTGGG
AGGTGCTCCGAATGTTGGAGGCCCCGCCCTG
T
MESIPARLNAVQEKILDLYEADSNDL
NAQIEHWKLTRMECVLFYKAKELGIT
HIGHQVVPPMAVSKAKACQAIELQL
ALEALNKTQYSTDGWTLQQTSLEM
WRAEPQKYFKKHGYTITVQYDNDK
NNTMDYTNWKEIYLLGECECTIVEG
QVDYYGLYYWCDGEKIYFVKFSNDA
KQYCVTGVWEVHVGGQVIVCPASV
SSNEVSTTETAVHLCTETSKTSAVSV
GAKDTHLQPPQKRRRPDVTDSRNTK
YPNNLLRGQQSVDSTTRGLVTATEC
TNKGRVAHTTCTAPIIHLKGDPNSLK
CLRYRVKTHKSLYVQISSTWHWTSN
ECTNNKLGIVTITYSDETQRQQFLKT
VKIPNTVQVIQGVMSL
HPV52 E4
HPV52 E5
HPV52 E6
Page 42 of 48
% Nucleotide
Change
25.7%
LFVLHLYLVTKYPLLKLLSTYAPKPPRP
PQCPWVPKTHTYNHHRNDDDQTS
QTPETPSTPTTFCGDNNPWTVLHG
DSSLQLSAQTKDGLHIQLVLHL
23.1%
MLGLFVFCFILLMVFCAVLRPLLLSIS
VYAQVLVLVLLLWVSIGSPFKVFFLYL
LFLYFPMFCIHCHAQYLAQLQ
23.0%
MFEDPATRPRTLHELCEVLEESVHEI
RLQCVQCKKELQRREVYKFLFTDLRI
VYRDNNPYGVCIMCLRFLSKISEYRH
YQYSLYGKTLEERVKKPLSEITIRCIICQ
TPLCPEEKERHVNANKRFHNIMGR
WTGRCSECWRPRPVTQV
27.5%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV52 E7
ATGCGTGGAGACAAAGCAACTATAAAAGATT
ATATATTAGATCTGCAACCTGAAACAACTGAC
CTACACTGCTATGAGCAATTAGGTGACAGCT
CAGATGAGGAGGATACAGATGGTGTGGACC
GGCCAGATGGACAAGCAGAACAAGCCACAA
GCAATTACTACATTGTGACATATTGTCACAGT
TGTGATAGCACACTACGGCTATGCATTCATAG
CACTGCGACGGACCTTCGTACTCTACAGCAA
ATGCTGTTGGGCACATTACAAGTTGTGTGCC
CCGGCTGTGCACGGCTATAA
ATGTCCGTGTGGCGGCCTAGTGAGGCCACTG
TGTACCTGCCTCCTGTACCTGTCTCTAAGGTT
GTAAGCACTGATGAGTATGTGTCTCGCACAA
GCATCTATTATTATGCAGGCAGTTCTCGATTA
CTAACAGTAGGACATCCCTATTTTTCTATTAA
AAACACCAGTAGTGGTAATGGTAAAAAAGTT
TTAGTTCCCAAGGTGTCTGGCCTGCAATACA
GGGTATTTAGAATTAAATTGCCGGACCCTAAT
AAATTTGGTTTTCCAGATACATCTTTTTATAAC
CCAGAAACCCAAAGGTTGGTGTGGGCCTGTA
CAGGCTTGGAAATTGGTAGGGGACAGCCTTT
AGGTGTGGGTATTAGTGGGCATCCTTTATTA
AACAAGTTTGATGATACTGAAACCAGTAACA
AATATGCTGGTAAACCTGGTATAGATAATAG
GGAATGTTTATCTATGGATTATAAGCAGACTC
AGTTATGCATTTTAGGATGCAAACCTCCTATA
GGTGAACATTGGGGTAAGGGAACCCCTTGTA
ATAATAATTCAGGAAATCCTGGGGATTGTCCT
CCCCTACAGCTCATTAACAGTGTAATACAGGA
TGGGGACATGGTAGATACAGGATTTGGTTGC
ATGGATTTTAATACCTTGCAAGCTAGTAAAAG
TGATGTGCCCATTGATATATGTAGCAGTGTAT
GTAAGTATCCAGATTATTTGCAAATGGCTAGC
GAGCCATATGGTGACAGTTTGTTCTTTTTTCT
TAGACGTGAGCAAATGTTTGTTAGACACTTTT
TTAATAGGGCCGGTACCTTAGGTGACCCTGT
GCCAGGTGATTTATATATACAAGGGTCTAACT
CTGGCAATACTGCCACTGTACAAAGCAGTGC
TTTTTTTCCTACTCCTAGTGGTTCTATGGTAAC
CTCAGAATCCCAATTATTTAATAAACCGTACT
GGTTACAACGTGCGCAGGGCCACAATAATGG
CATATGTTGGGGCAATCAGTTGTTTGTCACA
GTTGTGGATACCACTCGTAGCACTAACATGA
CTTTATGTGCTGAGGTTAAAAAGGAAAGCAC
ATATAAAAATGAAAATTTTAAGGAATACCTTC
GTCATGGCGAGGAATTTGATTTACAATTTATT
TTTCAATTGTGCAAAATTACATTAACAGCTGA
TGTTATGACATACATTCATAAGATGGATGCCA
CTATTTTAGAGGACTGGCAATTTGGCCTTACC
CCACCACCGTCTGCATCTTTGGAGGACACATA
CAGATTTGTCACTTCTACTGCTATAACTTGTC
AAAAAAACACACCACCTAAAGGAAAGGAAG
ATCCTTTAAAGGACTATATGTTTTGGGAGGT
GGATTTAAAAGAAAAGTTTTCTGCAGATTTA
GATCAGTTTCCTTTAGGTAGGAAGTTTTTGTT
ACAGGCAGGGCTACAGGCTAGGCCCAAACTA
AAACGCCCTGCATCATCGGCCCCACGTACCTC
CACAAAGAAGAAAAAGGTTAAAAGGTAA
ATGAGAGGAGACAAAGCCACCATCAAGGATT
ACATTCTGGACCTGCAGCCTGAGACAACTGA
CCTGCATTGCTATGAACAGCTGGGGGACAGC
TCCGATGAGGAAGACACCGATGGAGTGGAC
AGGCCAGATGGACAGGCAGAGCAGGCTACT
AGCAACTACTATATCGTCACCTACTGCCACTC
TTGTGACAGTACACTGCGGCTGTGCATTCATT
CTACCGCAACAGATCTGAGAACACTGCAGCA
GATGCTGCTGGGAACTCTGCAGGTGGTCTGC
CCTGGCTGTGCCCGGCTG
ATGGTCCAGATCCTGTTTTATATCCTGGTCAT
TTTCTATTATGTCGCCGGGGTCAACGTCTTTC
ACATTTTCCTGCAGATGAGCGTCTGGAGGCC
TAGCGAGGCCACCGTGTATCTGCCCCCTGTG
CCAGTCTCTAAGGTGGTCAGTACAGACGAAT
ACGTGAGCCGGACTTCCATCTACTATTACGCT
GGAAGCTCCAGACTGCTGACAGTGGGCCACC
CCTACTTTTCTATTAAGAATACTTCTAGTGGC
AACGGGAAGAAAGTGCTGGTCCCTAAAGTG
AGTGGACTGCAGTATAGGGTCTTTCGCATCA
AGCTGCCAGACCCCAACAAGTTTGGCTTCCCA
GATACCAGCTTCTACAACCCCGAGACACAGA
GGCTGGTGTGGGCTTGCACCGGCCTGGAAAT
CGGACGAGGACAGCCACTGGGGGTCGGAAT
TAGTGGGCACCCTCTGCTGAATAAGTTCGAC
GATACTGAGACCTCAAACAAGTATGCCGGGA
AACCTGGAATTGACAATCGCGAATGTCTGAG
CATGGACTACAAACAGACCCAGCTGTGCATC
CTGGGCTGTAAGCCACCCATTGGGGAGCATT
GGGGCAAAGGGACACCTTGCAACAATAACA
GCGGCAATCCAGGGGACTGTCCTCCACTGCA
GCTGATCAACTCCGTGATTCAGGACGGCGAT
ATGGTGGACACCGGATTTGGCTGCATGGATT
TCAACACACTGCAGGCTAGCAAGTCCGACGT
GCCCATCGATATTTGCTCAAGCGTCTGTAAAT
ATCCAGACTACCTGCAGATGGCATCAGAGCC
CTATGGCGATAGCCTGTTCTTTTTCCTGCGGA
GAGAACAGATGTTCGTGCGACACTTTTTCAAT
CGAGCAGGAACCCTGGGCGACCCTGTCCCAG
GGGATCTGTACATCCAGGGGTCTAATAGTGG
AAACACAGCTACTGTGCAGTCCTCTGCATTTT
TCCCCACTCCTTCAGGAAGCATGGTCACCTCC
GAGTCTCAGCTGTTTAACAAGCCCTATTGGCT
GCAGCGAGCACAGGGCCATAATAACGGGAT
TTGCTGGGGAAATCAGCTGTTCGTGACTGTG
GTCGATACCACACGCTCCACCAACATGACACT
GTGTGCCGAGGTGAAGAAAGAATCTACATAC
AAGAACGAGAACTTCAAGGAATACCTGAGGC
ACGGCGAGGAGTTCGACCTGCAGTTTATCTT
CCAGCTGTGCAAGATTACCCTGACAGCCGAT
GTGATGACATACATCCATAAAATGGACGCTA
CTATTCTGGAGGATTGGCAGTTTGGCCTGAC
TCCCCCTCCAAGTGCATCACTGGAAGACACCT
ATCGGTTCGTGACTTCTACCGCCATCACTTGT
CAGAAGAATACCCCCCCTAAGGGGAAAGAG
GACCCACTGAAAGATTACATGTTTTGGGAGG
TGGATCTGAAGGAAAAATTCAGCGCCGACCT
GGATCAGTTTCCCCTGGGGAGAAAGTTCCTG
CTGCAGGCAGGACTGCAGGCCAGACCAAAG
CTGAAAAGGCCCGCCAGTTCAGCTCCTCGCA
CAAGCACTAAGAAAAAGAAAGTGAAGCGA
MRGDKATIKDYILDLQPETTDLHCYE
QLGDSSDEEDTDGVDRPDGQAEQA
TSNYYIVTYCHSCDSTLRLCIHSTATD
LRTLQQMLLGTLQVVCPGCARL
HPV52 L1
Page 43 of 48
MSVWRPSEATVYLPPVPVSKVVSTD
EYVSRTSIYYYAGSSRLLTVGHPYFSIK
NTSSGNGKKVLVPKVSGLQYRVFRIK
LPDPNKFGFPDTSFYNPETQRLVWA
CTGLEIGRGQPLGVGISGHPLLNKFD
DTETSNKYAGKPGIDNRECLSMDYK
QTQLCILGCKPPIGEHWGKGTPCNN
NSGNPGDCPPLQLINSVIQDGDMV
DTGFGCMDFNTLQASKSDVPIDICSS
VCKYPDYLQMASEPYGDSLFFFLRRE
QMFVRHFFNRAGTLGDPVPGDLYIQ
GSNSGNTATVQSSAFFPTPSGSMVT
SESQLFNKPYWLQRAQGHNNGICW
GNQLFVTVVDTTRSTNMTLCAEVKK
ESTYKNENFKEYLRHGEEFDLQFIFQL
CKITLTADVMTYIHKMDATILEDWQ
FGLTPPPSASLEDTYRFVTSTAITCQK
NTPPKGKEDPLKDYMFWEVDLKEKF
SADLDQFPLGRKFLLQAGLQARPKLK
RPASSAPRTSTKKKKVKR
% Nucleotide
Change
23.7%
31.1%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV52 L2
ATGAGATACAGACGGTCTACACGGCACAAAC
GTGCTTCTGCAACACAGCTATATCAAACATGC
AAAGCCTCTGGCACCTGCCCCCCCGATGTTAT
TCCTAAAGTGGAAGGCACAACTATTGCAGAT
CAACTTTTAAAATATGGCAGCCTAGGGGTGT
TTTTTGGAGGTTTGGGTATAGGTACAGGTGC
AGGCTCTGGTGGTAGGGCAGGCTATGTGCCA
TTGTCCACTCGTCCTCCCACTAGTAGTATTAC
CACGTCCACCATTCGTCCCCCTGTAACTGTAG
AACCCATTGGTCCCTTAGAACCATCTATAGTT
TCTATGATAGAAGAAACAACATTTATTGAGTC
TGGCGCACCTGCTCCATCTATTCCATCAGCAA
CAGGGTTTGATGTTACAACATCTGCAAATAAT
ACTCCTGCAATAATTAATGTAACATCTATAGG
TGAATCATCTGTACAATCAGTTTCTACACATT
TAAATCCTACATTCACTGAACCATCTATAATA
CAGCCCCCGGCACCTGCAGAAGCATCTGGTC
ATGTATTGTTTTCTAGTCCAACTATTAGTACA
CACACCTATGAAGAAATCCCTATGGATACATT
TGTTACCTCTACTGACAGCAGCAGTGTAACAA
GTAGTACACCTATTCCAGGGTCTCGCCCTACG
ACACGCCTTGGTTTATATAGCCGTGCCACACA
ACAGGTTAAGGTAGTCGACCCTGCTTTTATGT
CATCACCACAGAAATTAGTAACATATAACAAT
CCTGTTTTTGAGGGCGTTGATACAGATGAAA
CTATAATTTTTGATCGTTCACAACTTTTACCTG
CACCGGATCCTGATTTTTTAGACATTATAGCT
TTGCATAGGCCTGCATTAACCTCTCGAAGAG
GTACTGTTAGGTTTAGCAGGCTTGGTAATAA
GGCCACCCTACGTACACGTAGTGGAAAACAA
ATTGGGGCACGGGTACATTATTATCATGATA
TTAGTCCTATCCAGCCTGCTGAAGTTCAGGAA
GACATAGAATTGCAACCTTTATTACCACAGTC
TGTGTCCCCTTACACTATTAATGATGGTTTGT
ATGATGTGTATGCAGATTCTTTGCAGCAACCC
ACGTTTCACTTACCTTCCACACTTTCTACCCAT
AATAATACTTTCACTGTACCTATTAATAGTGG
TATTGACTTTGTATATCAACCCACTATGTCCAT
TGAGTCAGGTCCTGACATTCCATTACCTTCGT
TACCCACACATACTCCTTTTGTTCCTATAGCCC
CTACAGCTCCATCTACATCTATTATTGTTGAT
GGTACAGATTTTATTTTACATCCTAGTTATTTT
TTACTACGTCGCAGGCGTAAACGTTTTCCATA
TTTTTTTACAGATGTCCGTGTGGCGGCCTAG
ATGAGGTATCGGCGAAGCACTCGGCATAAAA
GGGCATCCGCAACCCAGCTGTATCAGACTTG
TAAGGCATCCGGCACTTGTCCCCCAGATGTG
ATCCCCAAGGTCGAAGGCACCACAATTGCTG
ACCAGCTGCTGAAATACGGAAGCCTGGGCGT
GTTCTTTGGAGGACTGGGAATCGGAACCGGA
GCTGGGTCAGGAGGCAGGGCAGGATATGTG
CCTCTGAGCACACGCCCCCCTACTAGCTCCAT
CACTACCTCTACAATTAGGCCACCCGTGACTG
TCGAACCCATCGGCCCTCTGGAGCCATCAATC
GTGAGCATGATTGAGGAAACAACTTTCATCG
AAAGCGGAGCACCAGCACCTTCCATTCCATCT
GCCACCGGATTTGATGTGACCACATCCGCCA
ACAATACCCCTGCTATCATTAACGTCACATCT
ATCGGGGAGTCTAGTGTGCAGAGTGTCTCAA
CCCACCTGAATCCAACATTCACTGAACCCAGT
ATCATTCAGCCTCCAGCTCCCGCAGAGGCCTC
AGGACACGTGCTGTTCTCAAGCCCAACTATCA
GCACCCATACATACGAGGAAATTCCTATGGA
CACCTTTGTGACTAGCACCGACTCCTCTAGTG
TCACTTCAAGCACCCCAATCCCAGGCTCCCGG
CCAACTACCAGACTGGGGCTGTACTCTAGGG
CCACCCAGCAGGTGAAGGTGGTGGACCCCGC
TTTTATGTCCTCTCCTCAGAAACTGGTGACAT
ATAACAATCCCGTGTTCGAAGGCGTGGACAC
AGATGAGACTATCATTTTTGATCGGTCCCAGC
TGCTGCCTGCACCAGACCCCGATTTCCTGGAC
ATCATTGCACTGCATAGACCCGCCCTGACTTC
TCGGAGAGGCACCGTGCGGTTCAGCCGGCT
GGGAAACAAGGCAACCCTGAGAACAAGGAG
TGGGAAACAGATCGGAGCCCGCGTGCACTAC
TATCATGATATCAGCCCCATTCAGCCTGCTGA
GGTGCAGGAAGACATCGAGCTGCAGCCACT
GCTGCCACAGAGCGTGTCCCCTTACACAATTA
ACGACGGCCTGTACGATGTCTATGCAGACTC
TCTGCAGCAGCCTACTTTCCACCTGCCAAGTA
CTCTGTCAACCCATAACAATACATTCACTGTG
CCAATCAATAGCGGCATTGATTTTGTCTATCA
GCCAACCATGAGCATCGAGTCCGGGCCCGAC
ATTCCTCTGCCATCCCTGCCTACCCACACACCT
TTCGTGCCAATCGCTCCCACCGCACCTTCTAC
AAGTATCATTGTGGACGGGACCGATTTCATT
CTGCATCCTAGCTACTTTCTGCTGAGGCGCCG
ACGGAAGCGATTTCCATATTTCTTTACAGATG
TGCGCGTCGCCGCT
MRYRRSTRHKRASATQLYQTCKASG
TCPPDVIPKVEGTTIADQLLKYGSLGV
FFGGLGIGTGAGSGGRAGYVPLSTR
PPTSSITTSTIRPPVTVEPIGPLEPSIVS
MIEETTFIESGAPAPSIPSATGFDVTT
SANNTPAIINVTSIGESSVQSVSTHLN
PTFTEPSIIQPPAPAEASGHVLFSSPTI
STHTYEEIPMDTFVTSTDSSSVTSSTP
IPGSRPTTRLGLYSRATQQVKVVDPA
FMSSPQKLVTYNNPVFEGVDTDETII
FDRSQLLPAPDPDFLDIIALHRPALTS
RRGTVRFSRLGNKATLRTRSGKQIGA
RVHYYHDISPIQPAEVQEDIELQPLLP
QSVSPYTINDGLYDVYADSLQQPTFH
LPSTLSTHNNTFTVPINSGIDFVYQPT
MSIESGPDIPLPSLPTHTPFVPIAPTA
PSTSIIVDGTDFILHPSYFLLRRRRKRF
PYFFTDVRVAA
Page 44 of 48
% Nucleotide
Change
27.8%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV58 E1
ATGGATGACCCTGAAGGTACAAACGGGGTA
GGGGCGGGCTGTACTGGCTGGTTTGAGGTA
GAAGCGGTAATAGAACGAAGAACAGGAGAT
AATATTTCAGATGATGAGGACGAAACAGCAG
ACGATAGTGGTACAGATTTAATAGAGTTTAT
AGATGATTCAGTACAAAGTACTACACAGGCA
GAAGCAGAGGCAGCCCGAGCGTTGTTTAATG
TACAGGAAGGGGTGGACGATATAAATGCTGT
GTGTGCACTAAAACGAAAGTTTGCAGCATGC
TCAGAAAGTGCTGTAGAGGACTGTGTGGACC
GGGCTGCAAATGTGTGTGTATCGTGGAAATA
TAAAAATAAAGAATGCACACACAGAAAACGA
AAAATTATTGAGCTAGAAGACAGCGGATATG
GCAATACTGAAGTGGAAACTGAGCAGATGG
CACACCAGGTAGAAAGCCAAAATGGCGACG
CAGACTTAAATGACTCGGAGTCTAGTGGGGT
GGGGGCTAGTTCAGATGTAAGCAGTGAAAC
GGATGTAGACAGTTGTAATACTGTTCCATTAC
AAAATATTAGTAATATTCTACATAACAGTAAT
ACTAAAGCAACGCTATTATATAAATTCAAAGA
AGCTTATGGAGTAAGTTTTATGGAATTAGTTA
GACCATTTAAAAGTGATAAAACAAGCTGTAC
AGATTGGTGTATAACAGGGTATGGAATAAGT
CCCTCCGTAGCAGAAAGTTTAAAAGTACTAAT
TAAACAGCACAGTATATATACACACCTACAAT
GTTTAACGTGTGACAGAGGAATTATATTATTA
TTGTTAATTAGATTTAAATGTAGCAAAAATAG
ATTAACTGTGGCAAAATTAATGAGTAATTTAC
TATCAATTCCTGAAACATGTATGATTATCGAG
CCACCAAAATTACGAAGTCAAGCATGTGCCTT
ATATTGGTTTAGAACAGCAATGTCAAATATAA
GTGATGTGCAAGGGACAACACCAGAATGGA
TAGATAGATTAACAGTGTTACAGCATAGCTTT
AATGATGATATATTTGATTTAAGTGAAATGAT
ACAATGGGCATATGATAATGACATTACAGAT
GATAGTGACATTGCATATAAATATGCACAGTT
AGCAGATGTTAATAGTAATGCAGCAGCATTT
TTAAGAAGCAATGCACAAGCAAAAATAGTAA
AAGACTGTGGCGTTATGTGCAGACATTATAA
AAGAGCAGAAAAGCGTGGTATGACAATGGG
ACAATGGATACAAAGTAGGTGTGAAAAAACA
AATGATGGAGGTAATTGGAGACCAATAGTAC
AATTTTTAAGATATCAAAATATTGAATTTACA
GCATTTTTAGTTGCATTTAAACAGTTTTTACA
AGGTGTACCAAAAAAAAGTTGTATGTTACTG
TGTGGCCCAGCAAATACAGGGAAATCATATT
TTGGAATGAGTTTAATACATTTTTTAAAAGGA
TGCATTATTTCATATGTAAATTCCAAAAGTCA
TTTTTGGTTGCAGCCATTATCAGATGCTAAAC
TAGGTATGATAGATGATGTAACAGCCATAAG
CTGGACATATATAGATGATTATATGAGAAAT
GCATTAGATGGTAACGACATTTCAATAGATG
TAAAACATAGGGCATTAGTACAATTAAAATG
TCCACCATTAATAATTACCTCAAATACAAATG
CAGGCAAAGATTCACGATGGCCATATTTGCA
CAGTAGACTAACAGTATTTGAATTTAACAATC
CATTTCCATTTGATGCAAATGGTAATCCAGTG
TATAAAATAAATGATGAAAATTGGAAATCCTT
TTTCTCAAGGACGTGGTGCAAATTAGGCTTA
ATAGAGGAAGAGGACAAGGAAAACGATGGA
GGAAATATCAGCACGTTTAAGTGCAGTGCAG
GACAAAATCCTAGACATATACGAAGCTGA
ATGGACGATCCTGAGGGAACTAACGGGGTG
GGGGCTGGCTGCACTGGCTGGTTTGAGGTG
GAGGCTGTCATTGAAAGAAGAACTGGCGAC
AACATCTCCGACGATGAAGATGAGACTGCTG
ACGATTCTGGGACCGACCTGATCGAGTTCAT
TGACGATTCTGTGCAGAGTACCACACAGGCA
GAAGCTGAGGCAGCTCGCGCCCTGTTCAACG
TGCAGGAAGGAGTGGACGATATCAATGCCGT
GTGTGCTCTGAAGAGGAAATTTGCAGCCTGC
TCCGAGTCTGCTGTGGAAGACTGTGTCGATC
GCGCTGCAAACGTGTGCGTCTCCTGGAAGTA
CAAAAATAAGGAGTGCACCCACCGGAAAAG
AAAGATCATTGAACTGGAGGATTCTGGGTAT
GGAAACACAGAAGTGGAGACTGAACAGATG
GCACATCAGGTCGAGAGTCAGAACGGCGAC
GCCGATCTGAATGACTCAGAAAGCTCCGGCG
TGGGGGCCTCTAGTGATGTCTCAAGCGAGAC
CGACGTGGATTCCTGTAATACAGTCCCTCTGC
AGAACATCTCCAATATTCTGCACAACTCTAAT
ACAAAGGCCACTCTGCTGTACAAATTCAAGG
AGGCTTATGGCGTGTCTTTCATGGAACTGGT
CAGACCCTTCAAGAGTGACAAGACCTCATGC
ACAGATTGGTGTATCACAGGATACGGCATTA
GTCCCTCAGTGGCCGAGTCTCTGAAAGTCCT
GATCAAGCAGCACAGTATCTACACACATCTG
CAGTGCCTGACTTGTGACAGGGGGATCATTC
TGCTGCTGCTGATCAGGTTCAAATGCAGCAA
GAACCGCCTGACAGTGGCCAAACTGATGAGC
AATCTGCTGTCCATTCCCGAGACTTGTATGAT
CATTGAACCACCTAAGCTGCGCAGCCAGGCA
TGCGCACTGTACTGGTTTCGAACCGCCATGTC
AAACATCAGCGACGTGCAGGGCACTACCCCT
GAGTGGATTGATCGGCTGACAGTCCTGCAGC
ACTCATTCAACGACGATATCTTTGACCTGAGC
GAAATGATTCAGTGGGCCTACGACAATGATA
TCACCGACGATAGCGACATTGCTTACAAATAT
GCACAGCTGGCCGATGTGAACAGCAATGCCG
CTGCATTCCTGCGATCCAACGCTCAGGCAAA
AATCGTGAAGGACTGCGGCGTCATGTGCCGG
CACTACAAACGGGCCGAGAAGAGAGGGATG
ACTATGGGACAGTGGATTCAGAGCCGGTGC
GAAAAGACCAACGATGGCGGGAATTGGCGA
CCAATCGTGCAGTTTCTGCGGTATCAGAATAT
TGAGTTCACAGCTTTTCTGGTGGCATTCAAAC
AGTTTCTGCAGGGCGTCCCCAAGAAATCCTG
CATGCTGCTGTGTGGCCCTGCCAACACTGGG
AAGTCTTACTTCGGAATGAGTCTGATCCACTT
TCTGAAAGGATGTATCATTAGCTATGTGAATA
GCAAGTCCCATTTCTGGCTGCAGCCCCTGTCC
GACGCTAAGCTGGGCATGATCGACGATGTGA
CCGCAATCTCTTGGACATACATTGACGATTAT
ATGCGGAACGCACTGGACGGGAATGATATCA
GTATTGACGTGAAACACAGAGCCCTGGTCCA
GCTGAAGTGCCCACCCCTGATCATTACTAGCA
ACACCAATGCTGGAAAGGATAGTAGATGGCC
TTACCTGCATTCAAGGCTGACCGTGTTCGAGT
TTAACAATCCTTTCCCATTTGACGCAAACGGC
AACCCAGTGTACAAAATCAACGATGAAAACT
GGAAGAGCTTCTTCAGCCGGACTTGGTGTAA
ACTGGGCCTGATCGAGGAAGAGGACAAGGA
GAACGATGGAGGCAATATTTCAACCTTTAAG
TGCAGCGCAGGACAGAACCCAAGGCACATCC
GCAGC
MDDPEGTNGVGAGCTGWFEVEAVI
ERRTGDNISDDEDETADDSGTDLIEFI
DDSVQSTTQAEAEAARALFNVQEG
VDDINAVCALKRKFAACSESAVEDCV
DRAANVCVSWKYKNKECTHRKRKII
ELEDSGYGNTEVETEQMAHQVESQ
NGDADLNDSESSGVGASSDVSSETD
VDSCNTVPLQNISNILHNSNTKATLL
YKFKEAYGVSFMELVRPFKSDKTSCT
DWCITGYGISPSVAESLKVLIKQHSIY
THLQCLTCDRGIILLLLIRFKCSKNRLT
VAKLMSNLLSIPETCMIIEPPKLRSQA
CALYWFRTAMSNISDVQGTTPEWID
RLTVLQHSFNDDIFDLSEMIQWAYD
NDITDDSDIAYKYAQLADVNSNAAA
FLRSNAQAKIVKDCGVMCRHYKRAE
KRGMTMGQWIQSRCEKTNDGGN
WRPIVQFLRYQNIEFTAFLVAFKQFL
QGVPKKSCMLLCGPANTGKSYFGM
SLIHFLKGCIISYVNSKSHFWLQPLSD
AKLGMIDDVTAISWTYIDDYMRNAL
DGNDISIDVKHRALVQLKCPPLIITSN
TNAGKDSRWPYLHSRLTVFEFNNPF
PFDANGNPVYKINDENWKSFFSRT
WCKLGLIEEEDKENDGGNISTFKCSA
GQNPRHIRS
Page 45 of 48
% Nucleotide
Change
26.5%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV58 E2
ATGGAGGAAATATCAGCACGTTTAAGTGCAG
TGCAGGACAAAATCCTAGACATATACGAAGC
TGATAAAAATGATTTAACATCACAAATTGAAC
ATTGGAAACTAATACGCATGGAGTGTGCTAT
AATGTATACAGCCAGACAAATGGGAATATCA
CATTTGTGCCACCAGGTGGTGCCGTCATTGG
TAGCATCAAAGACTAAAGCGTTTCAAGTAATT
GAACTGCAAATGGCATTAGAGACATTAAATG
CATCACCATATAAAACAGATGAATGGACATT
GCAACAAACAAGCTTAGAAGTGTGGTTATCA
GAGCCACAAAAATGCTTTAAAAAAAAAGGCA
TAACAGTAACTGTACAATATGACAATGATAA
AGCAAACACAATGGATTATACAAATTGGAGT
GAAATATATATTATTGAGGAAACAACATGTA
CTTTGGTAGCAGGAGAAGTTGACTATGTGGG
GTTGTATTATATACATGGCAATGAAAAGACG
TATTTTAAATATTTTAAAGAGGATGCAAAAAA
GTACTCTAAAACACAATTATGGGAGGTACAT
GTGGGTAGTCGGGTAATTGTATGTCCTACAT
CTATACCTAGTGATCAAATATCCACTACTGAA
ACTGCTGACCCAAAGACCACCGAGGCCACCA
ACAACGAAAGTACACAGGGGACAAAGCGAC
GACGACTCGATTTACCAGACTCCAGAGACAA
CACCCAGTACTCCACAAAGTATACAGACTGC
GCCGTGGACAGTAGACCACGAGGAGGAGGA
CTACACAGTACAACTAACTGTACATACAAAG
GGCGGAACGTGTGTAGTTCTAAAGTTTCACC
TATCGTGCATTTAAAAGGTGACCCAAATAGTT
TAAAATGTTTAAGATATAGATTAAAACCATTT
AAAGACTTATACTGTAATATGTCATCCACATG
GCATTGGACCAGTGATGACAAAGGTGACAAA
GTAGGAATTGTTACTGTAACATACACAACGG
AAACACAACGACAACTGTTTTTAAACACTGTT
AAAATACCACCCACTGTGCAAATAAGTACTG
GTGTTATGTCATTGTAA
TTGTATGTCCTACATCTATACCTAGTGATCAA
ATATCCACTACTGAAACTGCTGACCCAAAGAC
CACCGAGGCCACCAACAACGAAAGTACACAG
GGGACAAAGCGACGACGACTCGATTTACCAG
ACTCCAGAGACAACACCCAGTACTCCACAAA
GTATACAGACTGCGCCGTGGACAGTAGACCA
CGAGGAGGAGGACTACACAGTACAACTAACT
GTACATACAAAGGGCGGAACGTGTGTAGTTC
TAAAGTTTCACCTATCGTGCATTTAA
ATGATATTACCTATTTTTGTTGTTTGTTTTATA
CTGTTTTTATGCTTGTGCATTTTTTTGCGGCCA
TTGGTGCTATCTATTTCTATATATGCTTGGTTG
CTGGTGTTGGTGTTGCTGCTTTGGGTGTCTGT
GGGGTCGGCTCTACGAATTTTTTTCTGTTACT
TAATATTTTTATATATACCAATGATGTGTATTA
ATTTTCATGCACAATACTTAACCCAACAAGAC
TAA
ATGTTCCAGGACGCAGAGGAGAAACCACGG
ACATTGCATGATTTGTGTCAGGCGTTGGAGA
CATCTGTGCATGAAATCGAATTGAAATGCGTT
GAATGCAAAAAGACTTTGCAGCGATCTGAGG
TATATGACTTTGTATTTGCAGATTTAAGAATA
GTGTATAGAGATGGAAATCCATTTGCAGTAT
GTAAAGTGTGCTTACGATTGCTATCTAAAATA
AGTGAGTATAGACATTATAATTATTCGCTATA
TGGAGACACATTAGAACAAACACTAAAAAAG
TGTTTAAATGAAATATTAATTAGATGTATTAT
TTGTCAAAGACCATTGTGTCCACAAGAAAAA
AAAAGGCATGTGGATTTAAACAAAAGGTTTC
ATAATATTTCGGGTCGTTGGACAGGGCGCTG
TGCAGTGTGTTGGAGACCCCGACGTAGACAA
ACACAAGTGTAA
ATGGAGGAAATCTCCGCAAGACTGAGTGCCG
TGCAGGACAAAATTCTGGACATCTATGAAGC
CGACAAAAATGACCTGACATCACAGATCGAG
CACTGGAAACTGATTAGGATGGAATGCGCCA
TCATGTACACCGCTCGCCAGATGGGCATTTCT
CACCTGTGTCATCAGGTGGTCCCCTCCCTGGT
CGCATCTAAAACAAAGGCCTTCCAGGTCATC
GAGCTGCAGATGGCTCTGGAAACTCTGAACG
CAAGTCCCTACAAGACCGATGAGTGGACACT
GCAGCAGACTAGCCTGGAGGTCTGGCTGTCC
GAACCTCAGAAATGCTTTAAGAAAAAGGGGA
TTACAGTGACTGTCCAGTATGACAACGATAA
GGCAAATACAATGGACTACACAAACTGGAGC
GAAATCTACATCATTGAGGAAACCACATGTA
CCCTGGTGGCCGGAGAAGTGGATTACGTCG
GCCTGTACTATATTCACGGGAACGAGAAGAC
ATACTTCAAGTACTTCAAGGAAGACGCTAAA
AAGTACTCCAAGACCCAGCTGTGGGAGGTGC
ATGTCGGCAGCAGAGTGATCGTCTGCCCAAC
CTCAATCCCCAGCGATCAGATTAGCACTACCG
AAACTGCCGACCCTAAAACAACTGAGGCTAC
CAACAATGAATCCACTCAGGGAACCAAGCGG
AGAAGGCTGGACCTGCCAGACAGCCGGGAC
AACACACAGTACAGTACAAAGTATACTGATT
GTGCAGTGGACTCACGCCCCCGAGGCGGAG
GACTGCACAGCACCACAAACTGCACCTACAA
AGGAAGGAATGTGTGTAGCTCCAAGGTGAG
TCCTATTGTCCATCTGAAAGGCGATCCAAACT
CACTGAAGTGCCTGCGGTACAGACTGAAACC
ATTCAAGGACCTGTATTGTAATATGTCTAGTA
CTTGGCATTGGACCTCCGACGATAAAGGCGA
TAAAGTGGGGATCGTGACCGTCACATATACT
ACCGAGACCCAGCGGCAGCTGTTTCTGAATA
CAGTGAAGATTCCCCCTACAGTCCAGATCAGT
ACTGGCGTGATGTCACTG
ATGTATGTCCTGCACCTGTACCTGGTCATCAA
GTATCCACTGCTGAAACTGCTGACCCAGAGA
CCCCCACGCCCTCCAACAACTAAAGTGCATCG
GGGCCAGAGCGACGATGACTCCATCTACCAG
ACCCCAGAAACCACACCCTCTACACCTCAGAG
TATTCAGACAGCCCCCTGGACTGTCGATCACG
AGGAAGAGGACTATACTGTGCAGCTGACTGT
CCATACCAAGGGCGGGACATGCGTGGTCCTG
AAATTCCACCTGTCCTGTATC
ATGATTCTGCCCATTTTTGTGGTGTGCTTTATT
CTGTTCCTGTGCCTGTGTATCTTTCTGCGGCC
CCTGGTCCTGTCTATTTCTATTTACGCCTGGCT
GCTGGTGCTGGTCCTGCTGCTGTGGGTGAGC
GTCGGCTCCGCTCTGCGGATCTTCTTTTGCTA
CCTGATCTTCCTGTATATTCCCATGATGTGTAT
TAACTTTCACGCCCAGTATCTGACCCAGCAGG
AC
ATGTTCCAGGATGCCGAAGAAAAACCCCGAA
CTCTGCACGATCTGTGTCAGGCTCTGGAGAC
CTCTGTCCATGAGATTGAACTGAAATGCGTG
GAGTGTAAGAAAACACTGCAGCGGAGCGAA
GTGTACGACTTCGTCTTTGCCGATCTGCGCAT
CGTCTATCGAGACGGAAACCCATTCGCTGTG
TGCAAGGTCTGTCTGCGCCTGCTGAGCAAAA
TTTCCGAGTACCGGCACTACAACTATAGTCTG
TATGGCGATACCCTGGAGCAGACACTGAAGA
AATGCCTGAATGAAATCCTGATTAGGTGCAT
CATTTGTCAGCGCCCCCTGTGTCCTCAGGAAA
AGAAACGACACGTGGACCTGAACAAGAGGT
TTCATAATATCTCCGGCCGGTGGACTGGAAG
ATGCGCAGTGTGTTGGAGGCCCCGGAGAAG
GCAGACCCAGGTC
MEEISARLSAVQDKILDIYEADKNDL
TSQIEHWKLIRMECAIMYTARQMGI
SHLCHQVVPSLVASKTKAFQVIELQ
MALETLNASPYKTDEWTLQQTSLEV
WLSEPQKCFKKKGITVTVQYDNDKA
NTMDYTNWSEIYIIEETTCTLVAGEV
DYVGLYYIHGNEKTYFKYFKEDAKKY
SKTQLWEVHVGSRVIVCPTSIPSDQI
STTETADPKTTEATNNESTQGTKRRR
LDLPDSRDNTQYSTKYTDCAVDSRP
RGGGLHSTTNCTYKGRNVCSSKVSPI
VHLKGDPNSLKCLRYRLKPFKDLYCN
MSSTWHWTSDDKGDKVGIVTVTYT
TETQRQLFLNTVKIPPTVQISTGVMS
L
HPV58 E4
HPV58 E5
HPV58 E6
Page 46 of 48
% Nucleotide
Change
26.6%
LYVLHLYLVIKYPLLKLLTQRPPRPPTT
KVHRGQSDDDSIYQTPETTPSTPQSI
QTAPWTVDHEEEDYTVQLTVHTKG
GTCVVLKFHLSCI
22.1%
MILPIFVVCFILFLCLCIFLRPLVLSISIY
AWLLVLVLLLWVSVGSALRIFFCYLIF
LYIPMMCINFHAQYLTQQD
25.5%
MFQDAEEKPRTLHDLCQALETSVHEI
ELKCVECKKTLQRSEVYDFVFADLRIV
YRDGNPFAVCKVCLRLLSKISEYRHY
NYSLYGDTLEQTLKKCLNEILIRCIICQ
RPLCPQEKKRHVDLNKRFHNISGRW
TGRCAVCWRPRRRQTQV
27.7%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV58 E7
ATGAGAGGAAACAACCCAACGCTAAGAGAA
TATATTTTAGATTTACATCCTGAACCAACTGA
CCTATTCTGCTATGAGCAATTATGTGACAGCT
CAGACGAGGATGAAATAGGCTTGGACGGGC
CAGATGGACAAGCACAACCGGCCACAGCTAA
TTACTACATTGTAACTTGTTGTTACACTTGTG
GCACCACGGTTCGTTTGTGTATCAACAGTACA
ACAACCGACGTACGAACCCTACAGCAGCTGC
TTATGGGCACATGTACCATTGTGTGCCCTAGC
TGTGCACAGCAATAA
ATGTCCGTGTGGCGGCCTAGTGAGGCCACTG
TGTACCTGCCTCCTGTGCCTGTGTCTAAGGTT
GTAAGCACTGATGAATATGTGTCACGCACAA
GCATTTATTATTATGCTGGCAGTTCCAGACTT
TTGGCTGTTGGCAATCCATATTTTTCCATCAA
AAGTCCCAATAACAATAAAAAAGTATTAGTTC
CCAAGGTATCAGGCTTACAGTATAGGGTCTT
TAGGGTGCGTTTACCTGATCCCAATAAATTTG
GTTTTCCTGATACATCTTTTTATAACCCTGATA
CACAACGTTTGGTCTGGGCATGTGTAGGCCT
TGAAATAGGTAGGGGACAGCCATTGGGTGTT
GGCGTAAGTGGTCATCCTTATTTAAATAAATT
TGATGACACTGAAACCAGTAACAGATATCCC
GCACAGCCAGGGTCTGATAACAGGGAATGCT
TATCTATGGATTATAAACAAACACAATTATGT
TTAATTGGCTGTAAACCTCCCACTGGTGAGCA
TTGGGGTAAAGGTGTTGCCTGTAACAATAAT
GCAGCTGCTACTGATTGTCCTCCATTGGAACT
TTTTAATTCTATTATTGAGGATGGTGACATGG
TAGATACAGGGTTTGGATGCATGGACTTTGG
TACATTGCAGGCTAATAAAAGTGATGTGCCT
ATTGATATTTGTAACAGTACATGCAAATATCC
AGATTATTTAAAAATGGCCAGTGAACCTTATG
GGGATAGTTTGTTCTTTTTTCTTAGACGTGAG
CAGATGTTTGTTAGACACTTTTTTAATAGGGC
TGGAAAACTTGGCGAGGCTGTCCCGGATGAC
CTTTATATTAAAGGGTCCGGTAATACTGCAGT
TATCCAAAGTAGTGCATTTTTTCCAACTCCTA
GTGGCTCTATAGTTACCTCAGAATCACAATTA
TTTAATAAGCCTTATTGGCTACAGCGTGCACA
AGGTCATAACAATGGCATTTGCTGGGGCAAT
CAGTTATTTGTTACCGTGGTTGATACCACTCG
TAGCACTAATATGACATTATGCACTGAAGTAA
CTAAGGAAGGTACATATAAAAATGATAATTT
TAAGGAATATGTACGTCATGTTGAAGAATAT
GACTTACAGTTTGTTTTTCAGCTTTGCAAAAT
TACACTAACTGCAGAGATAATGACATATATAC
ATACTATGGATTCCAATATTTTGGAGGACTG
GCAATTTGGTTTAACACCTCCTCCGTCTGCCA
GTTTACAGGACACATATAGATTTGTTACCTCC
CAGGCTATTACTTGCCAAAAAACAGCACCCCC
TAAAGAAAAGGAAGATCCATTAAATAAATAT
ACTTTTTGGGAGGTTAACTTAAAGGAAAAGT
TTTCTGCAGATCTAGATCAGTTTCCTTTGGGA
CGAAAGTTTTTATTACAATCAGGCCTTAAAGC
AAAGCCCAGACTAAAACGTTCGGCCCCTACT
ACCCGTGCACCATCCACCAAACGCAAAAAGG
TTAAAAAATAA
ATGCGGGGAAATAATCCTACCCTGAGAGAGT
ACATCCTGGACCTGCACCCTGAGCCCACCGA
CCTGTTCTGTTACGAGCAGCTGTGTGACAGCT
CCGACGAGGATGAAATCGGCCTGGACGGAC
CAGATGGACAGGCACAGCCTGCAACCGCTAA
CTACTATATCGTGACATGCTGTTACACTTGCG
GCACCACAGTCCGGCTGTGTATTAATTCTACT
ACCACAGATGTGAGAACACTGCAGCAGCTGC
TGATGGGGACTTGCACCATTGTCTGCCCCAG
CTGTGCCCAGCAG
ATGGTGCTGATCCTGTGCTGTACCCTGGCTAT
CCTGTTTTGTGTCGCCGATGTCAATGTGTTTC
ATATTTTCCTGCAGATGTCCGTGTGGAGGCCC
TCTGAGGCCACCGTCTATCTGCCCCCTGTGCC
TGTCTCAAAAGTGGTCAGCACCGACGAATAC
GTGAGCAGGACATCCATCTACTATTACGCAG
GAAGCTCCCGCCTGCTGGCTGTCGGCAACCC
TTATTTCAGCATCAAGAGTCCAAACAATAACA
AGAAAGTGCTGGTCCCCAAGGTGAGTGGGC
TGCAGTATCGGGTGTTTAGGGTCCGCCTGCC
AGATCCCAACAAGTTTGGATTCCCCGACACTT
CCTTCTACAATCCTGATACCCAGCGACTGGTG
TGGGCTTGCGTCGGACTGGAGATCGGACGA
GGACAGCCACTGGGAGTGGGCGTCTCAGGA
CACCCCTATCTGAACAAATTTGACGATACAGA
GACTAGCAATAGATACCCCGCACAGCCTGGC
AGTGACAACAGAGAATGTCTGTCAATGGATT
ACAAGCAGACTCAGCTGTGCCTGATTGGCTG
TAAACCACCCACCGGGGAGCATTGGGGGAA
GGGAGTGGCTTGCAATAACAATGCCGCTGCA
ACTGACTGTCCTCCACTGGAGCTGTTCAATAG
CATCATTGAAGACGGCGATATGGTGGACACC
GGCTTTGGGTGCATGGATTTCGGGACACTGC
AGGCCAACAAGTCTGACGTGCCTATCGATAT
TTGCAACAGTACCTGTAAGTACCCTGACTACC
TGAAGATGGCTTCCGAGCCCTACGGCGACTC
TCTGTTCTTTTTCCTGCGGAGAGAACAGATGT
TTGTGAGACACTTTTTCAACAGGGCAGGGAA
ACTGGGAGAGGCCGTCCCTGACGATCTGTAC
ATCAAGGGAAGCGGCAATACCGCTGTGATTC
AGTCTAGTGCATTTTTCCCTACACCATCAGGC
AGCATCGTGACTTCCGAATCTCAGCTGTTTAA
CAAGCCATACTGGCTGCAGCGAGCACAGGG
ACATAACAATGGGATTTGCTGGGGAAACCAG
CTGTTCGTGACAGTGGTGGACACCACAAGAT
CCACTAATATGACCCTGTGTACAGAGGTCACT
AAGGAAGGCACTTACAAGAACGACAACTTCA
AGGAGTACGTGAGACACGTCGAGGAATACG
ATCTGCAGTTTGTGTTCCAGCTGTGCAAGATC
ACCCTGACAGCAGAGATCATGACCTACATTC
ATACAATGGACTCTAATATTCTGGAAGATTG
GCAGTTTGGGCTGACCCCCCCTCCAAGTGCCT
CACTGCAGGACACATATAGGTTCGTGACTAG
CCAGGCAATCACTTGTCAGAAAACCGCCCCC
CCTAAGGAGAAAGAAGATCCCCTGAACAAGT
ACACATTTTGGGAAGTGAATCTGAAGGAAAA
ATTCTCCGCTGACCTGGATCAGTTTCCACTGG
GGCGCAAGTTCCTGCTGCAGTCTGGACTGAA
GGCAAAACCCCGACTGAAACGGAGCGCACC
AACTACCCGCGCTCCCTCCACCAAGCGAAAG
AAAGTGAAGAAA
MRGNNPTLREYILDLHPEPTDLFCYE
QLCDSSDEDEIGLDGPDGQAQPATA
NYYIVTCCYTCGTTVRLCINSTTTDVR
TLQQLLMGTCTIVCPSCAQQ
HPV58 L1
Page 47 of 48
MSVWRPSEATVYLPPVPVSKVVSTD
EYVSRTSIYYYAGSSRLLAVGNPYFSIK
SPNNNKKVLVPKVSGLQYRVFRVRL
PDPNKFGFPDTSFYNPDTQRLVWAC
VGLEIGRGQPLGVGVSGHPYLNKFD
DTETSNRYPAQPGSDNRECLSMDYK
QTQLCLIGCKPPTGEHWGKGVACN
NNAAATDCPPLELFNSIIEDGDMVD
TGFGCMDFGTLQANKSDVPIDICNS
TCKYPDYLKMASEPYGDSLFFFLRRE
QMFVRHFFNRAGKLGEAVPDDLYIK
GSGNTAVIQSSAFFPTPSGSIVTSESQ
LFNKPYWLQRAQGHNNGICWGNQ
LFVTVVDTTRSTNMTLCTEVTKEGTY
KNDNFKEYVRHVEEYDLQFVFQLCKI
TLTAEIMTYIHTMDSNILEDWQFGLT
PPPSASLQDTYRFVTSQAITCQKTAP
PKEKEDPLNKYTFWEVNLKEKFSADL
DQFPLGRKFLLQSGLKAKPRLKRSAP
TTRAPSTKRKKVKK
% Nucleotide
Change
22.9%
30.1%
Gene
PaVE DNA Sequence
Optimized DNA Sequence
Protein Sequence
HPV58 L2
ATGAGACACAAACGGTCTACAAGGCGCAAGC
GTGCATCTGCTACACAACTTTACCAAACATGC
AAGGCCTCAGGCACCTGCCCACCTGATGTTAT
ACCCAAAGTTGAAGGCACTACTATAGCAGAT
CAAATATTACGATATGGTAGCTTAGGGGTGT
TTTTTGGAGGTTTAGGCATTGGTACAGGGTC
GGGTACAGGTGGCAGGACTGGATATGTGCC
CCTTGGTAGTACCCCACCGTCTGAGGCTATAC
CTTTACAGCCCATACGTCCCCCAGTTACCGTT
GATACTGTGGGGCCTTTGGATTCTTCTATTGT
ATCTTTAATAGAGGAATCTAGTTTTATAGACG
CCGGTGCACCAGCCCCATCAATTCCCACTCCA
TCTGGTTTTGATATTACCACCTCTGCAGATAC
TACACCTGCAATACTTAATGTTTCCTCTATTG
GAGAATCATCTATACAAACTGTTTCTACACAT
TTAAATCCCTCCTTTACTGAGCCATCCGTACTC
CGCCCTCCTGCACCTGCAGAGGCCTCTGGAC
ATTTAATATTTTCCTCTCCTACTGTTAGCACAC
ATAGTTATGAAAACATACCAATGGATACCTTT
GTTATTTCTACTGACAGTGGCAATGTCACGTC
TAGCACACCCATTCCAGGGTCTCGCCCTGTG
GCACGCCTTGGTTTATACAGTCGCAACACCCA
ACAAGTTAAGGTTGTTGACCCTGCTTTTTTAA
CATCTCCTCATAGACTTGTAACATATGATAAT
CCAGCATTTGAAGGCTTTAACCCTGAGGACA
CATTGCAGTTTCAACATAGTGACATATCGCCT
GCTCCTGATCCTGATTTTCTAGATATTGTTGC
ATTACACAGACCTGCATTAACCTCTCGCAGGG
GTACTGTACGTTATAGTAGGGTTGGGCAAAA
GGCTACACTTCGTACTCGCAGTGGAAAGCAA
ATAGGGGCTAAAGTACATTACTACCAAGACT
TAAGTCCCATACAGCCTGTCCAGGAACAGGT
ACAACAGCAGCAACAATTTGAATTACAATCTT
TAAATACTTCTGTTTCTCCCTATAGTATTAATG
ATGGACTTTATGATATTTATGCTGACGATGCT
GATACTATACATGATTTTCAGAGTCCTCTGCA
CTCACATACGTCCTTTGCCACCACACGTACCA
GTAATGTGTCCATACCATTAAATACTGGATTT
GACACTCCTCTTGTGTCATTGGAACCTGGTCC
AGACATTGCATCTTCTGTAACATCTATGTCTA
GTCCATTTATTCCTATATCTCCACTAACTCCTT
TTAATACCATAATTGTGGATGGTGCTGATTTT
ATGTTGCACCCTAGCTATTTTATTTTGCGTCG
CAGACGTAAACGTTTTCCATATTTTTTTGCAG
ATGTCCGTGTGGCGGCCTAG
ATGAGGCATAAGAGGAGCACCAGAAGAAAA
AGAGCATCCGCAACCCAGCTGTATCAGACCT
GTAAAGCATCCGGCACCTGTCCACCTGACGT
GATCCCCAAGGTCGAGGGCACCACAATCGCC
GATCAGATTCTGAGATACGGATCTCTGGGCG
TGTTCTTTGGAGGACTGGGAATTGGAACCGG
CAGTGGGACAGGAGGCCGGACTGGATATGT
GCCACTGGGGAGTACCCCCCCTTCAGAAGCC
ATCCCACTGCAGCCCATTAGACCACCCGTGAC
TGTGGACACCGTGGGCCCTCTGGATAGCTCC
ATCGTCAGCCTGATTGAGGAATCTAGTTTCAT
CGACGCAGGAGCACCAGCTCCTAGCATCCCA
ACACCATCCGGGTTTGACATTACTACCTCTGC
TGATACAACTCCAGCAATTCTGAACGTGTCAA
GCATCGGGGAGTCCTCTATTCAGACAGTCAG
CACTCACCTGAATCCTTCTTTCACCGAGCCAA
GTGTGCTGAGACCTCCAGCACCAGCAGAAGC
TAGCGGACACCTGATCTTCAGTTCACCCACCG
TGAGTACACATTCATACGAAAACATCCCTATG
GACACATTTGTGATTAGCACTGATTCCGGAA
ATGTCACTAGCTCCACCCCTATCCCAGGAAGC
CGGCCTGTGGCAAGACTGGGACTGTACTCCC
GAAACACTCAGCAGGTCAAGGTGGTGGACCC
CGCTTTTCTGACTTCCCCTCATCGCCTGGTGA
CCTATGATAACCCAGCATTCGAGGGCTTTAAT
CCCGAAGACACCCTGCAGTTCCAGCACTCTG
ATATCAGTCCCGCCCCTGACCCAGATTTTCTG
GACATTGTGGCCCTGCATAGGCCCGCTCTGA
CCTCACGGAGAGGGACAGTGCGCTACAGCC
GAGTCGGACAGAAAGCAACACTGAGAACTA
GGAGCGGGAAGCAGATCGGAGCCAAAGTGC
ACTACTATCAGGATCTGTCCCCTATTCAGCCA
GTGCAGGAGCAGGTCCAGCAGCAGCAGCAG
TTCGAACTGCAGTCCCTGAACACCTCCGTGTC
TCCTTATTCTATCAATGACGGCCTGTACGATA
TCTACGCAGACGATGCCGACACAATCCATGA
TTTCCAGTCCCCACTGCACTCACATACCAGCT
TCGCAACCACACGCACTTCCAACGTGTCTATC
CCTCTGAATACCGGATTTGACACACCACTGGT
GTCTCTGGAGCCCGGCCCTGATATTGCTTCTA
GTGTCACCAGTATGTCAAGCCCCTTCATCCCT
ATTTCACCACTGACTCCCTTTAATACCATCATT
GTGGACGGCGCCGATTTCATGCTGCACCCAA
GCTACTTTATCCTGAGGCGCCGACGGAAAAG
GTTCCCCTATTTCTTTGCTGACGTGCGCGTCG
CCGCT
MRHKRSTRRKRASATQLYQTCKASG
TCPPDVIPKVEGTTIADQILRYGSLGV
FFGGLGIGTGSGTGGRTGYVPLGSTP
PSEAIPLQPIRPPVTVDTVGPLDSSIV
SLIEESSFIDAGAPAPSIPTPSGFDITTS
ADTTPAILNVSSIGESSIQTVSTHLNP
SFTEPSVLRPPAPAEASGHLIFSSPTV
STHSYENIPMDTFVISTDSGNVTSST
PIPGSRPVARLGLYSRNTQQVKVVD
PAFLTSPHRLVTYDNPAFEGFNPEDT
LQFQHSDISPAPDPDFLDIVALHRPA
LTSRRGTVRYSRVGQKATLRTRSGK
QIGAKVHYYQDLSPIQPVQEQVQQ
QQQFELQSLNTSVSPYSINDGLYDIY
ADDADTIHDFQSPLHSHTSFATTRTS
NVSIPLNTGFDTPLVSLEPGPDIASSV
TSMSSPFIPISPLTPFNTIIVDGADFM
LHPSYFILRRRRKRFPYFFADVRVAA
Page 48 of 48
% Nucleotide
Change
28.5%
Download