Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, exon, CDS, three_prime_UTR
ATGCCGTTACAGCCTTCTCCGCCACCACTGGCGAGAACCATAAATGCCGACGGATTGTCGGAAGAGTTCTTGCCGCCAACAACAGTATCATCGGGTTAGAGGGAACCCAGGAGGCTCCGTGAGAGGAGCACCGTATCTTTCTCCTCTCTCTATTTCTTTGCACTGGTGATTTTGGAGAGAGATATTCGGAGGTTTTGGGTTTAGGATTGAAAGAGGAGAGAGAAATATACGAGAGAGTGATAGGGTCGAGGGAAGGAGGTCGAAGATTTGAGGGTTTCAGGGAAGGAGAGAGAAAGTATGTGGGAGTTATAGGGAAGCGGCGAAAGAATGAACTGATGTACGCGAAGTACACGGTATTTTCGTCTTTTTCATGGTCGACACGAAAAGTGTGTTTACGGTAATACCCCTCCCCTCTTGGCTTATATAATAGAGATTAGCAGGTAGTTTAAGTTAAGCAGGTAGTTTAAGAGTAATAGTAGATGTTTTTCGGGCGTGCTATATGTAGTATTTGTACTCGATTTTATTCGTGATCAGACTATAATTACCACCCTCTTACCGATCCCCATTGAAAAAGTCACCAAGGAAAGCACCCAAGTTCCCAAAATTATCCGAGACAACTTTATCAATAAACTTTGAGCCCGAAGTTTAAATACCTCAAGAAATTATTTATGAGTGAAGATAAATGTGCTAATGCTATGATCTCGGTGTTATGATTTGGGGTGAAAGATGAAGAAAAAGGATGAAGAATGTCTGTTATAAAGAAGATGAAGTTGTTAACATACCTTAGGAAATTATTTACGAGTGAAGTTAAAAGTGCTAATGCTATGATCTCGGTGTTATGATTTTGGGTAAAGGATGAAGAATGTTTGTTATGACGAAGTTGTGGAAAAAATGAGGGAAAACAAAAGGAAATCGAAAAATGGCACCATAAACAGAAAACTTAACAGTTTCATTTAGTGGTGATAAATTTGAAGGTTTATGTAAATACAGAGTATAAGATAATAATTTTGAAATAATGTGATAAATTTGAAAAGTGAAATTATACCCCCTCTAAATCTAAATGTTTTTCCCATTTAGAATTTCGACACTATTCACAATGGAAGAGAATCTTATGAATTACGTTCAATATATAAGAAAAATCATAGTCACGTGTGATCTTGTTTGATTCGTCTCAATGAGTTTTTTAACAATATGAAAATTTTAAAATTTTTAATAACACGCTGCTAGAGATATTAACGATTTAATATATGCATTGGCATGTGTGCCAAAGAAAAATGGAAAAAAACTTAGAAACAGAGGGAGTAAGATGGTATTCCCTCCGTCTCTTTTTGTTCTTTACGTTTGGTATTTTTCACGCATTTTAACGATTAATTAATTTGCATTGAGATTCCTCAATTTTTTTTATTTAAACAAGATGAATTACGTTTATTTATAATGTTTTCACTTTTATAAAAATTCGGATATTGAGAATTGAGAAAAATTTAATGTGTCAGTGAAAAAGTGTGAGAAATTAAATGACCCAATAAATTTAATAGAACAAAATGAGACGGAGGGAGTAATATCGTTAACTTAAAAAATACTCCGTAATTCTAAACAACAAATTCTGGAGTGTTTATTACTCAGCAGTAAATTGGATGTATGTATGACAAATACAACGTACATGTATGAAAGATGGTGATGGGAACGAATCTAACATGAAGCGGCATGCTGGGCCTCTAAGGTGAGAGGAAAATATCATCGCCACGTACGCAACACAAGTAGCGCACAGACCCGAAAAAGATCTCCTCTTCTTACTCCCCCCTTCTTCTCTCTCTCCTCTTTCTTTCTTTCTTTCTCTCCAAATCCCAAAATCTTCACACAAAAAACCCCCACTCCTTCAATTCTCGTTTCCGAAAATAAATAAATCAAGAATCTAAATCTTCTGCTTCAGATACTTTTTTTCATCTTGAACAGGTAATTTCATCTGTATTTTCATCTTCAGTTTTATTTCCAATATTTCA
GTTAGATATTTGCTGTCTAATTTCTTCATTGTTTTTTAATTTGCTGTTATTTTACTGTCAATTGTTGTAAATACAAATCAATTACTTTCATTTTATTGATTAATTTCGATTTGTTCATTTTTTCGAATTTGATGAGTAGTTTCTAATTTTTTTTCTTGATTTTATGCAGAATTTTTGGGTGTGGTTTGTAAAAATATAGGAAATAAACCTTTATTACAGAGATTTGATTGAAAATCGAAGAAATCAGAGATATTTGAAGCTATAGGGGTACTATTGTGAATAATTATTGGTGGAGAGGGTTTAATAGTACCTCGTTTGAATAGTAATCTGGAAGAAGTAGCCCCAACTGTAAGTATTGATTATTTAATTTATGAAGAACTGATTAATTGGGAAGGCTGACTTCATTGTTCGATTTTTCGATGCTCGGTTCGGGATGCTGAATTATGAGATTCTAATCCTTGATTGGTAGCGGGCGTCGGATTGAAGTTTGTAGTTGGGTTGAAACTATAACATTATTTTCCACAGAATAAGAGAAATGGGGACCGTAGATTATATGACACGTCCATCGGAAAGGTGGATTGATGGCCTTCAGTTCTCCTCATTGTTTTGGCCGCCGCCTCAAGAAACACAACAACGGAAGGTAGTGTCACTTGCTCCATGTTTTATTCTATTGCTTTACACTGCCTTCAAATTTCATGTCATTTCGGTACGAGATTGGTATTTCCTGTTGTTGTAATGGTGTGGGCTTGTTAACATGGATACTTGGTAGCTTGGTGCCGATCATGAAATTCATGTCATTTGTACTAGATTATGTATTTCCTGTTGGTGTAATGGCGTGGGCTTGTTAACATGGATACTTGGTGGCTTGGTGCCTATGATGAAACTTCACATTCAAATGGTTTGACCGGTCAAGCTTGGAGTAGATTATTTAATACTTAGGGAGGCTAAGCCTGTTAGTAAAACTTGGAAACCCCTTTAAGGCTCCTTCTTAAAACCGAACTCGCATTAAAGAAATTAATCAAATGATTTTTGAACATATTAAAACTTAAATTTAGTTTGTTATAAAAAATAAAACTACGCTTGAATTAATCTATTTCAATGCATGTTCACCTAAGTGGAAATCATGGATTGATATTGGACCATCATAGACACACAACAAATTTCAAGATTACCTTGAGTACCTAAACGTGATATAACTATAACTATAACTATACTATGAACAATGGCGTATCCAGGATCTTAAAATAGGGTAGACATTTTTCACATGATATTACATAGCGTACATACTCTATAGTTTACATACTACGTATAAAATAAAATAAATTACATGTATATTACATAGAGTATGCGCATAAAATGATACAAGTAAAGGACATCATATATTCATATTGTATATAAAATAAAAAGAAGAAAAATGTTGTTGATATGGATAGAACCCGCAATCTGTTGAAAAATACCAAGTACTTAGAACCACTGAGATGCAAATACAGATGTGTATCTTTTATGGGTTAAATATTACTTATATAATTAATGCTGTTGTATTTTTCAGCACATTTATTTGTTTTTTTTTTTGGGAAGGGTAGACATTTGTCTACCCAAAGTGCAATATGGATCCGCCCCTGACTATGAAGCAACTTGAAATTGTACAAGTAGAGTAGTCGACTATAGGCTTGATAACTTGATAAGTGAATTAGAATTGATATTGTATAGTTTATTGGACCTTGATCATTAAAGTGTCCTTCACGGCTCTTTAATGAAATAAACTAGAATACTAAAGCGTTGTAGATGATTCATTAGCAAAGTAACCTTGAGAATTAGCAATTGACTAACCTTACTCGAGAAGTTTAAGTTGTTTAGAAGGTGCATTGTAGTGAGTGTAGGTAACTAATTCAAAGATTGTTTCTTTACCCATGTTCAATACTCAAATAGGGAACTTCCCCTTAGTTTACAAGCCATTTTCTTTCACTTCAGTGCTTTAATGTTTTTTGTTTGTTTTGACA
TTGGAAAAAAAAGTTGCACTACCATAAGATAATGTAAAATTTTATTTATCTCGCTTAGAAGAGCGAGCCATCAATTTAAGACACATTCTTTAAGCATTTAAACCTTATTAGAGACTTTGAAAAGTCTTCTAAAACTCTACAACTTCACTAACTTTTGAAAACGTGAATAAGTTGATATTCAATTACATCAAAGCCATCAACCAATTCTAATCATGATACTAAAATAAAGGCATAAAGGCATCTCTGACATAGAACATACGCACCTAACGCATGCAATTGTCCTCGGCTCATCGTTAGTCTTATTCCTAGACATGACTAATTTCAATCATGCTCCAAAAATATTACTTCATCTCACAAGAAACTTCTCATTTCTCTTGGGCACATAGTTTTAGAAAAGCGTTAGACGATAGGAATAACAATGGATCGGATTGGGGTCGGACTATACAAACTCCGAATCCGACCAGTTTAGTAATCAGATCACAAATTCACCCCCAATCCGGATCCGAGTCCGAAACCCCTACATCCAAATCCAATCCACTCGGATGGTCGGACCTCTGCTCCACATCCGGATCTGGTTAATAAGAAAAACAATTACCTGTTTTATGTCCTTCCATACAAAAATTTATAGGATTATACCAGCAACTCAACAAAATCATTTCATTACTTAGTCTGTACAAATTCATATTCACATTCAAAATTTAATTACAAATACACATCAGTTCATAAATCCGTATTTACATTTGATTGAAGTATCTTCGCATTGAACCCAGAAATTGATGAAAATTACGACTCCATGTTTAAGCATGGAAGGTATAAAGAGAAGGATATGTAAATCCTTATAAGAAATTAAGAGGAAATGTACTATGCAAGTACGCATTAGATCTTTAGGGGAGAGAGGAATATATGTGGTACAGGAGAGAGAAAAAGGAGAAGAGTGGAAGGAGTAACGACGTGGAACGAGTGTGAAAATAAGGGTTTGATAGTTATATGTATCCCACATATGTACCACAAGCCCAAAAAAACGCAGCATTTTTTTTATAACGTGTCGGACTTGGACTTAGGTCGGGTCTGTCGGGCCCTGAGTAGGATCCGAATTCGATCCAGCTCCCGTTCGGACTCGGATTCGGATTTTCTATTTTCAGATTCGAATCGAACTTTTTCCTTCGGATCGGGTCGGACTCGGATCGGATTGAATTATTATCCTTAGTTGGCGGTGGGGTTTTATTAATTATAATCGAATACAAAAAAATCAGGTAAATGGTATTGTTTATTGTTTTGTAATTGAGGTGATAGAAAGTGGGTATCATAATGCTTTAAGGGAGTAGAGAGGAGATGAATATAAAATGATAAAGGAGGAAAAGAAAAACAGTATTAGTGTATGTGGATCACCCATAATACTTCTTAGTACTAGTACGCACCTCTTGAACCGAAAATTCCTCTTAAAAGTACAATTAGAATCAACTTTCTATGAACTTAGAACCTACTCCATTAAAATAGTACAATGCTACAATTAGAATGTTAACTACATAACCAATGCTGCCTTGGCCTAGTGGTTAAGATTAAGGCGATAGGTCATAATTTCGAATTCCTCTCCTCCGTTTGTATATTTGTATCGCATTTGTGGCCCATTCGTAACATAACTACATAAAATAGCATTAGTCTCTAAAAACAAAATTTGATAACTTACTAATAATCTGATTTTAGAGATTTATCATGCTTCATAATAACAAACTATCAATGCTTAGATTTTAAAATCCATTTAAGGGAGTCATTAGAAGCCTTTAAGAGTTTAAGCCATTATTTCCAACCAAGTAGAATCTGATTTTTAGCTTCGGACGATCAAATTTAAACAGTTCTTCCATTGCACATATTTAAACACTAATATCTTCGAGTACATATTATTAAATTTTCAGTTAAATATTTTGAAACTACTCATGGCGAAGAATCTAATAGAACCTACACGAGTGTTTTCGTTTTAAATAAATTGATGAATACCCTTG
GTCAAAGATTCTCTTTCTTCACATAAGTACCATTTTTGTTTCTCTAAATCCAAAATTTTCTCTGCTCTTTCATATGTCAATCTCTATTCATGAATCATGATAACAATGTTGATTTGAAGAATTATGAAAAGGGGATTAAAAGATGATAAAAAAATCTTAAAAATCATAAAAGTAATAATCTGAAAATTAGGGCATAACATGTGTATATACTTTTTAAATTTTTTTGTACGAATTTAACATCCCAAGACCATACTAACAATGGAAAACAACATCAACAACAGTCAGCTGGCAGCAGCAGATACGTGCTTGTTGTTGTTGCTGTATCGGTGATGCCGGAAGGGAGAGAGAAAGATGAGGGGAAAAGCTCACATAAAAGATGGTCTATGACGATGGGCAGCAAGCAGTAGAAGGTGAAGGGGAAGGAAGATGGGCTGCAGCCAACAGAGAAAAGAACTAGACAGAGAAAAGAAAAAAGAAAGAGGAAAAAAAGAATGACGTGAACAAGAAAAGAAAGAGAGCACGGAAAAAAGGAGAAGAAAATGACTTGAAAAGGAAAATAAAAGAAAGAAGGTAGCGTTTGTATTTTAATTTCTGATTTGTGGTCATTAATTGACTAATTACAAACCCTTATTAAGGTCCGGTCAACGAAAGACAAACTCCAAAAACTCTTTTTTTTCCTCCTTAAAAACTTTAGTTCCTCCTTTCTAAAATACGGGGTAGGTAGAAATGAACTTAGATGATTCTCCTTGATGTTATAAAATAGTTTGTCTTGGTAAAATGTGAAACCTTAATAAATGTTTATGCCCTTGATAAACCAATAATTAAATAGTTCACTATATTGATATTTAAGATACAACATTTAATATATTTCCTTGAAATATATTAAATCAATATTAAAAATTGGAGTATTACATACTTGACTGGCCTATTAGCCATAAATACATACTAAAGCCGAACAAAACCCTGTCCAGTTTGTATCGGATCTTCCCCAACTTTTATGTTAATTCTACTGTGTAGTTGATTGACAATTAGTTGGACTTTGTGTAACGGTACAAATGTACAATAGAGTTGATTGATTGTGTTCTTGTCAATAAAGGAGGAGCTGACTAGGTTGAGGTTGAAAAATGTACACTAAGGCATAGAGAATTACACAAAAAATGGAAAAATGAGTGGCTCACCCATTGAAATAATAGGAAGGTATTATGCATTTACTATTTCTGTGTTGGTCAAGCCGAGTGGAAGCTGTCCAAGCTGCGAGTTAATAGATAAAACTGGTATTGCTTCCATCATAGTTAGACTATATGCTTCGTCACAGTTATTGGCTGTAAGTTAGGGTTCACACAGCGGCTTTATGATAACAGCTTCCAACTACAAATTCAATCTCATATGAGATGTGTCTAGAAAATATTTATCTTTTCCCCTGAACAATGACGATCAAGCTTTAAGGCTTCTTATTTATCGTGAAGCAAATATTCCTGCTCAGAACTGCATTGCACACAGTTACAGTCAAACTCTAATCCCAGCTAGTGAGCCTTTTAGCACGGCTAAACCTATTCTGGATGGCTAACCATGTGGTAGAGATGCTCTTTTACGCTTTGTCATATAAGTTCCTTCCAAGGACCTTTGTGAAAAACTCCCTTGACATGCCTTATATTAATGTTAACAAAACATCCTCCCTAATTGCTTTGGTTGCTATTGCTTGATTGTGATGAACTTCTGAGTTATATCCTTAATAGTGATTTTACTTCTTATTTTATTATCAAATTGATTCTGTGTGCTACCTTTACGACTTTCTTGTTTACGACTCTTCAGCTCTTACAAATGTTTGTCTGATCTTTTGTTCCACTTTTCCCTTCTTTATTCTTTCAGGCTCAAATCACAGCTTACGTCGAGTATTTTGGACAATTTATATCAGAACAATTCCCAGAAGACATTGCTGAGGTATTTCCCTGCAACTTTGCGAAGTAAAATGCCCTTATTCTTATACAGTATTGAATAAA
AAGATTTGTTTTCACCGGAACTCGCTAACTATTTCTTTTGCCTGTCACTTTCTTTTCTTTCTTTCCAGCTGATTCGGATTCGTTACCCATCTCATGAAAAACGTCTATTTGATGATGTCCTTGGTAAAAGTTTACCATTTTGATACTTTGTAGTTGATTTAAATGTTTTTGAAATGATCAATCCTTATTCTTTATTGGGGAAAGGCATTAACATACTGTTATGAATAATGTTCAGCCACTTTTGTCCTCCATCATCCAGAGCATGGGCATGCTGTTGTTCTTCCTATCATTTCTATTATTATTGATGGCACGCTAGTGTATGATAAGGATAAACCTCCATTTGCCTCTTTCATTTCCTTGTTCTGCCCAAATGACGAGGTATGTGCGCCATACTTTTTGATAAGCTTTTCTTGCATGTTTTTCTTTTGTGGAACTGTGTAATACTTCCTGCGACCTAACTATGGAACCTTTTCAGTTCCATAAATTTTAGGGGTGCAGAACTTATGGCATGTATTAATTGTTTTCATCTGTTAATGGGAAACGTAATCTCTTAGTTAAAGAAAACGTATATTCCTCACCTGTTTTGCAGAATGAGTATTCAGAGCAGTGGGCACTAGCTTGTGGGGAGATACTGCGAATTTTAACTCACTATAATCGCCCTATCTACAAAGTTGAGCGAAAGCAAAGTTCAGAAGATTGCAGCGGCAGTATGAATCATGCTACTTGCAGCAAAGCTTCTGGTGCCAAGCCATCTCATTCTTCTTCTGTGCAAAGTGAAAAAAGACCCTCAAGGCCTCTGTCACCTTGGATCACTGACATATTGCTCGCAGCACCTCTTGGTATTAAAAGCGATTACTTTAGATGGTATGTAATTAAGTTGATTCCGTTTCTCAGTCATAGAATTTATGTGCAGTCACGTGAATTTGTGCTTTCGATGTGGATGCCTATATTTGATGCATGTAGACATTGAGTTAGCAAAGTCACAAACTACTAACTATAATACCAGAGTTTAATGGCAGCTATGAGTTATGACCTTATCATCTTTACCTATAAAAATATAATTGCTCAAAAAAAAAAAAAATATTGTACAGTCGTAGCTGTTCTTCCCAGAATATCTTCTGAATAAGTAAAACCTTTTGAAAAATAGAAAAGTGATAGATATCTAGAATCATAGTAAGAACTCCGGCTATTAACAGAAGTCACCTTGTTACTTCTTTGTGGGCTATACGGCCTATACCTGGTTGGGGCTTTCCCAAAATTTTCCATAGGGGGGAAAAGAATTGGGTTAATTATCTTTGTGGTTATTTAGGTAACTATATGAGGACATCTTGTCCTCCTTTGTGCTTTGTTAGTTTGTTATGTGTAATTCTAAATTTCTAATAACCAAAGTTTCTGATTGAAAAGAGAATAAAGAAATAGATATTATGATTTCAGAAAACCTACCTGCAGAAATATTTGATGAGCAAATGGATTACTTTCAGGGTTGTGATTGGATCTAATTCTTTAAATTTCAAGTTTATTTGATTAGTTTCTTCTCTGACAAAGAAGTCTCTCTCTCTCTCTTGTGTGTGTGTGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGATGGTTGATTTGATGGAAATTATTGAATTAATAGCAAGCTACCTCAACTTGTTCCTGATGGTTCTGTTGCGTTGTTTCTATTACATTACCTGTACGTATGTTAATTAATCATGGTATGTTTGTTTGAAGTGTTTTAAAGTGATTGCCCTGCCATTTTATTTGTAAGGATGTATGTGTAATCCAACAATGTTTTGCAAAAAATGATAGGTGTGGTGGTGTAATGGGAAGGTACGCTGCTGG
AGAACTCAAACCGCCTATGAGTGGTAAGTCACGATTTCTGAAAAAGAGATTTCTTTGAAGATCTCACATTTTTTTTCCATCTTCTTCTTGTTCTATTGTGACTATTTAACTGCTCGTGATTGTTCACTGAACACATGTATAGCCTTTTGACTTTGTATAGTTTGTGACCCTATTAATATTGTTAAATAAAGCAAGTTGAAGCTCTGTCTCCCTTCCAACGCACAAGCCCATCATCCTCTCACACACATAGCTCAATCTCTTAAAGTGATGGAAGATTAGTTGATACCATTTCCAATTCAGAAAAGGAGAAAAAAAGGAAACTTATTATATTTTCTATTTTCCCCCATTTGAATGTCAATATCAGCCAAATAATGGCCAAAGTTAATTTAAATGTAATGTCCGTTGCATGGTTTCTCCTCAGCTAATGTTAGGAATTCATGCAATGAAGTGTGACCATTCTGACCAAGTTTAAACTGGATATCAGTGAGGTAGAAATCTTTTCCAAACAAGTTAGATATGGTTTTAGCTTTAAATTGAGCCGTTATAGTCTGTAACTCTTCTATAACTGTGAGAAAATTCCTTAGAGAACTTAAGAGTAAATTTCATCTACCTTTTCCTCCTTCAGCTTCTTCTCGTGGAAATGGGAAGCATCCTCAGTTAATGCCATCAACTCCACGATGGGCTGTTGCAAATGGTGCTGGTGTTATATTAAGTGTTTGTGACGAGGAAGTCACCCGTTACGAGACTGCTAGTTTAACAGCAGTTGCAGTGCCTGCCCTTCTGCTTCCTCCCCCAACGACAGCTTTGGATGAGCATCTAGTTGCTGGTTTACCTGCCCTTGAACCGTATGCACGATTATTTCACCGGTAAATTATTGACAGATGAACTCATTGTTTGATTCTTATTTAGTGCTTGATTCTTATTTAGTGCAATTTTCTTTCTATATTTGTCTGATTACTTTCTTTTTCGTGGAGGAAGGTATTATGCTATTGCTACACCAAGTGCCACTCAGAGATTGTTGCTTGGACTTCTTGAAGCACCGCCGTCATGGGCTCCGGATGCACTTGATGCTGCGGTACAACTTGTGGAACTTCTTCGGGCAGCTGAAGATTATGCAACTGGGATGAGAGTAAGATCAAGTTCCATATAATTGTACTACTACACATGATCTTTCTTCTCTTCCTTTTGATTCTGGTGATGCAATAGACTCCATACTGGTTTCATTAAATTTGTTGTTGGTGACTATTGATTATGCTTATGAGCTTATCCATAATGATACTTCTGCACTGGACATTTAACAAATAATTATATTTATAATGAATTTGGCTTAATATTTAGAGTTTTGCCTGACAAGTATAAAAAAGTATTAATATTTAATAGTAAGTGAGAATCAGCTCTTAAAGAAGCTAAAAGTATCACGTGAAGGGGTAGTTACATATTACAATTGCTTTGTAGTAAAATTGTAATCGGAACATCCAGATGGTCCTGCATCCTTCGTCTATAAGGACAATAGCTATGTAACTCGTCTCCAATGTTATGCTAGAAGATGAGAAGTTCCCTTAAAAATATTGGAATATTGACATCTTATTTTTTCTTAAGCCAACATAAGCATGTAGTTGTAGTTCTTCATCAATGATGAAGTATTTGATTCAAGTTGTCTAGCCCGTCAAACTAAGTGGACACAAAACATTACCGAACAGAAAGGAACGAGAAAAAGATGAGTCTCCAACATAAAATAAAATATTGGGAATTAAATCTCGGCTGCATCACCTCATATTTTCTTTTCAAGAAATTCCGAGTTAATCTTACTGATTGAGAAAACTTTCTAATCAAATGGCAATTTCGTAAATATTTTACTGGAAAATAATAGGTGGTATAATCTTAGCCGTTGTCCGTATATTTGTAGTTCTTGATTGTATAGTAATAGGTGTGATATAGTACTAGTATTACTAAAATGTAATACTTCCTCCGTTCCATAAATATCGCACCATGGTTGACTT
TTACTCCTCTAACACATTACTTTGACTTTTTAATTTCAAAAAAATAAAAATTTTATCAAATTGATATTGAGAAAATATGTATCGTATGGATCTAACATGACCCCACATGACTACAATTTTTTTAAGTATGAATCACAAAAAATTGCCGAAGTCGTAGTGTGAATAGTGTAAAAAACAAAATGCTGCAATATTTCCGGAACGGAGGAAGTATTGTTTATTTGCAAGTGAGTGATTGTATATGTATAGTTGTTTGTTGTATATTGCATAGTTGTTGATGTATTACGACAATGCAATGCTGGATGTACGTGTGACCCATCCGTTAATGTCATATGATTGTACTTGAGACCCACATTTAATGACAGTTTCGTAACAAAATCAACAATTATTTACGAAATGTCTTCTGGCTGGGAAAAAAATCCCAATTACTGAGATTAACTCGGAAGACCTCGAACTCTTTCTCCTGTATTGATGTATTAAAGTTCTATCAGTGGATTTTTTGTTTATTGAATTAGGTCCTTCATTCACCCTGAATCCAACCTGTTGCAATCCCTATTTAGCACATTATGCTGTAGGTTGTTTGCATGTGTTAGTAATGCACCAGCATCAGGTCATTACCCTCCGTGTTGGTCACTGTCTTTGTGTTTGTATCATACATAATTTGTTGGCCACAGAGGGTTCGTATGGAAAACCAAACCATTGTGTCCAGTAGTGCTTTAGCTGGTTCACATGTCATCAGTTGGAAAATTTTGTTGATCGTTAGTTTGGAGTATATGGAGCCTTCTAAATCTGTCAATGCTAATTCTTATCTACATATGCCTTAATTTTGTGCAGTTGCCAAGAAATTGGATGCATCTACATTTTCTGAGAGCTATTGGCATCGCAATGTCTATTAGAGAAGGTATTGCTGCCGATGCTGCAGCAGCGTTATTGTTCCGCATACTTTCTCAACCTGCATTGCTTTTTCCACCATTGAGACAAGTTGAGGGTGTCGACGTCCAGCAGGAACCTTTGGGAGGTTATATATCATGCTACAGAAAGCAGGTATTGTATTCATGTTTATAATAGGATGACCCGGATGAGAGGGAAGGGGAAGGAGAGACAAAATAATGTTCTCTACATTTGTTTTTTAAAAGAATTAGCTAATTGGTTGTTCTTCTTCCAAAATTTCATAAACTACAACTCTAGTCTTACCTTGAAGATATCGTCTGATTGTCTCAACTGGGACTTTGTCTTCAATTTTTTTTGATCAGATAGAAATGCCTGCTGCTGAAGCTACTATCGAAGCCACTGCCCAGGGAATTGCATCAATGCTTTGTGCCCATGGTCCAGAGGTGGAATGGAGAATTTGTACCATATGGGAAGCTGCTTATGGTTTGATACCGTTAAGTTCTTCAGCCGTCGATCTACCTGAAATAATAGTTGCTACTCCATTGCAGCCTCCCCTTTTATCATGGAACTTGTACATACCTCTCCTTAAAGTACTGGAGTACCTCCCACGAGGAAGCCCATCTGAAGCATGTCTTATGAAAATATTTGTGGCTACTGTTGAAGCAATTCTACGCAGAACATTTCCATTCGAGTCTTCTCCTGAAGAAAATCGAAGATCAAGATACAGTTCTGAAGCAGGCCCATCATCCAAAAACCTTGCTGTGGCTGAGCTCCGCACAATGGTTCATTCATTGTTCTTGGAATCATGTGTTTCTGCAGAGCTTGCTTCTCGATTGCTATTTGTGGTGCTAACTGTCTGTGTCAGTCATGAAGCTCAAACAAATGGTAAAAGTTCAAGGGCTGATGACAATATTGCTGATTGCACTACCAGAAACTCAGAATCTAGTCAATCTAGTCCCTCAAGATCAAATAATTCAAAGAGACAGAGAGAGCCTAGAAGTAAGAAATCCAAAAAACGAGGTCCTGTTGTGGCATTTGATTCATATGTGCTGGCTGCTGTTTGTGCTCTAGCCTGTGAACTTCAGCTGTTCCCTTTGATTGCAAATGTATCT
AATCCTTCAAGCACTAAAGATTTAGTTGTAAATGGAACATCTCATGACTTCAAGGATGGCCTTGACTCTGCAATCCGTCATACTCGTAGAATTTTGGCGATATTGGAAGCATTGTTTTCGCTGAAACCATCAACCATTGGCACAACATGGGGTTACAGTTCAAATGAAATAGTTGCTGCTGCTATGGTTGCTGCCCACATTTCTGAATTATTTAGGCGATCAAAGGCTTGCATGAATGCTCTTTCTGTCTTAATGAGGTGTAAATTGGATAATGAAATTCACACCAGGGCTTCTTCACTGTATAATCTAATTGATATCCACAGAAAAGCAGTTGCATCTATTGCTAACAAGGCAGAACCACTAGAAGCACATTTATCCCAAACCCCGGTGTGGAGGGGTCCACCAATTGTTGCCAATCGAAGAAAGCAAAATGATTGTGCACGCACTGTTTGTATTCAATCAGAAGTGCCATTAACACTGTGTGAAGATTCTTCCCATTCGAAGAATTTACTAAATTGTGGAAAGGCTGTATACACAAATAATGATGCTGGAAATATTGCTGGTAAAAGGGTTGCGAACATCCAATTCGATGCCTCAGATTTGGCTCATTTCCTTACGATGGACAGACATATAGGTTTAAATTGCAGTGTACAGATTTTTCTGCAATCAGTACTTGAAGAGAAACAAGAGCTTTGTTTCTCAGTTGTTTCACTCCTTTGGCACAAGTTGATTGCATCACCTGAAACACAACCTAGTGCAGAAAGCACCTCTGCTCAGCAAGGATGGAGACAGGTATGCATGTCCAAATTCTGCAATAAGTTTGTTTGCTCGAGTAAATAAAACTTGGTTTATTTTTGTCATGGAAACCTATTTTCTGCTGTTAATATCGGTTCGCATCTTACAGATTCTACCAATTATTACTTGCCTTCACTTTATCATCATCTCCCATATCTCCTTGTGTAACTGCCATTTACCTTCAAAAAGTTTGTTCCTTTGATGTGAATATGCGCTTTGTGACCCGCATAATCTTTTTTAGCAATACATTAATTAATAAAGTTTTCCAAATTAGTATGTAAGACTTCAAAATATCGACATACTCGTGATGATTTTCTACAGGATTGCCCTGCCGTCTCACATTTTTAATTTGTCGATACAGGTTGTTGATGCACTATGCAATGTTGTATCTGCATCACCCACTAAAGCAGCAGCAGGGATAGTTCTTCAGGTTCAATATTTACTCCGTTTCTATGCCGTCTTATATTTCCACATAGTACATATTTTTAATTTGTTCGAACAATTAGAGGGTGGACCGAACTTGCCAAAATTCTGTTGCCTGCAATTTAATTAGGCATATTTACTATCCTACTGAATATAATCAAATTATTGGTGCTAAAGATGTCTCAAAGTTTTGATCTTACATGATAATATGCACTTTGTTTAGTAACTTCTATTATTTTTGTTAGGCGGAAAGGGAATTACAGCCTTGGATTGCCAAAGATGATGATCAAGGCCAGAAAATGTGGAGGATTAACCAGCGAATCGTAAAGCTGATGGTGGAACTGATGAGAAATTACGATGCTCCCGAATCATTGGTCATACTGGCTAGTGCTTCAGATCTTCTACTGCGTGCGACAGATGGAATGCTTGTTGATGGAGAAGCTTGCACTCTGCCTCAGTTGGAGGTAACATATCGGATGCACACGTCGTTATTCTACATACCTGCTTACACTTGCTTATCTTTGTTTCTAATGAAAACACTTGCTTTATCTTGTTGTTTCTTTCAAGATCGTTATTGAAGCTATGTTTCATTTCCTCACTGCCGTATATGAGTCTTATTATCTTTCAGAAGTGTCTTTCTATGCCAGTAACATTATTAAACTTTCCAAGTGGTAGGGATATCTAAGAAATGCATACAAACAAAGAATTACGGGACTTACTTTGGAGACAATTGGGTGATGGTGGTTGTTACATAAAAGATATGCAAGCATTAGTATGATGAT
TGTAATTGCATACTTTTAAATTTTAATGGTGGCTTACATCTGATCCCAAGAGCTGGATAAATTTTGATTTTCTATGTTACATTTTTAACAGCTGCTAGAAGCAACTGCCAGAGCAGTGCAATCAGTGCTGAAATGGGGAGAATCTGGGTTAGCTGTTGCAGATGGTCTCTCCAGTCTTCTAAAGGTACAAATTTTACCTGGAAGCATATGAGGAACCAGATCAAAAGACTGGCATAAAGAATTAAACGATTTCAGTTATGACAAATTCCTGGAAATTAAATTGCTACCATATTTGAATAGGCTCACATCCTTACCCCCTAGGAGAGAGAAAGAGGAGATTTCAGTGTTCTATTTTGACTAATTTTGTTTGCTCGAATTTTCAGTGCCGCCTACCAGCTACTATTACATGCCTCTCTCATCCAAGTGCACATGTCCGTGCTTTAAGTACCTCTGTTCTCCGAGACATACAGCAGAATGGATCAGTACAATTCAAATTTAAACAGGAAAATAGAAATGTCACCCACGAATCACCCTTCCAATACTTGCATTTAGGCATCATTGACTGGCACACAGATATTGAAAAGTGTCTGACATGGGAAGCTCATAGTCGATTAGCAAGAGGAAAGACTATCGAGTATCTTGATATGGCGGCTAAGGAGCTAGGGTGTGCAATAACCATCTGACATCCTTTTTGATACATTCCGAGTTCTTATGCTGTTATTGGTTTGTGAAAACTGAGGGAAATTAATCCTGACAGTCATCCAGGCAATTGATTCTTAGGAGGTGGATAGATAGTCTAGCACAGTGATGTAAAGCTCGTTTTATTTGTAGTTGAATGTGCTAGTAATAGTATATCCAAGTAAAATGATACATGTATTGTGAGACAGGGATTGGCATACTGAGTGACGACAAACAAGTAGATGATCATAGTCTCAAGTCTCAACCAATACCTGGTGAGTGACCACAGACAAGTAGATGATCATAGTCTCAAGTCTCAACCAATACCTGCATCGCCCCGGATACTATTTGGCTGTAGGTGACTTGTGGCCTTGGTATTAAGGAGAAAACTAGGAGCTACGGGGGATTATTTCCTTTAACTGTCTGAGCTGGATTACTTTGACTTGTTGCCTTGGATTGTTTCCTCGGTCCTCATGTCTCCCTTCATTGTAAAATATGGCCATGAAAAGAGAATCCAAGAACCATTTGTAGCAGGGTACAACTCGTACAAGTGGTAACTAGTTGTATAAATATAGTTAATATGGTGTTCTGACTATTTTTTCCCTCCTTTTTTTAGTAGTACATGAACATATTGTATGTAGTTGAATCAAGTACCAAGTACTGTGATGTCTCATATTTTATTTAAACATCCAATTTTATTTTAACACCAAATTCAAGCACTTAATTATATGATATGCAAAACTAGGTATTCCATGGC
mRNA sequence
ATGCCGTTACAGCCTTCTCCGCCACCACTGGCGAGAACCATAAATGCCGACGGATTGTCGGAAGAGTTCTTGCCGCCAACAACAGTATCATCGGATACTTTTTTTCATCTTGAACAGAATTTTTGGCCCCAACTAATAAGAGAAATGGGGACCGTAGATTATATGACACGTCCATCGGAAAGGTGGATTGATGGCCTTCAGTTCTCCTCATTGTTTTGGCCGCCGCCTCAAGAAACACAACAACGGAAGGCTCAAATCACAGCTTACGTCGAGTATTTTGGACAATTTATATCAGAACAATTCCCAGAAGACATTGCTGAGCTGATTCGGATTCGTTACCCATCTCATGAAAAACGTCTATTTGATGATGTCCTTGCCACTTTTGTCCTCCATCATCCAGAGCATGGGCATGCTGTTGTTCTTCCTATCATTTCTATTATTATTGATGGCACGCTAGTGTATGATAAGGATAAACCTCCATTTGCCTCTTTCATTTCCTTGTTCTGCCCAAATGACGAGAATGAGTATTCAGAGCAGTGGGCACTAGCTTGTGGGGAGATACTGCGAATTTTAACTCACTATAATCGCCCTATCTACAAAGTTGAGCGAAAGCAAAGTTCAGAAGATTGCAGCGGCAGTATGAATCATGCTACTTGCAGCAAAGCTTCTGGTGCCAAGCCATCTCATTCTTCTTCTGTGCAAAGTGAAAAAAGACCCTCAAGGCCTCTGTCACCTTGGATCACTGACATATTGCTCGCAGCACCTCTTGGTATTAAAAGCGATTACTTTAGATGGTGTGGTGGTGTAATGGGAAGGTACGCTGCTGGAGAACTCAAACCGCCTATGAGTGCTTCTTCTCGTGGAAATGGGAAGCATCCTCAGTTAATGCCATCAACTCCACGATGGGCTGTTGCAAATGGTGCTGGTGTTATATTAAGTGTTTGTGACGAGGAAGTCACCCGTTACGAGACTGCTAGTTTAACAGCAGTTGCAGTGCCTGCCCTTCTGCTTCCTCCCCCAACGACAGCTTTGGATGAGCATCTAGTTGCTGGTTTACCTGCCCTTGAACCGTATTATGCTATTGCTACACCAAGTGCCACTCAGAGATTGTTGCTTGGACTTCTTGAAGCACCGCCGTCATGGGCTCCGGATGCACTTGATGCTGCGGTACAACTTGTGGAACTTCTTCGGGCAGCTGAAGATTATGCAACTGGGATGAGATTGCCAAGAAATTGGATGCATCTACATTTTCTGAGAGCTATTGGCATCGCAATGTCTATTAGAGAAGGTATTGCTGCCGATGCTGCAGCAGCGTTATTGTTCCGCATACTTTCTCAACCTGCATTGCTTTTTCCACCATTGAGACAAGTTGAGGGTGTCGACGTCCAGCAGGAACCTTTGGGAGGTTATATATCATGCTACAGAAAGCAGATAGAAATGCCTGCTGCTGAAGCTACTATCGAAGCCACTGCCCAGGGAATTGCATCAATGCTTTGTGCCCATGGTCCAGAGGTGGAATGGAGAATTTGTACCATATGGGAAGCTGCTTATGGTTTGATACCGTTAAGTTCTTCAGCCGTCGATCTACCTGAAATAATAGTTGCTACTCCATTGCAGCCTCCCCTTTTATCATGGAACTTGTACATACCTCTCCTTAAAGTACTGGAGTACCTCCCACGAGGAAGCCCATCTGAAGCATGTCTTATGAAAATATTTGTGGCTACTGTTGAAGCAATTCTACGCAGAACATTTCCATTCGAGTCTTCTCCTGAAGAAAATCGAAGATCAAGATACAGTTCTGAAGCAGGCCCATCATCCAAAAACCTTGCTGTGGCTGAGCTCCGCACAATGGTTCATTCATTGTTCTTGGAATCATGTGTTTCTGCAGAGCTTGCTTCTCGATTGCTATTTGTGGTGCTAACTGTCTGTGTCAGTCATGAAGCTCAAACAAATGGTAAAAGTTCAAGGGCTGATGACAATATTGCTGATTGCACTACCAG
AAACTCAGAATCTAGTCAATCTAGTCCCTCAAGATCAAATAATTCAAAGAGACAGAGAGAGCCTAGAAGTAAGAAATCCAAAAAACGAGGTCCTGTTGTGGCATTTGATTCATATGTGCTGGCTGCTGTTTGTGCTCTAGCCTGTGAACTTCAGCTGTTCCCTTTGATTGCAAATGTATCTAATCCTTCAAGCACTAAAGATTTAGTTGTAAATGGAACATCTCATGACTTCAAGGATGGCCTTGACTCTGCAATCCGTCATACTCGTAGAATTTTGGCGATATTGGAAGCATTGTTTTCGCTGAAACCATCAACCATTGGCACAACATGGGGTTACAGTTCAAATGAAATAGTTGCTGCTGCTATGGTTGCTGCCCACATTTCTGAATTATTTAGGCGATCAAAGGCTTGCATGAATGCTCTTTCTGTCTTAATGAGGTGTAAATTGGATAATGAAATTCACACCAGGGCTTCTTCACTGTATAATCTAATTGATATCCACAGAAAAGCAGTTGCATCTATTGCTAACAAGGCAGAACCACTAGAAGCACATTTATCCCAAACCCCGGTGTGGAGGGGTCCACCAATTGTTGCCAATCGAAGAAAGCAAAATGATTGTGCACGCACTGTTTGTATTCAATCAGAAGTGCCATTAACACTGTGTGAAGATTCTTCCCATTCGAAGAATTTACTAAATTGTGGAAAGGCTGTATACACAAATAATGATGCTGGAAATATTGCTGGTAAAAGGGTTGCGAACATCCAATTCGATGCCTCAGATTTGGCTCATTTCCTTACGATGGACAGACATATAGGTTTAAATTGCAGTGTACAGATTTTTCTGCAATCAGTACTTGAAGAGAAACAAGAGCTTTGTTTCTCAGTTGTTTCACTCCTTTGGCACAAGTTGATTGCATCACCTGAAACACAACCTAGTGCAGAAAGCACCTCTGCTCAGCAAGGATGGAGACAGGTTGTTGATGCACTATGCAATGTTGTATCTGCATCACCCACTAAAGCAGCAGCAGGGATAGTTCTTCAGGCGGAAAGGGAATTACAGCCTTGGATTGCCAAAGATGATGATCAAGGCCAGAAAATGTGGAGGATTAACCAGCGAATCGTAAAGCTGATGGTGGAACTGATGAGAAATTACGATGCTCCCGAATCATTGGTCATACTGGCTAGTGCTTCAGATCTTCTACTGCGTGCGACAGATGGAATGCTTGTTGATGGAGAAGCTTGCACTCTGCCTCAGTTGGAGCTGCTAGAAGCAACTGCCAGAGCAGTGCAATCAGTGCTGAAATGGGGAGAATCTGGGTTAGCTGTTGCAGATGGTCTCTCCAGTCTTCTAAAGTGCCGCCTACCAGCTACTATTACATGCCTCTCTCATCCAAGTGCACATGTCCGTGCTTTAAGTACCTCTGTTCTCCGAGACATACAGCAGAATGGATCAGTACAATTCAAATTTAAACAGGAAAATAGAAATGTCACCCACGAATCACCCTTCCAATACTTGCATTTAGGCATCATTGACTGGCACACAGATATTGAAAAGTGTCTGACATGGGAAGCTCATAGTCGATTAGCAAGAGGAAAGACTATCGAGTATCTTGATATGGCGGCTAAGGAGCTAGGGTGTGCAATAACCATCTGACATCCTTTTTGATACATTCCGAGTTCTTATGCTGTTATTGGTTTGTGAAAACTGAGGGAAATTAATCCTGACAGTCATCCAGGCAATTGATTCTTAGGAGGTGGATAGATAGTCTAGCACAGTGATGTAAAGCTCGTTTTATTTGTAGTTGAATGTGCTAGTAATAGTATATCCAAGTAAAATGATACATGTATTGTGAGACAGGGATTGGCATACTGAGTGACGACAAACAAGTAGATGATCATAGTCTCAAGTCTCAACCAATACCTGGTGAGTGACCACAGACAAGTAGATGATCATAGTCTCAAGTCTCAACCAATACCTGCATCGCCCCGGATACTATTTG
GCTGTAGGTGACTTGTGGCCTTGGTATTAAGGAGAAAACTAGGAGCTACGGGGGATTATTTCCTTTAACTGTCTGAGCTGGATTACTTTGACTTGTTGCCTTGGATTGTTTCCTCGGTCCTCATGTCTCCCTTCATTGTAAAATATGGCCATGAAAAGAGAATCCAAGAACCATTTGTAGCAGGGTACAACTCGTACAAGTGGTAACTAGTTGTATAAATATAGTTAATATGGTGTTCTGACTATTTTTTCCCTCCTTTTTTTAGTAGTACATGAACATATTGTATGTAGTTGAATCAAGTACCAAGTACTGTGATGTCTCATATTTTATTTAAACATCCAATTTTATTTTAACACCAAATTCAAGCACTTAATTATATGATATGCAAAACTAGGTATTCCATGGC
Coding sequence (CDS)
ATGCCGTTACAGCCTTCTCCGCCACCACTGGCGAGAACCATAAATGCCGACGGATTGTCGGAAGAGTTCTTGCCGCCAACAACAGTATCATCGGATACTTTTTTTCATCTTGAACAGAATTTTTGGCCCCAACTAATAAGAGAAATGGGGACCGTAGATTATATGACACGTCCATCGGAAAGGTGGATTGATGGCCTTCAGTTCTCCTCATTGTTTTGGCCGCCGCCTCAAGAAACACAACAACGGAAGGCTCAAATCACAGCTTACGTCGAGTATTTTGGACAATTTATATCAGAACAATTCCCAGAAGACATTGCTGAGCTGATTCGGATTCGTTACCCATCTCATGAAAAACGTCTATTTGATGATGTCCTTGCCACTTTTGTCCTCCATCATCCAGAGCATGGGCATGCTGTTGTTCTTCCTATCATTTCTATTATTATTGATGGCACGCTAGTGTATGATAAGGATAAACCTCCATTTGCCTCTTTCATTTCCTTGTTCTGCCCAAATGACGAGAATGAGTATTCAGAGCAGTGGGCACTAGCTTGTGGGGAGATACTGCGAATTTTAACTCACTATAATCGCCCTATCTACAAAGTTGAGCGAAAGCAAAGTTCAGAAGATTGCAGCGGCAGTATGAATCATGCTACTTGCAGCAAAGCTTCTGGTGCCAAGCCATCTCATTCTTCTTCTGTGCAAAGTGAAAAAAGACCCTCAAGGCCTCTGTCACCTTGGATCACTGACATATTGCTCGCAGCACCTCTTGGTATTAAAAGCGATTACTTTAGATGGTGTGGTGGTGTAATGGGAAGGTACGCTGCTGGAGAACTCAAACCGCCTATGAGTGCTTCTTCTCGTGGAAATGGGAAGCATCCTCAGTTAATGCCATCAACTCCACGATGGGCTGTTGCAAATGGTGCTGGTGTTATATTAAGTGTTTGTGACGAGGAAGTCACCCGTTACGAGACTGCTAGTTTAACAGCAGTTGCAGTGCCTGCCCTTCTGCTTCCTCCCCCAACGACAGCTTTGGATGAGCATCTAGTTGCTGGTTTACCTGCCCTTGAACCGTATTATGCTATTGCTACACCAAGTGCCACTCAGAGATTGTTGCTTGGACTTCTTGAAGCACCGCCGTCATGGGCTCCGGATGCACTTGATGCTGCGGTACAACTTGTGGAACTTCTTCGGGCAGCTGAAGATTATGCAACTGGGATGAGATTGCCAAGAAATTGGATGCATCTACATTTTCTGAGAGCTATTGGCATCGCAATGTCTATTAGAGAAGGTATTGCTGCCGATGCTGCAGCAGCGTTATTGTTCCGCATACTTTCTCAACCTGCATTGCTTTTTCCACCATTGAGACAAGTTGAGGGTGTCGACGTCCAGCAGGAACCTTTGGGAGGTTATATATCATGCTACAGAAAGCAGATAGAAATGCCTGCTGCTGAAGCTACTATCGAAGCCACTGCCCAGGGAATTGCATCAATGCTTTGTGCCCATGGTCCAGAGGTGGAATGGAGAATTTGTACCATATGGGAAGCTGCTTATGGTTTGATACCGTTAAGTTCTTCAGCCGTCGATCTACCTGAAATAATAGTTGCTACTCCATTGCAGCCTCCCCTTTTATCATGGAACTTGTACATACCTCTCCTTAAAGTACTGGAGTACCTCCCACGAGGAAGCCCATCTGAAGCATGTCTTATGAAAATATTTGTGGCTACTGTTGAAGCAATTCTACGCAGAACATTTCCATTCGAGTCTTCTCCTGAAGAAAATCGAAGATCAAGATACAGTTCTGAAGCAGGCCCATCATCCAAAAACCTTGCTGTGGCTGAGCTCCGCACAATGGTTCATTCATTGTTCTTGGAATCATGTGTTTCTGCAGAGCTTGCTTCTCGATTGCTATTTGTGGTGCTAACTGTCTGTGTCAGTCATGAAGCTCAAACAAATGGTAAAAGTTCAAGGGCTGATGACAATATTGCTGATTGCACTACCAG
AAACTCAGAATCTAGTCAATCTAGTCCCTCAAGATCAAATAATTCAAAGAGACAGAGAGAGCCTAGAAGTAAGAAATCCAAAAAACGAGGTCCTGTTGTGGCATTTGATTCATATGTGCTGGCTGCTGTTTGTGCTCTAGCCTGTGAACTTCAGCTGTTCCCTTTGATTGCAAATGTATCTAATCCTTCAAGCACTAAAGATTTAGTTGTAAATGGAACATCTCATGACTTCAAGGATGGCCTTGACTCTGCAATCCGTCATACTCGTAGAATTTTGGCGATATTGGAAGCATTGTTTTCGCTGAAACCATCAACCATTGGCACAACATGGGGTTACAGTTCAAATGAAATAGTTGCTGCTGCTATGGTTGCTGCCCACATTTCTGAATTATTTAGGCGATCAAAGGCTTGCATGAATGCTCTTTCTGTCTTAATGAGGTGTAAATTGGATAATGAAATTCACACCAGGGCTTCTTCACTGTATAATCTAATTGATATCCACAGAAAAGCAGTTGCATCTATTGCTAACAAGGCAGAACCACTAGAAGCACATTTATCCCAAACCCCGGTGTGGAGGGGTCCACCAATTGTTGCCAATCGAAGAAAGCAAAATGATTGTGCACGCACTGTTTGTATTCAATCAGAAGTGCCATTAACACTGTGTGAAGATTCTTCCCATTCGAAGAATTTACTAAATTGTGGAAAGGCTGTATACACAAATAATGATGCTGGAAATATTGCTGGTAAAAGGGTTGCGAACATCCAATTCGATGCCTCAGATTTGGCTCATTTCCTTACGATGGACAGACATATAGGTTTAAATTGCAGTGTACAGATTTTTCTGCAATCAGTACTTGAAGAGAAACAAGAGCTTTGTTTCTCAGTTGTTTCACTCCTTTGGCACAAGTTGATTGCATCACCTGAAACACAACCTAGTGCAGAAAGCACCTCTGCTCAGCAAGGATGGAGACAGGTTGTTGATGCACTATGCAATGTTGTATCTGCATCACCCACTAAAGCAGCAGCAGGGATAGTTCTTCAGGCGGAAAGGGAATTACAGCCTTGGATTGCCAAAGATGATGATCAAGGCCAGAAAATGTGGAGGATTAACCAGCGAATCGTAAAGCTGATGGTGGAACTGATGAGAAATTACGATGCTCCCGAATCATTGGTCATACTGGCTAGTGCTTCAGATCTTCTACTGCGTGCGACAGATGGAATGCTTGTTGATGGAGAAGCTTGCACTCTGCCTCAGTTGGAGCTGCTAGAAGCAACTGCCAGAGCAGTGCAATCAGTGCTGAAATGGGGAGAATCTGGGTTAGCTGTTGCAGATGGTCTCTCCAGTCTTCTAAAGTGCCGCCTACCAGCTACTATTACATGCCTCTCTCATCCAAGTGCACATGTCCGTGCTTTAAGTACCTCTGTTCTCCGAGACATACAGCAGAATGGATCAGTACAATTCAAATTTAAACAGGAAAATAGAAATGTCACCCACGAATCACCCTTCCAATACTTGCATTTAGGCATCATTGACTGGCACACAGATATTGAAAAGTGTCTGACATGGGAAGCTCATAGTCGATTAGCAAGAGGAAAGACTATCGAGTATCTTGATATGGCGGCTAAGGAGCTAGGGTGTGCAATAACCATCTGA
Protein sequence
MPLQPSPPPLARTINADGLSEEFLPPTTVSSDTFFHLEQNFWPQLIREMGTVDYMTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYPSHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDENEYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPSHSSSVQSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHPQLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLPALEPYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMRLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEPLGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAVDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPFESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSHEAQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSYVLAAVCALACELQLFPLIANVSNPSSTKDLVVNGTSHDFKDGLDSAIRHTRRILAILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIHTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQSEVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGLNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVVSASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVILASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCRLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI
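The relationship between the coding sequence and the protein sequence listed above can be checked with a short script. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the database that produced this page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered by the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing ambiguous bases (e.g. N) are reported as 'X'.
    Trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGCCGTTACAGCCTTCTCCGCCACCACTG"))  # MPLQPSPPPL
print(translate("GCAATAACCATCTGA"))                 # AITI
```

The two fragments reproduce the first ten and last four residues of the protein sequence; passing the full CDS to `translate` would be expected to give the complete polypeptide under the same assumption.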
Homology
BLAST of Spo04360.1 vs. NCBI nr
Match:
gi|902233471|gb|KNA23068.1| (hypothetical protein SOVF_027320 [Spinacia oleracea])
HSP 1 Score: 2291.2 bits (5936), Expect = 0.000e+0
Identity = 1168/1176 (99.32%), Positives = 1168/1176 (99.32%), Query Frame = 1
Query: 49 MGTVDYMTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAEL 108
MGTVDYMTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAEL
Sbjct: 1 MGTVDYMTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAEL 60
Query: 109 IRIRYPSHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLF 168
IRIRYPSHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLF
Sbjct: 61 IRIRYPSHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLF 120
Query: 169 CPNDENEYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS 228
CPNDENEYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS
Sbjct: 121 CPNDENEYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS 180
Query: 229 HSSSVQSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRG 288
HSSSVQSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRG
Sbjct: 181 HSSSVQSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRG 240
Query: 289 NGKHPQLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHL 348
NGKHPQLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHL
Sbjct: 241 NGKHPQLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHL 300
Query: 349 VAGLPALEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAED 408
VAGLPALEP YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAED
Sbjct: 301 VAGLPALEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAED 360
Query: 409 YATGMRLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVD 468
YATGMRLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVD
Sbjct: 361 YATGMRLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVD 420
Query: 469 VQQEPLGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIP 528
VQQEPLGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIP
Sbjct: 421 VQQEPLGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIP 480
Query: 529 LSSSAVDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILR 588
LSSSAVDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILR
Sbjct: 481 LSSSAVDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILR 540
Query: 589 RTFPFESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLT 648
RTFPFESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLT
Sbjct: 541 RTFPFESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLT 600
Query: 649 VCVSHEAQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVA 708
VCVSHEAQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVA
Sbjct: 601 VCVSHEAQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVA 660
Query: 709 FDSYVLAAVCALACELQLFPLIANVSNPSSTKDLVVNGTSHDFKDGLDSAIRHTRRILAI 768
FDSYVLAAVCALACELQLFPLIANVSNPSSTKDLVVNGTSHDFKDGLDSAIRHTRRILAI
Sbjct: 661 FDSYVLAAVCALACELQLFPLIANVSNPSSTKDLVVNGTSHDFKDGLDSAIRHTRRILAI 720
Query: 769 LEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIH 828
LEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIH
Sbjct: 721 LEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIH 780
Query: 829 TRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQS 888
TRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQS
Sbjct: 781 TRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQS 840
Query: 889 EVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGLN 948
EVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGLN
Sbjct: 841 EVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGLN 900
Query: 949 CSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVVS 1008
CSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVVS
Sbjct: 901 CSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVVS 960
Query: 1009 ASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVILA 1068
ASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVILA
Sbjct: 961 ASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVILA 1020
Query: 1069 SASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCRL 1128
SASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCRL
Sbjct: 1021 SASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCRL 1080
Query: 1129 PATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDWHT 1188
PATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDWHT
Sbjct: 1081 PATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDWHT 1140
Query: 1189 DIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
DIEKCLTWEAHSRLARGKTIEYLDMAAKELGC ITI
Sbjct: 1141 DIEKCLTWEAHSRLARGKTIEYLDMAAKELGCEITI 1176
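The identity figure in the HSP header is the number of identical aligned positions divided by the alignment length. A minimal sketch of that arithmetic follows; the function names are hypothetical and this is not BLAST's own code, only an illustration of how the reported percentage relates to the counts.

```python
def percent_identity(identical: int, aligned_length: int) -> float:
    """Return identity as a percentage, rounded to two decimals."""
    return round(100.0 * identical / aligned_length, 2)


def count_identities(query: str, subject: str) -> tuple[int, int]:
    """Count identical columns in one pair of aligned rows.

    Both rows must have the same length; gap characters ('-') never
    count as identical. Returns (identical, aligned_length).
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return identical, len(query)


# Counts taken from the HSP header above.
print(percent_identity(1168, 1176))  # 99.32

# Last alignment block above: one mismatch (A vs E) in 36 columns.
print(count_identities(
    "DIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI",
    "DIEKCLTWEAHSRLARGKTIEYLDMAAKELGCEITI",
))  # (35, 36)
```

Summing the per-block counts over all rows of an HSP and passing the totals to `percent_identity` gives the header value.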
BLAST of Spo04360.1 vs. NCBI nr
Match:
gi|731313653|ref|XP_010681263.1| (PREDICTED: protein GIGANTEA [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 2053.9 bits (5320), Expect = 0.000e+0
Identity = 1051/1178 (89.22%), Positives = 1098/1178 (93.21%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYP 114
M RPSERWIDGLQFSSLFWPPPQETQQRKAQ TAYVEYFGQFISEQFPED+AELIR RYP
Sbjct: 1 MERPSERWIDGLQFSSLFWPPPQETQQRKAQTTAYVEYFGQFISEQFPEDLAELIRSRYP 60
Query: 115 SHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDEN 174
EKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDK++PPFASFISLFCPNDEN
Sbjct: 61 FDEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKERPPFASFISLFCPNDEN 120
Query: 175 EYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPSHSSSVQ 234
EYSEQWALACGEILRILTHYNRPIYK ER++S E+C+ ++ATCS AS AK SHSSSVQ
Sbjct: 121 EYSEQWALACGEILRILTHYNRPIYKHERQKSPENCTSCKDYATCSNASDAKSSHSSSVQ 180
Query: 235 SEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHPQ 294
SE+R SRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPM ASSRGNGKHPQ
Sbjct: 181 SERRTSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMIASSRGNGKHPQ 240
Query: 295 LMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLPA 354
LMPSTPRWAVANGAGVILSVCDEEV RYETASLTAVAVPALLLPPPTT+LDEHLVAGLPA
Sbjct: 241 LMPSTPRWAVANGAGVILSVCDEEVARYETASLTAVAVPALLLPPPTTSLDEHLVAGLPA 300
Query: 355 LEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMR 414
LEP YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMR
Sbjct: 301 LEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMR 360
Query: 415 LPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEPL 474
LPRNWMHLHFLRAIGIAMS+REGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQ EPL
Sbjct: 361 LPRNWMHLHFLRAIGIAMSMREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQHEPL 420
Query: 475 GGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV 534
GGYISCYRKQIEMP+AEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV
Sbjct: 421 GGYISCYRKQIEMPSAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV 480
Query: 535 DLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPFE 594
DLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVE++L+RTFP E
Sbjct: 481 DLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVESVLQRTFPLE 540
Query: 595 SSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSHE 654
SS EENRRSRYSSEAG +SKNLAVAELRTMVHSLFLESC S ELASRLLFVVLTVCVSHE
Sbjct: 541 SSMEENRRSRYSSEAGAASKNLAVAELRTMVHSLFLESCASEELASRLLFVVLTVCVSHE 600
Query: 655 AQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSYVL 714
A TNGK SR +D+ +D ++S+S+QSSPS S SK+Q+E RSKK KK+GPVVAFDSYVL
Sbjct: 601 AHTNGKRSRVEDSYSDRAAKHSKSTQSSPSSSTKSKKQKESRSKKPKKQGPVVAFDSYVL 660
Query: 715 AAVCALACELQLFPLIANVSNPSSTKDLV-------VNGTSHDFKDGLDSAIRHTRRILA 774
AAVCALACELQLFPLIA +SNPSS+KD V +NG+SH+FKDG+DSAIRHTRRILA
Sbjct: 661 AAVCALACELQLFPLIAGISNPSSSKDSVEIAKPVKLNGSSHEFKDGIDSAIRHTRRILA 720
Query: 775 ILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEI 834
ILEALFSLKPSTIGT+WGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCK DNEI
Sbjct: 721 ILEALFSLKPSTIGTSWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKWDNEI 780
Query: 835 HTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQ 894
H+RASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWR IV N RK ND A TVC
Sbjct: 781 HSRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRDSSIVTNGRKHNDFAGTVCFL 840
Query: 895 SEVPLTL-CEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIG 954
EVP TL CED +HSKN LNCGKAV+TNND GN AGK VA+ QFDASDLA FLTMDRHIG
Sbjct: 841 PEVPSTLTCEDPAHSKNSLNCGKAVHTNNDTGNTAGKSVASFQFDASDLAQFLTMDRHIG 900
Query: 955 LNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNV 1014
NCSVQIFLQSVLEEKQELCFSVVSLLW KLIASPETQPSAESTSAQQGWRQVVDALCNV
Sbjct: 901 FNCSVQIFLQSVLEEKQELCFSVVSLLWQKLIASPETQPSAESTSAQQGWRQVVDALCNV 960
Query: 1015 VSASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVI 1074
VSASP KAAAGIVLQAERELQPWIAKDDDQGQKMW+INQRIVKLMVELMRNYD PESLVI
Sbjct: 961 VSASPAKAAAGIVLQAERELQPWIAKDDDQGQKMWKINQRIVKLMVELMRNYDTPESLVI 1020
Query: 1075 LASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKC 1134
LASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQ VLKWGESGLAVADGLS+LLKC
Sbjct: 1021 LASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQPVLKWGESGLAVADGLSNLLKC 1080
Query: 1135 RLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDW 1194
RLPATITCLSHPSAHVRALSTSVLRDIQQNGS++F FKQE+RN TH++ F+YLH+GIIDW
Sbjct: 1081 RLPATITCLSHPSAHVRALSTSVLRDIQQNGSIKFSFKQESRNGTHKTAFEYLHIGIIDW 1140
Query: 1195 HTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
HTDIEKCLTWEAHSRL RGKTIEYLDMAAKELGCAITI
Sbjct: 1141 HTDIEKCLTWEAHSRLTRGKTIEYLDMAAKELGCAITI 1178
BLAST of Spo04360.1 vs. NCBI nr
Match:
gi|225470820|ref|XP_002264755.1| (PREDICTED: protein GIGANTEA [Vitis vinifera])
HSP 1 Score: 1765.7 bits (4572), Expect = 0.000e+0
Identity = 919/1180 (77.88%), Positives = 1012/1180 (85.76%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYP 114
M ERWIDGLQFSSLFWPPPQ+ QQRKAQITAYV+YFGQF SEQFPEDIAELIR RYP
Sbjct: 1 MASSCERWIDGLQFSSLFWPPPQDVQQRKAQITAYVDYFGQFTSEQFPEDIAELIRSRYP 60
Query: 115 SHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDEN 174
S E+RLFDDVLATFVLHHPEHGHAVVLPIIS IIDGTLVYD+ PPFASFISL CP+ EN
Sbjct: 61 SKEQRLFDDVLATFVLHHPEHGHAVVLPIISCIIDGTLVYDRCTPPFASFISLVCPSSEN 120
Query: 175 EYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPSHSSSVQ 234
EYSEQWALACGEILRILTHYNRPIYKVE + S D S S HAT S + K S +Q
Sbjct: 121 EYSEQWALACGEILRILTHYNRPIYKVEHQSSEADRSSSGRHATTSDSVDGKSSQGPLLQ 180
Query: 235 SEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHPQ 294
+E++PSRPLSPWITDILLAAPLGI+SDYFRWCGGVMG+YAAGELKPP +AS+RG+GKHPQ
Sbjct: 181 NERKPSRPLSPWITDILLAAPLGIRSDYFRWCGGVMGKYAAGELKPPSTASTRGSGKHPQ 240
Query: 295 LMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLPA 354
L+PSTPRWAVANGAGVILSVCDEEV RYETA+LTA AVPALLLPPPTTALDEHLVAGLPA
Sbjct: 241 LIPSTPRWAVANGAGVILSVCDEEVARYETATLTAAAVPALLLPPPTTALDEHLVAGLPA 300
Query: 355 LEPY-------YAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMR 414
LEPY YAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYA+GMR
Sbjct: 301 LEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYASGMR 360
Query: 415 LPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEPL 474
LPRNWMHLHFLRAIG AMS+R GIAADAAAALLFR+LSQPALLFPPLRQVEG + Q EPL
Sbjct: 361 LPRNWMHLHFLRAIGTAMSMRAGIAADAAAALLFRVLSQPALLFPPLRQVEGFEFQHEPL 420
Query: 475 GGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV 534
GYIS Y+KQIE+PA EATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV
Sbjct: 421 DGYISSYKKQIEVPATEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV 480
Query: 535 DLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPFE 594
DLPEIIVATPLQPP+LSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVE+IL+RTFP E
Sbjct: 481 DLPEIIVATPLQPPILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVESILQRTFPAE 540
Query: 595 SSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSHE 654
SS E R++RY G +SKNLAVAELRTMVH+LFLESC S ELASRLLFVVLTVCVSHE
Sbjct: 541 SSRENIRKTRYLFGIGSASKNLAVAELRTMVHALFLESCASVELASRLLFVVLTVCVSHE 600
Query: 655 A-QTNG-KSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSY 714
A Q NG K R +D+ S + + S+ S QR+ +++K KK+GPV AFDSY
Sbjct: 601 AAQQNGSKRPRGEDSHL--------SEEITEDLSDASGNQRDTKTRKMKKQGPVAAFDSY 660
Query: 715 VLAAVCALACELQLFPLIANVSNPSSTKDLVV-------NGTSHDFKDGLDSAIRHTRRI 774
VLAAVCALACELQLFPLIA +N S++KD+ + NG+S +F++ +DSAIRHT RI
Sbjct: 661 VLAAVCALACELQLFPLIARGTNHSASKDVQIRAKPAKLNGSSSEFRNSIDSAIRHTHRI 720
Query: 775 LAILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDN 834
LAILEALFSLKPS++GT+W YSSNEIVAAAMVAAH+SELFRRSKACM+ALSVLMRCK D
Sbjct: 721 LAILEALFSLKPSSVGTSWSYSSNEIVAAAMVAAHVSELFRRSKACMHALSVLMRCKWDE 780
Query: 835 EIHTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVC 894
EI+TRASSLYNLIDIH KAVASI NKAEPLEAHL VW+ P + K++DCA T C
Sbjct: 781 EIYTRASSLYNLIDIHSKAVASIVNKAEPLEAHLIHATVWKDSPGHKDGSKEDDCASTSC 840
Query: 895 IQSEVPLTL-CEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRH 954
+S PL L EDS++SK+L KA + N GN GK +A+ DAS+LA+FLTMDRH
Sbjct: 841 FKSVNPLLLHSEDSAYSKSLPQFEKAPHLNEGTGNSLGKGIASFPLDASELANFLTMDRH 900
Query: 955 IGLNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALC 1014
IG +CS Q+ L+SVL EKQELCFSVVSLLWHKLIA+PET+PSAESTSAQQGWRQVVDALC
Sbjct: 901 IGFSCSAQVLLRSVLAEKQELCFSVVSLLWHKLIAAPETKPSAESTSAQQGWRQVVDALC 960
Query: 1015 NVVSASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESL 1074
NVVSASP KAA +VLQAERELQPWIAKDDD GQKMWRINQRIVKL+VELMRN+D PESL
Sbjct: 961 NVVSASPAKAATAVVLQAERELQPWIAKDDDLGQKMWRINQRIVKLIVELMRNHDRPESL 1020
Query: 1075 VILASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLL 1134
VIL+SASDLLLRATDGMLVDGEACTLPQLELLEATARAVQ VL+WGESGLAVADGLS+LL
Sbjct: 1021 VILSSASDLLLRATDGMLVDGEACTLPQLELLEATARAVQLVLEWGESGLAVADGLSNLL 1080
Query: 1135 KCRLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGII 1194
KCR+PATI CLSHPSAHVRALSTSVLRD+ Q+GS++ KQ RN H +QY++LGII
Sbjct: 1081 KCRVPATIRCLSHPSAHVRALSTSVLRDVLQSGSIKPHIKQGGRNGIHS--YQYVNLGII 1140
Query: 1195 DWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
DW DIEKCLTWEAHSRLA G T ++LD+AAKELGC I+I
Sbjct: 1141 DWQADIEKCLTWEAHSRLATGMTNQFLDVAAKELGCTISI 1170
BLAST of Spo04360.1 vs. NCBI nr
Match:
gi|590601187|ref|XP_007019601.1| (Gigantea protein isoform 1 [Theobroma cacao])
HSP 1 Score: 1765.7 bits (4572), Expect = 0.000e+0
Identity = 919/1180 (77.88%), Positives = 1004/1180 (85.08%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYP 114
M PSERWIDGLQFSSLFWPPPQ+ QQRK QITAYVEYFGQF SEQFPEDIAEL+R RYP
Sbjct: 1 MASPSERWIDGLQFSSLFWPPPQDPQQRKVQITAYVEYFGQFTSEQFPEDIAELVRNRYP 60
Query: 115 SHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDEN 174
E+RLFDDVLA FVLHHPEHGHAVVLPIIS IIDGTLVYDK PPFASFISL CP+ EN
Sbjct: 61 HKEQRLFDDVLAMFVLHHPEHGHAVVLPIISCIIDGTLVYDKSTPPFASFISLVCPSSEN 120
Query: 175 EYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS-HSSSV 234
EYSEQWALACGEILRILTHYNRPIYK+E++ S D S S AT S+ +PS H +
Sbjct: 121 EYSEQWALACGEILRILTHYNRPIYKMEQQNSETDRSNSSGQATTSEPVDGEPSFHIPLM 180
Query: 235 QSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHP 294
Q E++P RPLSPWITDILLAAPLGI+SDYFRWC GVMG+YAAG+LKPP +ASSRG+GKHP
Sbjct: 181 QQERKPLRPLSPWITDILLAAPLGIRSDYFRWCSGVMGKYAAGDLKPPSTASSRGSGKHP 240
Query: 295 QLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLP 354
QLMPSTPRWAVANGAGVILSVCDEEV RYETA+LTA AVPALLLPPPTTALDEHLVAGLP
Sbjct: 241 QLMPSTPRWAVANGAGVILSVCDEEVARYETATLTAAAVPALLLPPPTTALDEHLVAGLP 300
Query: 355 ALEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGM 414
ALEP YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATG+
Sbjct: 301 ALEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGI 360
Query: 415 RLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEP 474
RLPRNWMHLHFLRAIG AMS+R GIAADAAAALLFRILSQPALLFPPLRQVEGV+VQ EP
Sbjct: 361 RLPRNWMHLHFLRAIGTAMSMRAGIAADAAAALLFRILSQPALLFPPLRQVEGVEVQHEP 420
Query: 475 LGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA 534
GGYISCYRKQIE+PAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA
Sbjct: 421 SGGYISCYRKQIEVPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA 480
Query: 535 VDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPF 594
VDLPEIIVATPLQP +LSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAIL+RTFP
Sbjct: 481 VDLPEIIVATPLQPAILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILQRTFPP 540
Query: 595 ESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSH 654
ESS + R++RYS G +SKNLAVAELRTMVHSLFLESC S ELASRLLFVVLTVCVSH
Sbjct: 541 ESSRVQTRKTRYS--IGSASKNLAVAELRTMVHSLFLESCASVELASRLLFVVLTVCVSH 600
Query: 655 EAQTNG-KSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSY 714
EAQ +G K R +++ E SQS S+R R+ + +K+KK+GPV AFDSY
Sbjct: 601 EAQFSGSKRPRCEESYP--PDEGIEESQSP------SERPRDIKPRKTKKQGPVAAFDSY 660
Query: 715 VLAAVCALACELQLFPLIANVSNPSSTKDL-------VVNGTSHDFKDGLDSAIRHTRRI 774
VLAAVCALACELQLFPL+ SN S+ KD+ +NG+S ++ +DSAI HT RI
Sbjct: 661 VLAAVCALACELQLFPLVTRGSNHSTAKDVQAIAKPAKLNGSSIEYGHSIDSAIHHTHRI 720
Query: 775 LAILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDN 834
LAILEALFSLKPS++GT+W YSSNEIVAAAMVAAH+SELFRRSKACM+ALSVLMRCK DN
Sbjct: 721 LAILEALFSLKPSSVGTSWSYSSNEIVAAAMVAAHVSELFRRSKACMHALSVLMRCKWDN 780
Query: 835 EIHTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVC 894
EI+TRASSLYNLIDIH KAVASI NKAEPLEA L PVW+ P+ + RKQN T C
Sbjct: 781 EIYTRASSLYNLIDIHSKAVASIVNKAEPLEAQLIHAPVWKDSPVCLDGRKQNKRTNTTC 840
Query: 895 IQ-SEVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRH 954
+ + CEDS+HS L C + + ++ +GN GK +A+ DASDLA+FLTMDRH
Sbjct: 841 FDPGQSSASECEDSTHSDKNLRCERVLASDEGSGNSLGKGIASFPLDASDLANFLTMDRH 900
Query: 955 IGLNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALC 1014
IG NCS QI L+SVL EKQELCFSVVSLLWHKLIA+PETQPSAESTSAQQGWRQVVDALC
Sbjct: 901 IGFNCSAQILLRSVLVEKQELCFSVVSLLWHKLIAAPETQPSAESTSAQQGWRQVVDALC 960
Query: 1015 NVVSASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESL 1074
NVVSASPTKAA +VLQAERE QPWI KDDDQGQKMWRINQRIVKL+VELMRN+D+PESL
Sbjct: 961 NVVSASPTKAATAVVLQAEREFQPWITKDDDQGQKMWRINQRIVKLIVELMRNHDSPESL 1020
Query: 1075 VILASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLL 1134
VI+ASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQ VL+WGESGLAVADGLS+LL
Sbjct: 1021 VIVASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQPVLEWGESGLAVADGLSNLL 1080
Query: 1135 KCRLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGII 1194
KCRLPAT CLSHPSAHVRALSTSVLR+I GS++ KQ N H +QY +G+I
Sbjct: 1081 KCRLPATTRCLSHPSAHVRALSTSVLRNILHAGSIKPNSKQVEINGIHGPSYQYFSVGVI 1140
Query: 1195 DWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
DWHTDIEKCLTWEAHS+LARG I +LD AAKELGC+I+I
Sbjct: 1141 DWHTDIEKCLTWEAHSQLARGMPIRFLDTAAKELGCSISI 1170
BLAST of Spo04360.1 vs. NCBI nr
Match:
gi|590601204|ref|XP_007019603.1| (Gigantea protein isoform 3 [Theobroma cacao])
HSP 1 Score: 1761.1 bits (4560), Expect = 0.000e+0
Identity = 919/1181 (77.82%), Positives = 1004/1181 (85.01%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYP 114
M PSERWIDGLQFSSLFWPPPQ+ QQRK QITAYVEYFGQF SEQFPEDIAEL+R RYP
Sbjct: 1 MASPSERWIDGLQFSSLFWPPPQDPQQRKVQITAYVEYFGQFTSEQFPEDIAELVRNRYP 60
Query: 115 SHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDEN 174
E+RLFDDVLA FVLHHPEHGHAVVLPIIS IIDGTLVYDK PPFASFISL CP+ EN
Sbjct: 61 HKEQRLFDDVLAMFVLHHPEHGHAVVLPIISCIIDGTLVYDKSTPPFASFISLVCPSSEN 120
Query: 175 EYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS-HSSSV 234
EYSEQWALACGEILRILTHYNRPIYK+E++ S D S S AT S+ +PS H +
Sbjct: 121 EYSEQWALACGEILRILTHYNRPIYKMEQQNSETDRSNSSGQATTSEPVDGEPSFHIPLM 180
Query: 235 QSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHP 294
Q E++P RPLSPWITDILLAAPLGI+SDYFRWC GVMG+YAAG+LKPP +ASSRG+GKHP
Sbjct: 181 QQERKPLRPLSPWITDILLAAPLGIRSDYFRWCSGVMGKYAAGDLKPPSTASSRGSGKHP 240
Query: 295 QLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLP 354
QLMPSTPRWAVANGAGVILSVCDEEV RYETA+LTA AVPALLLPPPTTALDEHLVAGLP
Sbjct: 241 QLMPSTPRWAVANGAGVILSVCDEEVARYETATLTAAAVPALLLPPPTTALDEHLVAGLP 300
Query: 355 ALEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGM 414
ALEP YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATG+
Sbjct: 301 ALEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGI 360
Query: 415 RLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEP 474
RLPRNWMHLHFLRAIG AMS+R GIAADAAAALLFRILSQPALLFPPLRQVEGV+VQ EP
Sbjct: 361 RLPRNWMHLHFLRAIGTAMSMRAGIAADAAAALLFRILSQPALLFPPLRQVEGVEVQHEP 420
Query: 475 LGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA 534
GGYISCYRKQIE+PAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA
Sbjct: 421 SGGYISCYRKQIEVPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA 480
Query: 535 VDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPF 594
VDLPEIIVATPLQP +LSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAIL+RTFP
Sbjct: 481 VDLPEIIVATPLQPAILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILQRTFPP 540
Query: 595 ESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSH 654
ESS + R++RYS G +SKNLAVAELRTMVHSLFLESC S ELASRLLFVVLTVCVSH
Sbjct: 541 ESSRVQTRKTRYS--IGSASKNLAVAELRTMVHSLFLESCASVELASRLLFVVLTVCVSH 600
Query: 655 EAQTNG-KSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSY 714
EAQ +G K R +++ E SQS S+R R+ + +K+KK+GPV AFDSY
Sbjct: 601 EAQFSGSKRPRCEESYP--PDEGIEESQSP------SERPRDIKPRKTKKQGPVAAFDSY 660
Query: 715 VLAAVCALACELQLFPLIANVSNPSSTKDL-------VVNGTSHDFKDGLDSAIRHTRRI 774
VLAAVCALACELQLFPL+ SN S+ KD+ +NG+S ++ +DSAI HT RI
Sbjct: 661 VLAAVCALACELQLFPLVTRGSNHSTAKDVQAIAKPAKLNGSSIEYGHSIDSAIHHTHRI 720
Query: 775 LAILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDN 834
LAILEALFSLKPS++GT+W YSSNEIVAAAMVAAH+SELFRRSKACM+ALSVLMRCK DN
Sbjct: 721 LAILEALFSLKPSSVGTSWSYSSNEIVAAAMVAAHVSELFRRSKACMHALSVLMRCKWDN 780
Query: 835 EIHTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVC 894
EI+TRASSLYNLIDIH KAVASI NKAEPLEA L PVW+ P+ + RKQN T C
Sbjct: 781 EIYTRASSLYNLIDIHSKAVASIVNKAEPLEAQLIHAPVWKDSPVCLDGRKQNKRTNTTC 840
Query: 895 IQ-SEVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRH 954
+ + CEDS+HS L C + + ++ +GN GK +A+ DASDLA+FLTMDRH
Sbjct: 841 FDPGQSSASECEDSTHSDKNLRCERVLASDEGSGNSLGKGIASFPLDASDLANFLTMDRH 900
Query: 955 IGLNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALC 1014
IG NCS QI L+SVL EKQELCFSVVSLLWHKLIA+PETQPSAESTSAQQGWRQVVDALC
Sbjct: 901 IGFNCSAQILLRSVLVEKQELCFSVVSLLWHKLIAAPETQPSAESTSAQQGWRQVVDALC 960
Query: 1015 NVVSASPTKAAAGIVL-QAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPES 1074
NVVSASPTKAA +VL QAERE QPWI KDDDQGQKMWRINQRIVKL+VELMRN+D+PES
Sbjct: 961 NVVSASPTKAATAVVLQQAEREFQPWITKDDDQGQKMWRINQRIVKLIVELMRNHDSPES 1020
Query: 1075 LVILASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSL 1134
LVI+ASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQ VL+WGESGLAVADGLS+L
Sbjct: 1021 LVIVASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQPVLEWGESGLAVADGLSNL 1080
Query: 1135 LKCRLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGI 1194
LKCRLPAT CLSHPSAHVRALSTSVLR+I GS++ KQ N H +QY +G+
Sbjct: 1081 LKCRLPATTRCLSHPSAHVRALSTSVLRNILHAGSIKPNSKQVEINGIHGPSYQYFSVGV 1140
Query: 1195 IDWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
IDWHTDIEKCLTWEAHS+LARG I +LD AAKELGC+I+I
Sbjct: 1141 IDWHTDIEKCLTWEAHSQLARGMPIRFLDTAAKELGCSISI 1171
BLAST of Spo04360.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9RUB0_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_027320 PE=4 SV=1)
HSP 1 Score: 2291.2 bits (5936), Expect = 0.000e+0
Identity = 1168/1176 (99.32%), Positives = 1168/1176 (99.32%), Query Frame = 1
Query: 49 MGTVDYMTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAEL 108
MGTVDYMTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAEL
Sbjct: 1 MGTVDYMTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAEL 60
Query: 109 IRIRYPSHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLF 168
IRIRYPSHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLF
Sbjct: 61 IRIRYPSHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLF 120
Query: 169 CPNDENEYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS 228
CPNDENEYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS
Sbjct: 121 CPNDENEYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS 180
Query: 229 HSSSVQSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRG 288
HSSSVQSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRG
Sbjct: 181 HSSSVQSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRG 240
Query: 289 NGKHPQLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHL 348
NGKHPQLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHL
Sbjct: 241 NGKHPQLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHL 300
Query: 349 VAGLPALEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAED 408
VAGLPALEP YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAED
Sbjct: 301 VAGLPALEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAED 360
Query: 409 YATGMRLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVD 468
YATGMRLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVD
Sbjct: 361 YATGMRLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVD 420
Query: 469 VQQEPLGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIP 528
VQQEPLGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIP
Sbjct: 421 VQQEPLGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIP 480
Query: 529 LSSSAVDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILR 588
LSSSAVDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILR
Sbjct: 481 LSSSAVDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILR 540
Query: 589 RTFPFESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLT 648
RTFPFESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLT
Sbjct: 541 RTFPFESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLT 600
Query: 649 VCVSHEAQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVA 708
VCVSHEAQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVA
Sbjct: 601 VCVSHEAQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVA 660
Query: 709 FDSYVLAAVCALACELQLFPLIANVSNPSSTKDLVVNGTSHDFKDGLDSAIRHTRRILAI 768
FDSYVLAAVCALACELQLFPLIANVSNPSSTKDLVVNGTSHDFKDGLDSAIRHTRRILAI
Sbjct: 661 FDSYVLAAVCALACELQLFPLIANVSNPSSTKDLVVNGTSHDFKDGLDSAIRHTRRILAI 720
Query: 769 LEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIH 828
LEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIH
Sbjct: 721 LEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIH 780
Query: 829 TRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQS 888
TRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQS
Sbjct: 781 TRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQS 840
Query: 889 EVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGLN 948
EVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGLN
Sbjct: 841 EVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGLN 900
Query: 949 CSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVVS 1008
CSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVVS
Sbjct: 901 CSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVVS 960
Query: 1009 ASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVILA 1068
ASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVILA
Sbjct: 961 ASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVILA 1020
Query: 1069 SASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCRL 1128
SASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCRL
Sbjct: 1021 SASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCRL 1080
Query: 1129 PATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDWHT 1188
PATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDWHT
Sbjct: 1081 PATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDWHT 1140
Query: 1189 DIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
DIEKCLTWEAHSRLARGKTIEYLDMAAKELGC ITI
Sbjct: 1141 DIEKCLTWEAHSRLARGKTIEYLDMAAKELGCEITI 1176
BLAST of Spo04360.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8D5Y6_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_1g012190 PE=4 SV=1)
HSP 1 Score: 2053.9 bits (5320), Expect = 0.000e+0
Identity = 1051/1178 (89.22%), Positives = 1098/1178 (93.21%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYP 114
M RPSERWIDGLQFSSLFWPPPQETQQRKAQ TAYVEYFGQFISEQFPED+AELIR RYP
Sbjct: 1 MERPSERWIDGLQFSSLFWPPPQETQQRKAQTTAYVEYFGQFISEQFPEDLAELIRSRYP 60
Query: 115 SHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDEN 174
EKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDK++PPFASFISLFCPNDEN
Sbjct: 61 FDEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKERPPFASFISLFCPNDEN 120
Query: 175 EYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPSHSSSVQ 234
EYSEQWALACGEILRILTHYNRPIYK ER++S E+C+ ++ATCS AS AK SHSSSVQ
Sbjct: 121 EYSEQWALACGEILRILTHYNRPIYKHERQKSPENCTSCKDYATCSNASDAKSSHSSSVQ 180
Query: 235 SEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHPQ 294
SE+R SRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPM ASSRGNGKHPQ
Sbjct: 181 SERRTSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMIASSRGNGKHPQ 240
Query: 295 LMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLPA 354
LMPSTPRWAVANGAGVILSVCDEEV RYETASLTAVAVPALLLPPPTT+LDEHLVAGLPA
Sbjct: 241 LMPSTPRWAVANGAGVILSVCDEEVARYETASLTAVAVPALLLPPPTTSLDEHLVAGLPA 300
Query: 355 LEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMR 414
LEP YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMR
Sbjct: 301 LEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMR 360
Query: 415 LPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEPL 474
LPRNWMHLHFLRAIGIAMS+REGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQ EPL
Sbjct: 361 LPRNWMHLHFLRAIGIAMSMREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQHEPL 420
Query: 475 GGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV 534
GGYISCYRKQIEMP+AEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV
Sbjct: 421 GGYISCYRKQIEMPSAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV 480
Query: 535 DLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPFE 594
DLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVE++L+RTFP E
Sbjct: 481 DLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVESVLQRTFPLE 540
Query: 595 SSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSHE 654
SS EENRRSRYSSEAG +SKNLAVAELRTMVHSLFLESC S ELASRLLFVVLTVCVSHE
Sbjct: 541 SSMEENRRSRYSSEAGAASKNLAVAELRTMVHSLFLESCASEELASRLLFVVLTVCVSHE 600
Query: 655 AQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSYVL 714
A TNGK SR +D+ +D ++S+S+QSSPS S SK+Q+E RSKK KK+GPVVAFDSYVL
Sbjct: 601 AHTNGKRSRVEDSYSDRAAKHSKSTQSSPSSSTKSKKQKESRSKKPKKQGPVVAFDSYVL 660
Query: 715 AAVCALACELQLFPLIANVSNPSSTKDLV-------VNGTSHDFKDGLDSAIRHTRRILA 774
AAVCALACELQLFPLIA +SNPSS+KD V +NG+SH+FKDG+DSAIRHTRRILA
Sbjct: 661 AAVCALACELQLFPLIAGISNPSSSKDSVEIAKPVKLNGSSHEFKDGIDSAIRHTRRILA 720
Query: 775 ILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEI 834
ILEALFSLKPSTIGT+WGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCK DNEI
Sbjct: 721 ILEALFSLKPSTIGTSWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKWDNEI 780
Query: 835 HTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQ 894
H+RASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWR IV N RK ND A TVC
Sbjct: 781 HSRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRDSSIVTNGRKHNDFAGTVCFL 840
Query: 895 SEVPLTL-CEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIG 954
EVP TL CED +HSKN LNCGKAV+TNND GN AGK VA+ QFDASDLA FLTMDRHIG
Sbjct: 841 PEVPSTLTCEDPAHSKNSLNCGKAVHTNNDTGNTAGKSVASFQFDASDLAQFLTMDRHIG 900
Query: 955 LNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNV 1014
NCSVQIFLQSVLEEKQELCFSVVSLLW KLIASPETQPSAESTSAQQGWRQVVDALCNV
Sbjct: 901 FNCSVQIFLQSVLEEKQELCFSVVSLLWQKLIASPETQPSAESTSAQQGWRQVVDALCNV 960
Query: 1015 VSASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVI 1074
VSASP KAAAGIVLQAERELQPWIAKDDDQGQKMW+INQRIVKLMVELMRNYD PESLVI
Sbjct: 961 VSASPAKAAAGIVLQAERELQPWIAKDDDQGQKMWKINQRIVKLMVELMRNYDTPESLVI 1020
Query: 1075 LASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKC 1134
LASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQ VLKWGESGLAVADGLS+LLKC
Sbjct: 1021 LASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQPVLKWGESGLAVADGLSNLLKC 1080
Query: 1135 RLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGIIDW 1194
RLPATITCLSHPSAHVRALSTSVLRDIQQNGS++F FKQE+RN TH++ F+YLH+GIIDW
Sbjct: 1081 RLPATITCLSHPSAHVRALSTSVLRDIQQNGSIKFSFKQESRNGTHKTAFEYLHIGIIDW 1140
Query: 1195 HTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
HTDIEKCLTWEAHSRL RGKTIEYLDMAAKELGCAITI
Sbjct: 1141 HTDIEKCLTWEAHSRLTRGKTIEYLDMAAKELGCAITI 1178
BLAST of Spo04360.1 vs. UniProtKB/TrEMBL
Match:
F6H6Q8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0157g00020 PE=4 SV=1)
HSP 1 Score: 1765.7 bits (4572), Expect = 0.000e+0
Identity = 919/1180 (77.88%), Positives = 1012/1180 (85.76%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYP 114
M ERWIDGLQFSSLFWPPPQ+ QQRKAQITAYV+YFGQF SEQFPEDIAELIR RYP
Sbjct: 1 MASSCERWIDGLQFSSLFWPPPQDVQQRKAQITAYVDYFGQFTSEQFPEDIAELIRSRYP 60
Query: 115 SHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDEN 174
S E+RLFDDVLATFVLHHPEHGHAVVLPIIS IIDGTLVYD+ PPFASFISL CP+ EN
Sbjct: 61 SKEQRLFDDVLATFVLHHPEHGHAVVLPIISCIIDGTLVYDRCTPPFASFISLVCPSSEN 120
Query: 175 EYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPSHSSSVQ 234
EYSEQWALACGEILRILTHYNRPIYKVE + S D S S HAT S + K S +Q
Sbjct: 121 EYSEQWALACGEILRILTHYNRPIYKVEHQSSEADRSSSGRHATTSDSVDGKSSQGPLLQ 180
Query: 235 SEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHPQ 294
+E++PSRPLSPWITDILLAAPLGI+SDYFRWCGGVMG+YAAGELKPP +AS+RG+GKHPQ
Sbjct: 181 NERKPSRPLSPWITDILLAAPLGIRSDYFRWCGGVMGKYAAGELKPPSTASTRGSGKHPQ 240
Query: 295 LMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLPA 354
L+PSTPRWAVANGAGVILSVCDEEV RYETA+LTA AVPALLLPPPTTALDEHLVAGLPA
Sbjct: 241 LIPSTPRWAVANGAGVILSVCDEEVARYETATLTAAAVPALLLPPPTTALDEHLVAGLPA 300
Query: 355 LEPY-------YAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMR 414
LEPY YAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYA+GMR
Sbjct: 301 LEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYASGMR 360
Query: 415 LPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEPL 474
LPRNWMHLHFLRAIG AMS+R GIAADAAAALLFR+LSQPALLFPPLRQVEG + Q EPL
Sbjct: 361 LPRNWMHLHFLRAIGTAMSMRAGIAADAAAALLFRVLSQPALLFPPLRQVEGFEFQHEPL 420
Query: 475 GGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV 534
GYIS Y+KQIE+PA EATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV
Sbjct: 421 DGYISSYKKQIEVPATEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAV 480
Query: 535 DLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPFE 594
DLPEIIVATPLQPP+LSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVE+IL+RTFP E
Sbjct: 481 DLPEIIVATPLQPPILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVESILQRTFPAE 540
Query: 595 SSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSHE 654
SS E R++RY G +SKNLAVAELRTMVH+LFLESC S ELASRLLFVVLTVCVSHE
Sbjct: 541 SSRENIRKTRYLFGIGSASKNLAVAELRTMVHALFLESCASVELASRLLFVVLTVCVSHE 600
Query: 655 A-QTNG-KSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSY 714
A Q NG K R +D+ S + + S+ S QR+ +++K KK+GPV AFDSY
Sbjct: 601 AAQQNGSKRPRGEDSHL--------SEEITEDLSDASGNQRDTKTRKMKKQGPVAAFDSY 660
Query: 715 VLAAVCALACELQLFPLIANVSNPSSTKDLVV-------NGTSHDFKDGLDSAIRHTRRI 774
VLAAVCALACELQLFPLIA +N S++KD+ + NG+S +F++ +DSAIRHT RI
Sbjct: 661 VLAAVCALACELQLFPLIARGTNHSASKDVQIRAKPAKLNGSSSEFRNSIDSAIRHTHRI 720
Query: 775 LAILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDN 834
LAILEALFSLKPS++GT+W YSSNEIVAAAMVAAH+SELFRRSKACM+ALSVLMRCK D
Sbjct: 721 LAILEALFSLKPSSVGTSWSYSSNEIVAAAMVAAHVSELFRRSKACMHALSVLMRCKWDE 780
Query: 835 EIHTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVC 894
EI+TRASSLYNLIDIH KAVASI NKAEPLEAHL VW+ P + K++DCA T C
Sbjct: 781 EIYTRASSLYNLIDIHSKAVASIVNKAEPLEAHLIHATVWKDSPGHKDGSKEDDCASTSC 840
Query: 895 IQSEVPLTL-CEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRH 954
+S PL L EDS++SK+L KA + N GN GK +A+ DAS+LA+FLTMDRH
Sbjct: 841 FKSVNPLLLHSEDSAYSKSLPQFEKAPHLNEGTGNSLGKGIASFPLDASELANFLTMDRH 900
Query: 955 IGLNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALC 1014
IG +CS Q+ L+SVL EKQELCFSVVSLLWHKLIA+PET+PSAESTSAQQGWRQVVDALC
Sbjct: 901 IGFSCSAQVLLRSVLAEKQELCFSVVSLLWHKLIAAPETKPSAESTSAQQGWRQVVDALC 960
Query: 1015 NVVSASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESL 1074
NVVSASP KAA +VLQAERELQPWIAKDDD GQKMWRINQRIVKL+VELMRN+D PESL
Sbjct: 961 NVVSASPAKAATAVVLQAERELQPWIAKDDDLGQKMWRINQRIVKLIVELMRNHDRPESL 1020
Query: 1075 VILASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLL 1134
VIL+SASDLLLRATDGMLVDGEACTLPQLELLEATARAVQ VL+WGESGLAVADGLS+LL
Sbjct: 1021 VILSSASDLLLRATDGMLVDGEACTLPQLELLEATARAVQLVLEWGESGLAVADGLSNLL 1080
Query: 1135 KCRLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGII 1194
KCR+PATI CLSHPSAHVRALSTSVLRD+ Q+GS++ KQ RN H +QY++LGII
Sbjct: 1081 KCRVPATIRCLSHPSAHVRALSTSVLRDVLQSGSIKPHIKQGGRNGIHS--YQYVNLGII 1140
Query: 1195 DWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
DW DIEKCLTWEAHSRLA G T ++LD+AAKELGC I+I
Sbjct: 1141 DWQADIEKCLTWEAHSRLATGMTNQFLDVAAKELGCTISI 1170
BLAST of Spo04360.1 vs. UniProtKB/TrEMBL
Match:
A0A061FJH7_THECC (Gigantea protein isoform 1 OS=Theobroma cacao GN=TCM_035715 PE=4 SV=1)
HSP 1 Score: 1765.7 bits (4572), Expect = 0.000e+0
Identity = 919/1180 (77.88%), Positives = 1004/1180 (85.08%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYP 114
M PSERWIDGLQFSSLFWPPPQ+ QQRK QITAYVEYFGQF SEQFPEDIAEL+R RYP
Sbjct: 1 MASPSERWIDGLQFSSLFWPPPQDPQQRKVQITAYVEYFGQFTSEQFPEDIAELVRNRYP 60
Query: 115 SHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDEN 174
E+RLFDDVLA FVLHHPEHGHAVVLPIIS IIDGTLVYDK PPFASFISL CP+ EN
Sbjct: 61 HKEQRLFDDVLAMFVLHHPEHGHAVVLPIISCIIDGTLVYDKSTPPFASFISLVCPSSEN 120
Query: 175 EYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS-HSSSV 234
EYSEQWALACGEILRILTHYNRPIYK+E++ S D S S AT S+ +PS H +
Sbjct: 121 EYSEQWALACGEILRILTHYNRPIYKMEQQNSETDRSNSSGQATTSEPVDGEPSFHIPLM 180
Query: 235 QSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHP 294
Q E++P RPLSPWITDILLAAPLGI+SDYFRWC GVMG+YAAG+LKPP +ASSRG+GKHP
Sbjct: 181 QQERKPLRPLSPWITDILLAAPLGIRSDYFRWCSGVMGKYAAGDLKPPSTASSRGSGKHP 240
Query: 295 QLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLP 354
QLMPSTPRWAVANGAGVILSVCDEEV RYETA+LTA AVPALLLPPPTTALDEHLVAGLP
Sbjct: 241 QLMPSTPRWAVANGAGVILSVCDEEVARYETATLTAAAVPALLLPPPTTALDEHLVAGLP 300
Query: 355 ALEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGM 414
ALEP YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATG+
Sbjct: 301 ALEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGI 360
Query: 415 RLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEP 474
RLPRNWMHLHFLRAIG AMS+R GIAADAAAALLFRILSQPALLFPPLRQVEGV+VQ EP
Sbjct: 361 RLPRNWMHLHFLRAIGTAMSMRAGIAADAAAALLFRILSQPALLFPPLRQVEGVEVQHEP 420
Query: 475 LGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA 534
GGYISCYRKQIE+PAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA
Sbjct: 421 SGGYISCYRKQIEVPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA 480
Query: 535 VDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPF 594
VDLPEIIVATPLQP +LSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAIL+RTFP
Sbjct: 481 VDLPEIIVATPLQPAILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILQRTFPP 540
Query: 595 ESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSH 654
ESS + R++RYS G +SKNLAVAELRTMVHSLFLESC S ELASRLLFVVLTVCVSH
Sbjct: 541 ESSRVQTRKTRYS--IGSASKNLAVAELRTMVHSLFLESCASVELASRLLFVVLTVCVSH 600
Query: 655 EAQTNG-KSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSY 714
EAQ +G K R +++ E SQS S+R R+ + +K+KK+GPV AFDSY
Sbjct: 601 EAQFSGSKRPRCEESYP--PDEGIEESQSP------SERPRDIKPRKTKKQGPVAAFDSY 660
Query: 715 VLAAVCALACELQLFPLIANVSNPSSTKDL-------VVNGTSHDFKDGLDSAIRHTRRI 774
VLAAVCALACELQLFPL+ SN S+ KD+ +NG+S ++ +DSAI HT RI
Sbjct: 661 VLAAVCALACELQLFPLVTRGSNHSTAKDVQAIAKPAKLNGSSIEYGHSIDSAIHHTHRI 720
Query: 775 LAILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDN 834
LAILEALFSLKPS++GT+W YSSNEIVAAAMVAAH+SELFRRSKACM+ALSVLMRCK DN
Sbjct: 721 LAILEALFSLKPSSVGTSWSYSSNEIVAAAMVAAHVSELFRRSKACMHALSVLMRCKWDN 780
Query: 835 EIHTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVC 894
EI+TRASSLYNLIDIH KAVASI NKAEPLEA L PVW+ P+ + RKQN T C
Sbjct: 781 EIYTRASSLYNLIDIHSKAVASIVNKAEPLEAQLIHAPVWKDSPVCLDGRKQNKRTNTTC 840
Query: 895 IQ-SEVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRH 954
+ + CEDS+HS L C + + ++ +GN GK +A+ DASDLA+FLTMDRH
Sbjct: 841 FDPGQSSASECEDSTHSDKNLRCERVLASDEGSGNSLGKGIASFPLDASDLANFLTMDRH 900
Query: 955 IGLNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALC 1014
IG NCS QI L+SVL EKQELCFSVVSLLWHKLIA+PETQPSAESTSAQQGWRQVVDALC
Sbjct: 901 IGFNCSAQILLRSVLVEKQELCFSVVSLLWHKLIAAPETQPSAESTSAQQGWRQVVDALC 960
Query: 1015 NVVSASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESL 1074
NVVSASPTKAA +VLQAERE QPWI KDDDQGQKMWRINQRIVKL+VELMRN+D+PESL
Sbjct: 961 NVVSASPTKAATAVVLQAEREFQPWITKDDDQGQKMWRINQRIVKLIVELMRNHDSPESL 1020
Query: 1075 VILASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLL 1134
VI+ASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQ VL+WGESGLAVADGLS+LL
Sbjct: 1021 VIVASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQPVLEWGESGLAVADGLSNLL 1080
Query: 1135 KCRLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGII 1194
KCRLPAT CLSHPSAHVRALSTSVLR+I GS++ KQ N H +QY +G+I
Sbjct: 1081 KCRLPATTRCLSHPSAHVRALSTSVLRNILHAGSIKPNSKQVEINGIHGPSYQYFSVGVI 1140
Query: 1195 DWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
DWHTDIEKCLTWEAHS+LARG I +LD AAKELGC+I+I
Sbjct: 1141 DWHTDIEKCLTWEAHSQLARGMPIRFLDTAAKELGCSISI 1170
BLAST of Spo04360.1 vs. UniProtKB/TrEMBL
Match:
A0A061FIU7_THECC (Gigantea protein isoform 3 OS=Theobroma cacao GN=TCM_035715 PE=4 SV=1)
HSP 1 Score: 1761.1 bits (4560), Expect = 0.000e+0
Identity = 919/1181 (77.82%), Positives = 1004/1181 (85.01%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYP 114
M PSERWIDGLQFSSLFWPPPQ+ QQRK QITAYVEYFGQF SEQFPEDIAEL+R RYP
Sbjct: 1 MASPSERWIDGLQFSSLFWPPPQDPQQRKVQITAYVEYFGQFTSEQFPEDIAELVRNRYP 60
Query: 115 SHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDEN 174
E+RLFDDVLA FVLHHPEHGHAVVLPIIS IIDGTLVYDK PPFASFISL CP+ EN
Sbjct: 61 HKEQRLFDDVLAMFVLHHPEHGHAVVLPIISCIIDGTLVYDKSTPPFASFISLVCPSSEN 120
Query: 175 EYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPS-HSSSV 234
EYSEQWALACGEILRILTHYNRPIYK+E++ S D S S AT S+ +PS H +
Sbjct: 121 EYSEQWALACGEILRILTHYNRPIYKMEQQNSETDRSNSSGQATTSEPVDGEPSFHIPLM 180
Query: 235 QSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHP 294
Q E++P RPLSPWITDILLAAPLGI+SDYFRWC GVMG+YAAG+LKPP +ASSRG+GKHP
Sbjct: 181 QQERKPLRPLSPWITDILLAAPLGIRSDYFRWCSGVMGKYAAGDLKPPSTASSRGSGKHP 240
Query: 295 QLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLP 354
QLMPSTPRWAVANGAGVILSVCDEEV RYETA+LTA AVPALLLPPPTTALDEHLVAGLP
Sbjct: 241 QLMPSTPRWAVANGAGVILSVCDEEVARYETATLTAAAVPALLLPPPTTALDEHLVAGLP 300
Query: 355 ALEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGM 414
ALEP YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATG+
Sbjct: 301 ALEPYARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGI 360
Query: 415 RLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEP 474
RLPRNWMHLHFLRAIG AMS+R GIAADAAAALLFRILSQPALLFPPLRQVEGV+VQ EP
Sbjct: 361 RLPRNWMHLHFLRAIGTAMSMRAGIAADAAAALLFRILSQPALLFPPLRQVEGVEVQHEP 420
Query: 475 LGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA 534
GGYISCYRKQIE+PAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA
Sbjct: 421 SGGYISCYRKQIEVPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSA 480
Query: 535 VDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPF 594
VDLPEIIVATPLQP +LSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAIL+RTFP
Sbjct: 481 VDLPEIIVATPLQPAILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILQRTFPP 540
Query: 595 ESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSH 654
ESS + R++RYS G +SKNLAVAELRTMVHSLFLESC S ELASRLLFVVLTVCVSH
Sbjct: 541 ESSRVQTRKTRYS--IGSASKNLAVAELRTMVHSLFLESCASVELASRLLFVVLTVCVSH 600
Query: 655 EAQTNG-KSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSY 714
EAQ +G K R +++ E SQS S+R R+ + +K+KK+GPV AFDSY
Sbjct: 601 EAQFSGSKRPRCEESYP--PDEGIEESQSP------SERPRDIKPRKTKKQGPVAAFDSY 660
Query: 715 VLAAVCALACELQLFPLIANVSNPSSTKDL-------VVNGTSHDFKDGLDSAIRHTRRI 774
VLAAVCALACELQLFPL+ SN S+ KD+ +NG+S ++ +DSAI HT RI
Sbjct: 661 VLAAVCALACELQLFPLVTRGSNHSTAKDVQAIAKPAKLNGSSIEYGHSIDSAIHHTHRI 720
Query: 775 LAILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDN 834
LAILEALFSLKPS++GT+W YSSNEIVAAAMVAAH+SELFRRSKACM+ALSVLMRCK DN
Sbjct: 721 LAILEALFSLKPSSVGTSWSYSSNEIVAAAMVAAHVSELFRRSKACMHALSVLMRCKWDN 780
Query: 835 EIHTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVC 894
EI+TRASSLYNLIDIH KAVASI NKAEPLEA L PVW+ P+ + RKQN T C
Sbjct: 781 EIYTRASSLYNLIDIHSKAVASIVNKAEPLEAQLIHAPVWKDSPVCLDGRKQNKRTNTTC 840
Query: 895 IQ-SEVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRH 954
+ + CEDS+HS L C + + ++ +GN GK +A+ DASDLA+FLTMDRH
Sbjct: 841 FDPGQSSASECEDSTHSDKNLRCERVLASDEGSGNSLGKGIASFPLDASDLANFLTMDRH 900
Query: 955 IGLNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALC 1014
IG NCS QI L+SVL EKQELCFSVVSLLWHKLIA+PETQPSAESTSAQQGWRQVVDALC
Sbjct: 901 IGFNCSAQILLRSVLVEKQELCFSVVSLLWHKLIAAPETQPSAESTSAQQGWRQVVDALC 960
Query: 1015 NVVSASPTKAAAGIVL-QAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPES 1074
NVVSASPTKAA +VL QAERE QPWI KDDDQGQKMWRINQRIVKL+VELMRN+D+PES
Sbjct: 961 NVVSASPTKAATAVVLQQAEREFQPWITKDDDQGQKMWRINQRIVKLIVELMRNHDSPES 1020
Query: 1075 LVILASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSL 1134
LVI+ASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQ VL+WGESGLAVADGLS+L
Sbjct: 1021 LVIVASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQPVLEWGESGLAVADGLSNL 1080
Query: 1135 LKCRLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQENRNVTHESPFQYLHLGI 1194
LKCRLPAT CLSHPSAHVRALSTSVLR+I GS++ KQ N H +QY +G+
Sbjct: 1081 LKCRLPATTRCLSHPSAHVRALSTSVLRNILHAGSIKPNSKQVEINGIHGPSYQYFSVGV 1140
Query: 1195 IDWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
IDWHTDIEKCLTWEAHS+LARG I +LD AAKELGC+I+I
Sbjct: 1141 IDWHTDIEKCLTWEAHSQLARGMPIRFLDTAAKELGCSISI 1171
BLAST of Spo04360.1 vs. ExPASy Swiss-Prot
Match:
GIGAN_ARATH (Protein GIGANTEA OS=Arabidopsis thaliana GN=GI PE=1 SV=2)
HSP 1 Score: 1591.6 bits (4120), Expect = 0.000e+0
Identity = 842/1181 (71.30%), Positives = 957/1181 (81.03%), Query Frame = 1
Query: 59 SERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYPSHEK 118
SERWIDGLQFSSL WPPP++ QQ K Q+ AYVEYFGQF SEQFP+DIAEL+R +YPS EK
Sbjct: 7 SERWIDGLQFSSLLWPPPRDPQQHKDQVVAYVEYFGQFTSEQFPDDIAELVRHQYPSTEK 66
Query: 119 RLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDENEYSE 178
RL DDVLA FVLHHPEHGHAV+LPIIS +IDG+LVY K+ PFASFISL CP+ EN+YSE
Sbjct: 67 RLLDDVLAMFVLHHPEHGHAVILPIISCLIDGSLVYSKEAHPFASFISLVCPSSENDYSE 126
Query: 179 QWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPSHSSSVQSEKR 238
QWALACGEILRILTHYNRPIYK E+ Q+ + ++ AT S + ++P S Q E++
Sbjct: 127 QWALACGEILRILTHYNRPIYKTEQ-QNGDTERNCLSKATTSGSPTSEPKAGSPTQHERK 186
Query: 239 PSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHPQLMPS 298
P RPLSPWI+DILLAAPLGI+SDYFRWC GVMG+YAAGELKPP AS RG+GKHPQLMPS
Sbjct: 187 PLRPLSPWISDILLAAPLGIRSDYFRWCSGVMGKYAAGELKPPTIAS-RGSGKHPQLMPS 246
Query: 299 TPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLPALEP- 358
TPRWAVANGAGVILSVCD+EV RYETA+LTAVAVPALLLPPPTT+LDEHLVAGLPALEP
Sbjct: 247 TPRWAVANGAGVILSVCDDEVARYETATLTAVAVPALLLPPPTTSLDEHLVAGLPALEPY 306
Query: 359 ------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMRLPRN 418
YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYA+G+RLPRN
Sbjct: 307 ARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYASGVRLPRN 366
Query: 419 WMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEPLGGYI 478
WMHLHFLRAIGIAMS+R G+AADAAAALLFRILSQPALLFPPL QVEGV++Q P+GGY
Sbjct: 367 WMHLHFLRAIGIAMSMRAGVAADAAAALLFRILSQPALLFPPLSQVEGVEIQHAPIGGYS 426
Query: 479 SCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAVDLPE 538
S YRKQIE+PAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPL+SSAVDLPE
Sbjct: 427 SNYRKQIEVPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLNSSAVDLPE 486
Query: 539 IIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPFESSPE 598
IIVATPLQPP+LSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVE IL RTFP ESS E
Sbjct: 487 IIVATPLQPPILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVETILSRTFPPESSRE 546
Query: 599 ENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSHEAQTN 658
R++R S ++KNLA++ELR MVH+LFLESC ELASRLLFVVLTVCVSHEAQ++
Sbjct: 547 LTRKARSSFTTRSATKNLAMSELRAMVHALFLESCAGVELASRLLFVVLTVCVSHEAQSS 606
Query: 659 GKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSYVLAAVC 718
G S R A TT N E++Q SNN R +S+ K +GPV AFDSYVLAAVC
Sbjct: 607 G-SKRPRSEYAS-TTENIEANQPV---SNNQTANR--KSRNVKGQGPVAAFDSYVLAAVC 666
Query: 719 ALACELQLFPLIANVSNPSS-------TKDLVVNGTSHDFKDGLDSAIRHTRRILAILEA 778
ALACE+QL+P+I+ N S+ TK + +NG+S ++ G+DSAI HTRRILAILEA
Sbjct: 667 ALACEVQLYPMISGGGNFSNSAVAGTITKPVKINGSSKEYGAGIDSAISHTRRILAILEA 726
Query: 779 LFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIHTRA 838
LFSLKPS++GT W YSS+EIVAAAMVAAHISELFRRSKA +ALS LMRCK D EIH RA
Sbjct: 727 LFSLKPSSVGTPWSYSSSEIVAAAMVAAHISELFRRSKALTHALSGLMRCKWDKEIHKRA 786
Query: 839 SSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQSEVP 898
SSLYNLID+H K VASI +KAEPLEA+L TPV + N +++N CA T C + V
Sbjct: 787 SSLYNLIDVHSKVVASIVDKAEPLEAYLKNTPVQKDSVTCLNWKQENTCASTTCFDTAV- 846
Query: 899 LTLCEDSSHSKNLL----NCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGL 958
+S S+ + N A +++ +G + K + + DASDLA+FLT DR G
Sbjct: 847 ------TSASRTEMNPRGNHKYARHSDEGSGRPSEKGIKDFLLDASDLANFLTADRLAGF 906
Query: 959 NCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVV 1018
C Q L+SVL EK EL FSVVSLLWHKLIA+PE QP+AESTSAQQGWRQVVDALCNVV
Sbjct: 907 YCGTQKLLRSVLAEKPELSFSVVSLLWHKLIAAPEIQPTAESTSAQQGWRQVVDALCNVV 966
Query: 1019 SASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVIL 1078
SA+P KAAA +VLQAERELQPWIAKDD++GQKMW+INQRIVK++VELMRN+D PESLVIL
Sbjct: 967 SATPAKAAAAVVLQAERELQPWIAKDDEEGQKMWKINQRIVKVLVELMRNHDRPESLVIL 1026
Query: 1079 ASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCR 1138
ASASDLLLRATDGMLVDGEACTLPQLELLEATARA+Q VL WG SGLAV DGLS+LLKCR
Sbjct: 1027 ASASDLLLRATDGMLVDGEACTLPQLELLEATARAIQPVLAWGPSGLAVVDGLSNLLKCR 1086
Query: 1139 LPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQE----NRNVTHESPFQYLHLGI 1198
LPATI CLSHPSAHVRALSTSVLRDI S+ K + +N + +++ +
Sbjct: 1087 LPATIRCLSHPSAHVRALSTSVLRDIMNQSSIPIKVTPKLPTTEKNGMNSPSYRFFNAAS 1146
Query: 1199 IDWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
IDW DI+ CL WEAHS L+ ++LD AA+ELGC I++
Sbjct: 1147 IDWKADIQNCLNWEAHSLLSTTMPTQFLDTAARELGCTISL 1171
BLAST of Spo04360.1 vs. ExPASy Swiss-Prot
Match:
GIGAN_ORYSJ (Protein GIGANTEA OS=Oryza sativa subsp. japonica GN=GI PE=2 SV=2)
HSP 1 Score: 1535.8 bits (3975), Expect = 0.000e+0
Identity = 805/1182 (68.10%), Positives = 940/1182 (79.53%), Query Frame = 1
Query: 55 MTRPSERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFI--SEQFPEDIAELIRIR 114
M+ +E+WIDGLQFSSLFWPPPQ++QQ++AQI AYVEYFGQF SEQFPEDIA+LI+
Sbjct: 1 MSASNEKWIDGLQFSSLFWPPPQDSQQKQAQILAYVEYFGQFTADSEQFPEDIAQLIQSC 60
Query: 115 YPSHEKRLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPND 174
YPS EKRL D+VLATFVLHHPEHGHAVV PI+S IIDGTL YD++ PF SFISLF
Sbjct: 61 YPSKEKRLVDEVLATFVLHHPEHGHAVVHPILSRIIDGTLSYDRNGFPFMSFISLFSHTS 120
Query: 175 ENEYSEQWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPSHSSS 234
E EYSEQWALACGEILR+LTHYNRPI+KV+ + S +CS + + A+ ++ + + S
Sbjct: 121 EKEYSEQWALACGEILRVLTHYNRPIFKVDHQHSEAECSSTSDQASSCESMEKRANGSPR 180
Query: 235 VQSEKRPSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAG-ELKPPMSASSRGNGK 294
+ +++P RPLSPWITDILLAAPLGI+SDYFRWCGGVMG+YAAG ELKPP +A SRG+GK
Sbjct: 181 NEPDRKPLRPLSPWITDILLAAPLGIRSDYFRWCGGVMGKYAAGGELKPPTTAYSRGSGK 240
Query: 295 HPQLMPSTPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAG 354
HPQLMPSTPRWAVANGAGVILSVCDEEV RYETA+LTA AVPALLLPPPTT LDEHLVAG
Sbjct: 241 HPQLMPSTPRWAVANGAGVILSVCDEEVARYETANLTAAAVPALLLPPPTTPLDEHLVAG 300
Query: 355 LPALEP-------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYAT 414
LP LEP YYAIATPSATQRLL GLLEAPPSWAPDALDAAVQLVELLRAAEDY +
Sbjct: 301 LPPLEPYARLFHRYYAIATPSATQRLLFGLLEAPPSWAPDALDAAVQLVELLRAAEDYDS 360
Query: 415 GMRLPRNWMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQ 474
GMRLP+NWMHLHFLRAIG AMS+R GIAAD +AALLFRILSQP LLFPPLR EGV++
Sbjct: 361 GMRLPKNWMHLHFLRAIGTAMSMRAGIAADTSAALLFRILSQPTLLFPPLRHAEGVELHH 420
Query: 475 EPLGGYISCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSS 534
EPLGGY+S Y++Q+E+PA+EATI+ATAQGIASMLCAHGP+VEWRICTIWEAAYGL+PLSS
Sbjct: 421 EPLGGYVSSYKRQLEVPASEATIDATAQGIASMLCAHGPDVEWRICTIWEAAYGLLPLSS 480
Query: 535 SAVDLPEIIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTF 594
SAVDLPEI+VA PLQPP LSW+LY+PLLKV EYLPRGSPSEACLM+IFVATVEAILRRTF
Sbjct: 481 SAVDLPEIVVAAPLQPPTLSWSLYLPLLKVFEYLPRGSPSEACLMRIFVATVEAILRRTF 540
Query: 595 PFESSPEENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCV 654
P E+S E++R+ R SKNLAVAELRTM+HSLF+ESC S +LASRLLFVVLTVCV
Sbjct: 541 PSETS-EQSRKPR------SQSKNLAVAELRTMIHSLFVESCASMDLASRLLFVVLTVCV 600
Query: 655 SHEAQTNGKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDS 714
SH+A G + R + S S N R R++ K++GPV FDS
Sbjct: 601 SHQALPGG------------SKRPTGSDNHSSEEVTNDSRLTNGRNRCKKRQGPVATFDS 660
Query: 715 YVLAAVCALACELQLFPLIANVSNPSSTKDLV-------VNGTSHDFKDGLDSAIRHTRR 774
YVLAAVCAL+CELQLFP I+ N S+ KD + G S++ + + SAI HTRR
Sbjct: 661 YVLAAVCALSCELQLFPFISKNGNHSNLKDSIKIVIPGKTTGISNELHNSISSAILHTRR 720
Query: 775 ILAILEALFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLD 834
IL ILEALFSLKPS++GT+W YSSNEIVAAAMVAAH+SELFRRS+ C+NALS L +CK D
Sbjct: 721 ILGILEALFSLKPSSVGTSWSYSSNEIVAAAMVAAHVSELFRRSRPCLNALSALKQCKWD 780
Query: 835 NEIHTRASSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRG-PPIVANRRKQNDCART 894
EI TRASSLY+LID+H K V SI NKAEPLEAHL+ TPV + PPI +D
Sbjct: 781 AEISTRASSLYHLIDLHGKTVTSIVNKAEPLEAHLTLTPVKKDEPPIEEKNINSSDGG-- 840
Query: 895 VCIQSEVPLTLCEDSSHSKNLLNCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDR 954
++ + + ++ LL C + V N D + +GK +A++Q +ASDLA+FLTMDR
Sbjct: 841 -ALEKKDASRSHRKNGFARPLLKCAEDVILNGDVASTSGKAIASLQVEASDLANFLTMDR 900
Query: 955 HIGLNCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDAL 1014
+ G S Q L+SVL EKQELCFSVVSLLW KLIASPE Q SAESTSA QGWR+VVDAL
Sbjct: 901 NGGYRGS-QTLLRSVLSEKQELCFSVVSLLWQKLIASPEMQMSAESTSAHQGWRKVVDAL 960
Query: 1015 CNVVSASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPES 1074
C++VSASPTKA+A IVLQAE++LQPWIA+DD+QGQKMWR+NQRIVKL+ ELMRN+D+PE+
Sbjct: 961 CDIVSASPTKASAAIVLQAEKDLQPWIARDDEQGQKMWRVNQRIVKLIAELMRNHDSPEA 1020
Query: 1075 LVILASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSL 1134
LVILASASDLLLRATDGMLVDGEACTLPQLELLE TARAV +++WG+SG++VADGLS+L
Sbjct: 1021 LVILASASDLLLRATDGMLVDGEACTLPQLELLEVTARAVHLIVEWGDSGVSVADGLSNL 1080
Query: 1135 LKCRLPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFK--FKQENRNVTHESPFQYLHL 1194
LKCRL TI CLSHPSAHVRALS SVLRDI +G + + E+RN +Q L
Sbjct: 1081 LKCRLSTTIRCLSHPSAHVRALSMSVLRDILNSGQINSSKLIQGEHRNGIQSPTYQCLAA 1140
Query: 1195 GIIDWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAIT 1217
II+W D+E+C+ WEAHSR A G T+ +L AAKELGC +T
Sbjct: 1141 SIINWQADVERCIEWEAHSRRATGLTLAFLTAAAKELGCPLT 1159
BLAST of Spo04360.1 vs. TAIR (Arabidopsis)
Match:
AT1G22770.1 (gigantea protein (GI))
HSP 1 Score: 1591.6 bits (4120), Expect = 0.000e+0
Identity = 842/1181 (71.30%), Positives = 957/1181 (81.03%), Query Frame = 1
Query: 59 SERWIDGLQFSSLFWPPPQETQQRKAQITAYVEYFGQFISEQFPEDIAELIRIRYPSHEK 118
SERWIDGLQFSSL WPPP++ QQ K Q+ AYVEYFGQF SEQFP+DIAEL+R +YPS EK
Sbjct: 7 SERWIDGLQFSSLLWPPPRDPQQHKDQVVAYVEYFGQFTSEQFPDDIAELVRHQYPSTEK 66
Query: 119 RLFDDVLATFVLHHPEHGHAVVLPIISIIIDGTLVYDKDKPPFASFISLFCPNDENEYSE 178
RL DDVLA FVLHHPEHGHAV+LPIIS +IDG+LVY K+ PFASFISL CP+ EN+YSE
Sbjct: 67 RLLDDVLAMFVLHHPEHGHAVILPIISCLIDGSLVYSKEAHPFASFISLVCPSSENDYSE 126
Query: 179 QWALACGEILRILTHYNRPIYKVERKQSSEDCSGSMNHATCSKASGAKPSHSSSVQSEKR 238
QWALACGEILRILTHYNRPIYK E+ Q+ + ++ AT S + ++P S Q E++
Sbjct: 127 QWALACGEILRILTHYNRPIYKTEQ-QNGDTERNCLSKATTSGSPTSEPKAGSPTQHERK 186
Query: 239 PSRPLSPWITDILLAAPLGIKSDYFRWCGGVMGRYAAGELKPPMSASSRGNGKHPQLMPS 298
P RPLSPWI+DILLAAPLGI+SDYFRWC GVMG+YAAGELKPP AS RG+GKHPQLMPS
Sbjct: 187 PLRPLSPWISDILLAAPLGIRSDYFRWCSGVMGKYAAGELKPPTIAS-RGSGKHPQLMPS 246
Query: 299 TPRWAVANGAGVILSVCDEEVTRYETASLTAVAVPALLLPPPTTALDEHLVAGLPALEP- 358
TPRWAVANGAGVILSVCD+EV RYETA+LTAVAVPALLLPPPTT+LDEHLVAGLPALEP
Sbjct: 247 TPRWAVANGAGVILSVCDDEVARYETATLTAVAVPALLLPPPTTSLDEHLVAGLPALEPY 306
Query: 359 ------YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYATGMRLPRN 418
YYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYA+G+RLPRN
Sbjct: 307 ARLFHRYYAIATPSATQRLLLGLLEAPPSWAPDALDAAVQLVELLRAAEDYASGVRLPRN 366
Query: 419 WMHLHFLRAIGIAMSIREGIAADAAAALLFRILSQPALLFPPLRQVEGVDVQQEPLGGYI 478
WMHLHFLRAIGIAMS+R G+AADAAAALLFRILSQPALLFPPL QVEGV++Q P+GGY
Sbjct: 367 WMHLHFLRAIGIAMSMRAGVAADAAAALLFRILSQPALLFPPLSQVEGVEIQHAPIGGYS 426
Query: 479 SCYRKQIEMPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLSSSAVDLPE 538
S YRKQIE+PAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPL+SSAVDLPE
Sbjct: 427 SNYRKQIEVPAAEATIEATAQGIASMLCAHGPEVEWRICTIWEAAYGLIPLNSSAVDLPE 486
Query: 539 IIVATPLQPPLLSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVEAILRRTFPFESSPE 598
IIVATPLQPP+LSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVE IL RTFP ESS E
Sbjct: 487 IIVATPLQPPILSWNLYIPLLKVLEYLPRGSPSEACLMKIFVATVETILSRTFPPESSRE 546
Query: 599 ENRRSRYSSEAGPSSKNLAVAELRTMVHSLFLESCVSAELASRLLFVVLTVCVSHEAQTN 658
R++R S ++KNLA++ELR MVH+LFLESC ELASRLLFVVLTVCVSHEAQ++
Sbjct: 547 LTRKARSSFTTRSATKNLAMSELRAMVHALFLESCAGVELASRLLFVVLTVCVSHEAQSS 606
Query: 659 GKSSRADDNIADCTTRNSESSQSSPSRSNNSKRQREPRSKKSKKRGPVVAFDSYVLAAVC 718
G S R A TT N E++Q SNN R +S+ K +GPV AFDSYVLAAVC
Sbjct: 607 G-SKRPRSEYAS-TTENIEANQPV---SNNQTANR--KSRNVKGQGPVAAFDSYVLAAVC 666
Query: 719 ALACELQLFPLIANVSNPSS-------TKDLVVNGTSHDFKDGLDSAIRHTRRILAILEA 778
ALACE+QL+P+I+ N S+ TK + +NG+S ++ G+DSAI HTRRILAILEA
Sbjct: 667 ALACEVQLYPMISGGGNFSNSAVAGTITKPVKINGSSKEYGAGIDSAISHTRRILAILEA 726
Query: 779 LFSLKPSTIGTTWGYSSNEIVAAAMVAAHISELFRRSKACMNALSVLMRCKLDNEIHTRA 838
LFSLKPS++GT W YSS+EIVAAAMVAAHISELFRRSKA +ALS LMRCK D EIH RA
Sbjct: 727 LFSLKPSSVGTPWSYSSSEIVAAAMVAAHISELFRRSKALTHALSGLMRCKWDKEIHKRA 786
Query: 839 SSLYNLIDIHRKAVASIANKAEPLEAHLSQTPVWRGPPIVANRRKQNDCARTVCIQSEVP 898
SSLYNLID+H K VASI +KAEPLEA+L TPV + N +++N CA T C + V
Sbjct: 787 SSLYNLIDVHSKVVASIVDKAEPLEAYLKNTPVQKDSVTCLNWKQENTCASTTCFDTAV- 846
Query: 899 LTLCEDSSHSKNLL----NCGKAVYTNNDAGNIAGKRVANIQFDASDLAHFLTMDRHIGL 958
+S S+ + N A +++ +G + K + + DASDLA+FLT DR G
Sbjct: 847 ------TSASRTEMNPRGNHKYARHSDEGSGRPSEKGIKDFLLDASDLANFLTADRLAGF 906
Query: 959 NCSVQIFLQSVLEEKQELCFSVVSLLWHKLIASPETQPSAESTSAQQGWRQVVDALCNVV 1018
C Q L+SVL EK EL FSVVSLLWHKLIA+PE QP+AESTSAQQGWRQVVDALCNVV
Sbjct: 907 YCGTQKLLRSVLAEKPELSFSVVSLLWHKLIAAPEIQPTAESTSAQQGWRQVVDALCNVV 966
Query: 1019 SASPTKAAAGIVLQAERELQPWIAKDDDQGQKMWRINQRIVKLMVELMRNYDAPESLVIL 1078
SA+P KAAA +VLQAERELQPWIAKDD++GQKMW+INQRIVK++VELMRN+D PESLVIL
Sbjct: 967 SATPAKAAAAVVLQAERELQPWIAKDDEEGQKMWKINQRIVKVLVELMRNHDRPESLVIL 1026
Query: 1079 ASASDLLLRATDGMLVDGEACTLPQLELLEATARAVQSVLKWGESGLAVADGLSSLLKCR 1138
ASASDLLLRATDGMLVDGEACTLPQLELLEATARA+Q VL WG SGLAV DGLS+LLKCR
Sbjct: 1027 ASASDLLLRATDGMLVDGEACTLPQLELLEATARAIQPVLAWGPSGLAVVDGLSNLLKCR 1086
Query: 1139 LPATITCLSHPSAHVRALSTSVLRDIQQNGSVQFKFKQE----NRNVTHESPFQYLHLGI 1198
LPATI CLSHPSAHVRALSTSVLRDI S+ K + +N + +++ +
Sbjct: 1087 LPATIRCLSHPSAHVRALSTSVLRDIMNQSSIPIKVTPKLPTTEKNGMNSPSYRFFNAAS 1146
Query: 1199 IDWHTDIEKCLTWEAHSRLARGKTIEYLDMAAKELGCAITI 1218
IDW DI+ CL WEAHS L+ ++LD AA+ELGC I++
Sbjct: 1147 IDWKADIQNCLNWEAHSLLSTTMPTQFLDTAARELGCTISL 1171
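The percentages in each HSP summary line above follow directly from the counts: identical (or positive-scoring) positions divided by the alignment length, which includes gap columns. The sketch below parses one such summary line and recomputes both percentages. The function name `parse_hsp_stats` and the returned dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern accepts the label "Positives" with or without its first "i".

```python
import re

# Pattern for the HSP summary line shown in the reports above.
# "Posi?tives" accepts the label with or without the first "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse an HSP summary line and recompute its percentages.

    Hypothetical helper for illustration: returns the reported counts
    and percentages plus values recomputed from the counts.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    ident, length, pos, pos_length = map(int, (ident, length, pos, pos_length))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": length,
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Percent = matching columns / total alignment columns (gaps included).
        "computed_identity_pct": 100.0 * ident / length,
        "computed_positive_pct": 100.0 * pos / pos_length,
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 842/1181 (71.30%), Positives = 957/1181 (81.03%), Query Frame = 1"
    )
    print(stats)
```

Comparing the recomputed value with the reported one, within a rounding tolerance of 0.005, is a cheap sanity check when scraping these reports into a table.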
The following BLAST results are available for this feature: