Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptide, CDS, exon, three_prime_UTR
ATGATTTCTTTTTTCCCGGGTTTCGAAATTGGGGCTCAGGTTTGTTCGATTTTGATTATGAGTGAACATGTGCCCAGGAGTTCGATTGTGACATCGAATTTTGTAAATTAGGCTTGGAAATGTGTATGAACTTGAATTAGGGGTCCGATATCAGGATTGGACCCGGGGATTGTATATTTGAATTGAAAGTTAGGATTGAAAATCGAACTTGGGGATTGAATTTGTAGAGGTTTGAATTAGGAATCGAAATTGGAATTTTGGAATTACAATGTGTTTTAATTGGATTTTGTACTTGACTTTTGAACTTAAGTTCGGAATTCAATTGAATTGAATTGAATCATATCTTTTGCAATCGATCGCTTTCATACAGAAATCAATCGCCCAAATCTATTGACGGTTTCGGTATCGGACTCGGGAACCCAAATGTGGTGTCTACGTTATGCATTCAAACATCCTTCCAATGATTAAAAGCAAACTACGAGATTTACAAACTTCCTTGAATCATACCCAACAACCAGGATCCGTCTGATTGAATCCAACGTCTGATTGAATCCAACGGCATAAGATAGATCTCGCGGCACTTTGCCCTTCACCTTCTCTCTCTCCCTCTATCTCTCTTCGCGATCAATGTAATTGAATCAAGCCCTAATCAGACCTCTACACTGTCAACAAATCCATCAATTTCTTACCAAATTCCCCTTTCTAATCACAATCAAATCCCTGAAATCACCAGAATTCAGAGAAATTGGATCGAGGTAAGCTTGAATTTCAAATCAATCTTCAATGTTACGAAAATGGCGATGATTCAATTTGATTTTGATTTGGTTCAAATTTGATCTTAGGATAGTGAAAGTGGGACGATGAACTTTTTGCTGCGGCAAAGCCCTAACGCCGCCGCAGCAGCAGCAACGCCAGCAACAACACCAGCAAACGCCGATCAACTTCCACCGGTTTCGCCGTGGTTGAAGAAAACTCCGTCGCAGAGTGTGACGACACTGGAGGGATTGATTGCGGAAGATAAGTCGCATTTAGTGGATCATTCCACTGCAAGTAATCGTGATAGTTTAACTGGTGAGAATGGTATTATGAGTTCCGCAAGTATTAAGGATTTCGCTCCGCTTGCTGATAAGCATGTAGACGTTGCCGAGGATGAAGGGTGGATCACCATTCCTCATGGTATGTTAATTCTTCTTCCTCCTTTCATTTCCAGCTTGAATTGTTGAATTTCAGTAATTGAATTTATATGCAGCCGGTGATCATTGTTTTGAGTGTGTTAATAATTGGTGCTGTACAGTTTGGGTTTGATTTAATGTTATTGGTTACAATTGAGGTAGAGTTTGTAATTTTTGGTAGTAAATTGTTTCAAGGGTCAACTGAAAATCACTCCTTTTGTTATTACAAAGGCATTATTTAGTTTGGGGATCGAAGTGCGTATTGTGCAGAGAGCTAATTGAGGTAAAGCTAGGAATCAAAACTCATTTGTTATTAATTAGACATTATGTAGGTTATTGTGTTGAGGGCTAGTTGTTCGTAAGGGGAAGGATTAGAAAAATAACTCCTTTTGTTATTACAAAATACAAAGGCATTTTGTAGTCTGGGAATCGAATTGTGAAATTGTTTGAGAGCTAGCTCTGCTACATGGAAAAAATTGGAAATAAAACTTCTGTTTAGGGTTTGGAAGAGTGAGGTTTATGGAGTTTTGGTTTTGTTATCATAGATGTTGTCTCTATGGAATTGTTGACCTCTACATTGGAAAAGGAGGAAAGCATTGACTAAGGTTTGGAAGTACTCTCCAGTTGTTGGAAACTACTTAACGACCTCATGACATGTGTAATTTAGCCAATCAGAATACAATGCTGGATAATTTCCCTGTAAAGGCCTTTTTGGTGTTCAATGCAACATCTTTCTGTCCTATGCACGCTGTACATAATTACTTTAGGTACATTTCTTGGTGAAGGTTGAGAGTTTCTAGCACATTCTGGACATTGTTTTTGTTGTCA
AGCTTTGGTGTATTTGATCGCTTAGAGCATTTTATTTATTTGACTGGAACATAGACTCGCAGAACGGTTAACATGGAAATCTATATACGACAAAGAGCCTTTGCATGGTGAAGTGTGTGAAGGATTAAAACCAAAGAATTATTGTTCAATATAACAAGATTAAAGATAAATAAAGGGAATACTTTGATGATATGTTTAATGGAGGTCAAGTGGGTGATACAGGAGACATCTTTATTTTGGATGAACATGGATTATACGTGGAGGTAGAGTGGTAAAGAAGGCAATCAAGGCACTGAAGAAGATGCAACTGAGGAAAGATGTTAGGCCATATGGTGTTCCTATTGAGATCTAGAGAGGCGTACGCAAGGTTGGTATGGAGTGGTTTACAAATCTTTTTAATAAGATTTGGAGGACAAACAAGATATCAGATGATTGGAGTTGATGCCTATTTAGGTATTTATATGAACAAAAAGGAGATGCACAAGACTAAAGTATCCATGGATAAAGCTTATTAGTCATCCCATGAAATCGTGGGAGAGAGTAATTGAACATAGATGTGGAAATTTGCATCTCAAAGTATGAGTGGGTTTATGCCTGGACAACCATCGACCTTATAAGGAAAACAATGAAACATTATAGGGCAAGCAAGGACTTATACATGGTCTTTATTGATCTATAAAAGGTGGTTATACGGAGAGAAGTGGTATGGTGGGCAATGGCGAAGAAGGGAATACTATGTAAGTACATTGGCCGACTCAAGAATATGTATCGACATTTTAAAGAACGTTAAGGTATTGTTTGGAAACTTGGATTTCATTTGAAATACATGAATTGAAATCTGGAAATTGAAATATGGAAATTGAAATTCTGAATTTGAATTTCAAATCAAGTGTTTGGAAAACATTATTAAATTTCAATTCTAGAATTTCAATTACACTATAATGTTTGGGAAGAAAGTGAAATTTCAATTCTGTAATTTCATTATTAAAGATATACTACAATATACTTTGTACTACAATATACTTTGTACTACAAAATGCAAGATCGAGTGTTATTGGCATTTTAACAAAATTGAAGGTGTTCAATACTTTGTAAAATCTAACAACATGCACTTCAAAATAAAAAATGTGTTAATTCATGGAATTATTTAATTAATCTACAACTGACTACAAAAAATCAAAACCAGCATCGAATTTGGGGGAAATCTAAGCACTAACTTTTCGAATTTGGAGAAAAATCAAACCCTAACTCTTTGAATTTGGGGAAAACCAAATCCTAACTCTTTGAATTTGGGGAAAACATTTAACTCATTGTATTTTCCTTGGGAAGAGAGTAATGTTGATCTAATATCCAAAAACTTCAGTCGGCGGTGGAGGCAACAATCAAATTGAAGGAGGTTGAGAGGAGGAGAGAGTCTATCGAGTTCTGAGATGAGAATGCTAAGGTCAACAATGGTGGATGAGGAGAGAGAAGGGAGGAGAGGATACTTTGAAGAAGATGAGCATTCACGGTGGATTTCAAATGCAAGCTATTCACTTTGGATTTCAAATTCTGAAACTGGGCTTGTTTCTTTAAGTGTTTGGAATTGGGCCAAATCCAAATCCAAATTATTTTAGTTTCCGAACAGTTTATTTGTGCCAATTCCATCATTTGAAATGAAATGACAGTTTCCAGTAGGCCATAGAGGATATTCCTATTACAATGGGACTACACCAAGGATCTACCCTTGAAACAGGAAAAAGGACAAAGTAACAAGGTCTATCTAACTTAGTGTTTGTTAGGGTGATGGACTATGGAGATCTCAAACACAATCGAGGAAGGCCAAAGATGCGTTGGTGGACGGAGGAGTTATAAAGGACATGGAGAGAGTAGGCTTGCAAGATGATATAGCATTAGATAGAAGCAAATGGAGGAAGAGGCTCTTTGTGGATGACCACATGAGATATGCTTTCCGTGCATACAACCAACCTATCTCTTTGGGAGTAAGGCTATGGCTTTGT
CGTTTTGACCGAAACTAGTTCCACTTTTGAGTTTTTAATTTATTGTTTATTGACATCGCTGAACTTTATCATGCCCATGTTTGAGATCCTATCGATCACGAGTTCATTTGTCTTCTTCACCAAATACTTATACGTTCATAAACTTTTACTTTATAAGAAAAGTGCCAATGCAACTCATTCTTTGACATGCTGTTTGTTACCATGCTTGATGAACTGATAGTTTTGCTATCTTTTTAATTGAGCTAAGTCAAAAGGTTTTATCTGGCTGCTTTATACCTCATTTTAGTAATTTTGTCTAGTTGTAGATTATATAATGTTCATGTCTTTTCTGATACTTGGTATGCATCTGCAGAGAAACTTCCTGACGATTGGGCTGGTGCTCCAGATATTCAGTCGTTCCACTCTCTTGACCGTTCTTTTGTTTTTCCTGGTACCATCCTCTTTTTATTTCTTGTTTTCTACCATTCTAATTTGCATTCATATTGTTTGCACATTCTGTAAGCTGAGTATGAGCCAATACCTGACAATGCAGGGGAACAGGTTCATATCTTAGCATGTTTATCTGCATCCAAACAAGACACTGAAATCATAACCCCATTCAAAGTTGCTGCTGTGATGAATAAAAATGGTTACACACAAGGTTCTAAGGACCAAAATGGAAATGTTGCAGAACTAGAATCACCTTCTGGAGCTCAGCCAGCTGGGTTTGATGATAATCTAGATGCAAAAAATGTAAATAAGCATCAAGTAGACGTATCTGCGAGTGAGTCTTACCTTAGAATGGAGAATCATAGAAGACAAGTGCAAACGTTGCTGCAGAGGTTTAAGGACTCCCATTTTTTTGTGCGAATTTCTGAGGCAGGAGAGCTTCTATGGTCAGACAGAAGCACATCAGAAGCACCTGAGACGGAAGATGACGCTGATAGTATGGAGAACAATAAAGCTGCTGGCTCAAGAACCTCAATAAGTGCAGTTATTGATAGTGGAAGGTTTGATGCTCGGGCTTCTGGTGGTGTGGCGAGAAATTCTGTTAAGTGTTATTCTCTTGCTAATGGAGACATAGTGGTATGTTTCAAAAAATCTTATGGCCTCTGCATGTCACTATGCTTCCTAATTTTAAAAGTATTGATGCATAGAAAAGATGTTGTTCCATTACCTTATGTAACTTGCATTCCCTTTTCGTTTGATGGTCTAGGCCTCACAGTGAGGTAGCATGTCGTTGATGGCTGGCTGTTTAATTTATCGGGGTATGTTAATTTGTTACTGATGTGGTATAAATGCAGATAATTTCTTGACAGTACAAATGTAAAACATATTGAGAAAATGAATTCATTTTTTGATTTCATGTATGGGCAATACTCATCATCTCATTATCTTCCAGTTTTTTTATTTAAATGTTTCTTATTTTTAATATAGTGCATGTGATAAAACTATTTTCAAGGCTCCAATTGGAGTTTAAGGATATTATATTTCCTACCGTTGCTAGACTATCAAAGAGTACTATTATGATTGTAACTGAGGTTGTATGAGTCCTTTTATCGGGGCTTTTCAGGCTGCAACAGGAAGCTCTTGTTGGATATTTAACTTTTCTGTTTGGAAAATGGGTAGTCTTGTACCTTTATTTAGATTTTCGTGCGTTTTCTATAGAGTTAGTCAATTCACACTCTTTTTCCTATACAGATTCGAAGCTGCACAACTAAGACTTGTTTTATTCTATATAGTTCTTACACTAAAGGTTGACTATTTTTCAATGGTTTCTATGCACTTGATTTTTATTTCCTGGTATAACTATCCTCCTCTTCATGTTTCGTGTGTACATTGTTTGCTGCTATGTTTATTTGGATTTAAACTTGCAACTGTATACGTTTTTCTTCTTTTGATCTGAAATTAATGTCATGCATTCTTATTCTTATCTCACTTATTTTGTCTTATCCCCATTCTTGTTTGGAAGGATGCAGGTTCTTTTACAAGTGAATGTTGGTGTTGATTCTTTGAGAGAT
CCTATACTAGAGATTCTTCAGTTTGAGAAACATAATGACAGAACTACGGCTTTAGAAAGTGATTTACCTGTCAAGGAAGAACATAGCACATGTGGGGAACTGTTGAAATGGTTGCTTCCTTTGGACAACTCTCAGCTTCCTGTAAACCGACCTTTATCCCCACCCTTACTCAATTCATCCCATAAAACAAGCTTTTCAGGTTCGAGTGGTTCACAGCTTTTTTCTTTTGGTCACTTTAGAAGTTATTCTATGTCGTCACTACCACAAAATACTGCACCTTCTGCATCACCGTCATCAGTTTCTTCTGCTTCTTCGAAGCCAAATTTGGATTTAGAAGACTGGAATCGTTTTGGGTCACAAAAATCCCTGAAGAGCCCGAGAAGTGGAAGCCAGGGTTTGCTGTCCTTTCGTGGAGTTACATTAGAGCCTGAAAGATTCTCTGTTTGCTGTGGACTGGAGGGAATTTACATACCAGGAAGAAGGTGGAGGAAGCAGCTTGAAATTATTCAACCTGTTGACATTCACTCATTTGCTGCAAACTGCAATACGGATGACATGCTTTGTGTTCAGATAAAGGTAAGTTTTAATTAGCAGTCATGCTTTTGTCATATATGTGTATATTTAAGGGGCTTACTCATAGTGTTATTAGTTATTACAGGTTTGGTTTTGAACTGTGCTACAATGGTTTGAAAATTCTACTTTTATTTTTGCAGAATATATGTCCAGCGCATGTTCCAGATATTGTTGTATACATAGATGCTATAACTGTTATTTTAGAAGAGGCATCAAAGGAGGCAGACCGGCCTTTGTCTGTTCCAATTGCATGTGTTGAAGCTGGAACCGGCCAAACTCTGCCAAATTTGGCTCTGAGGTTGTTTTTATCCTTTGAAGCTTGTTAAATTTTGGTTGTATGGTGAACGGAATCACACTGTCTCTTAATTAGTCAGTTTATTCCTTTTTCTTTGAGGGGTGGTGGGTTGGGGTATTAGGAAATGTATTGAAGTTTTATAGTATGTGACAATAGGTAGAAGTGTAGAACATTGTCCTTTGAAGAGAAGTTGTGTGGCTAGTCTCTCCTGTGTGTATGGATTACGTAAAAACTTCACTCCTTTGAAAGATGTTCTCCCACTCGTGTGGGTAGGCCTTAGCGGTGGGTTCATTATAATGTACCCCTTCTAATGTCTAGAATGGTTGCTCGGATGTACCTTGGTCTGCTTTCTTTTCTTTTTCTGTCTGCAAGAAGGTTATACATATTCCTCAGATTCAAAGGACTTTAGTTTCCGCCTTCCTTAATGTCATGCCTTCAAATAGGCTCCACTCTGGGCATGGTAAAATGGAGGTGGGAGGTAGTGTATACAGTGTCTTTATCCCTGCGACAACTGATAGGTTGTTTTCTATTGACCCTTGACTCAGTATGTAATGACTGCCTATTACTTCATAACCGAGAAGCACATATAGTTTGATTTTGGGGATTTTTACCATGTACCCTTGAGGTTTTAAAGAATGTGTGAGGTACCTCTGTGTTTTTCACCTTGTACTCCTAAGATTTGGCATAATATACGAATGGGGAAAATTAATGGATGTTAAACAGAGTTTGATAAAGATTACAGAGAAAAGTTAAGCTCCATTATCCCTGCACACTCTCTCCTCTATTCCACGCTCTTGCCACTGTCTCAATTGCACTCTTTTTCGGCCATCCTCATCCTTCCACACTCTCTCTCCTTTATCCCTTATCCTCCAACCCTGGTACTCTATCCCTCCTCCCTCACCACTGCTATCCTCCCTACGGTTGTAAGTCTCTCTACTGTGATCTCGCATTCTGGTAACAAGCGCTTGCTAAATCAAGCTAATTCATGGTCAATGGATTCACTCAGTTTATTTCTCTGTATTGTAGTACTTAGTAACTTAGTATGTTGGCTGAGGTCAAATTTCGGCAGATCTACTTTGTTGATTTTGGACCTTGCAACCATCCCCTGTGTTATTTATCAGACTGAGGCTTC
CTCCGCTTTCCAGATTGTTTTTGTTTTCAATTATTTTGGTTGAATTGCAGAAATTTTGGACATTGTTAAATAGAAAGTGAAGAGAGCAGTGATGAGAAAGAGATAAATTGGGTGGGGAAGGGTTACTTGGCGGATTTGTGGAATGTTTTTTTTTTTTTTTTTAATTTTTTTTTTTATATTCAACTTTAGAGGTACTTGTTGCGTTACTCAAAACCAAGGGTACCTCGTGTATTCTGTAAAACCTTAGAGGTACGCGAAGAAAATTCCTTTTGATTTTTAGCTTAATCCATCATCATCTAAAAGCTAAAGTTTAATAAACCCTTGACTCTTCTGGAAATGGAATTGATTTCTAGAGGTGTATTTCTTTTGTGGCACAGAAAATTTGACTTGTAAACTTTACCATTGTCACCAAAGCAGGGACCATAGGAAGTTTGTTTTGTGTTCGTATCAGCAGTATCTTTTTCTCAAGCTATTAGTAGCTTGATCATGTTCCTCTCACTGTACACTTTGTCTCTCCTATTTGTTGACTATTTATGTTAAGGGTTCACCTTCGAATTTAGAACAATACTACAGACTGAGGAAATTGAAGGTGTTTTTAGATCATGTTCCTCTCACTGTACACTTTGTCTCTCCTATTTGTTGACTATTTATGTTAAGGGTTCACCTTCGAATTTAGAACAATACTACAGACTGAGGAAATTGAAGGTGTTTTTAGATCATGTTCCTCTCACTGTACACTTTGTCTCTCCTATTTGTTGACTATTTATGTTAAGGGTTCACCTTCGAATTTAGAACAATACTACAGACTGAGGAAATTGAAGGTGTTTTTATTCCCTTGTTGACGCATGGAAGATATATTAGGAGTTGCATCTTTCTGACGATAAAGGATTTAAACAACTAACTGGGAAAGCTGAGCTTTGAGATTTAAACAACTAATGAGTACATCTGCTTGTAGAAATAGATGTGTTTTCCTCCGCCATTGGGAAGTAATGGATGTCATCTACTCATCTGGCCCCTTTAAAACATTCATGAAGGACCCAATGTTCTTTTTGGGGGACATTGAGAATTTGCCGTCACTTCTTTCGCGAGTTATCTGTTCATCTGTTTATTAACAGATCTCTTGGGTGGGAGCAGTTCATTTTTTTTTAGATTTCTTATTTTCCTGTAAATTTTCCATTGTATGAACGATGCGCAGTTGGATTTCAATGAAAAAAAAATTAATTGGTTGTAGTTTTACCTGTTATTTTGGTGATACTGGTCATGAATTCATGATATGTGGATGTGCGAGGGTTGTGTCATGAGTTGTAGTTGCATTTTGCCTTGTTGAACATTTTTTGAGTGATTTTAAGTTTGTATTTAGGTTCTACTGCCTCTTTTGCAGAGTAATTGTCCTTCTACCAGTCCTAAATGCGCTACTGCAATAAAACAAATTTTTTTATTAGTTCTATTCTGCTTTTTGTGATATGATTAAAAATTCAATTTGGCCAAACACTTCCAATTGCAGGAATCATTTTATTTTTTTTAACTCTGATCTTTCTTCCGTGAAGAAATACTTGTTAACAGCTGATAAGAGACATAACAAGATATAGCGGTTTTATTTAAGATGATCATTTGTATTGAGATTCATTATTCCTTTTCCATTGGCGAGAGGAAATTGTTTTGTTTCAAAATTATGACGTGCAAGGAAAAATTTGTATGATAAAGACTTTGTCCAGCTTATTTCAGCTAGCTTAGCTATTAGTTTGCAAAGTATCTATGTGATAGTCCATGGCCCTAGTTAAACTTTCTCAACTTAGGCTTTGTACCTCCTCTTCACCCATCAGTTGTACTGGGAAATCGTAGACTTCTTATTTTTGTGGCTATGCTGTGTACTGTGGCTATAAAACTATGGTAAGATGCTTTTAGTCTTATGTCTGGTGAATAACCTTAACTTGATTGTTGCTTCAATTAGTATCACGTATTTTTTTTATTACCTTCATGGAATTAATGAAAATACAGTT
CTTTGAAATCCTATATGGCTATGCCAATTATTTCTGTTCGCAATTTCATGGAAACCATCTCGTTGCAGGAGAGGTGAAGAGCACTCGTTTATTCTCAAGCCAGCAATTTCAGCCTTAAAGAAGTCGAAGATACCTGATGATAAACTTACTAAGTTGCATTCACGAGCTGGTAGCAAACCAGCAAGTTTTCGGCTGCCTCCCGTAGCTGTTGAAGAAACAAAGAATATATCTACCCCTCATAAGTATGCCATATTGATATCATGTCGTTGTAACTACTCTGGTAATCCCTAGTACTTTTCTTAACTTGTCCATTTTGATTAATTGTCTTGACTGTTCTTTAGTTGACTTTTGTATTTTCCGTAGAATCGAGACTTTTTTTCAAGAAGCCCACTGATTGGCGACCACGCATTTCAAGGGACTTTATGATATCAGTTGCATCTGAAACATCAAAACAAACTCTGGATTGTACTGGAAAAGTCTCACAGCTGCCGGTTCAGGTCTCTTGCCAACACCACCTATGAAGAGCTTTTGTTATTTTTTGAAATGTGTTGTCTGATGATTTAAAAAATTTCAGGCGTTGACTCTTCAGGCATCAAACCTTACGTCTGAAGATCTTACATTGATCATACTTGCTCCAGCATCAGTGACTCCTGCTCCTTCAGTGCTGTCACTAAATTCTACACCATCATCACCAATGAGCCCATTTAGGGGTTTTGCTACAACACAAAGATTAGGCACAGTAACTCGTTCTACAGAGGATAAATCACCAAGTGGCGCACCGTCTCTTTCTTCTGATCAGCAGGGTATTTCAGTTGCTGATGTCATTCCAACAACTGCTTCGAGCTGTACTCACTTGTGGTTGCATAGTAGGGTTCCATTAGGGTACATACCATACTGCCTTTTGGTTCTTTCCAATTAGCCTTTTCACTTGTAATGTAAGAGTTTGGTTTGACCTTGGTGCATTGTTGTCTTTACAGATGTGTTCCTTCTCAATCTACTGCCAAGATCAAGCTTGAGCTGCTGCCACTGACAGATGGCATAATTACGCTTGATACTTTGCAGATTTATGTTAAGGAGACAGGTAATCGATATATTAATAAAAATAAAAATGAAGCCATCTCTATGATTAGAAGTTGTCTTGGGGCACATTGCAAGTTTTATAATACCTACCAGATTGCAAGTTGCTGTTATGTAGTGTTTCCTAACTTCCCTTCATTTTATGAACCTTATATCTTTGGTTCATTTGGATTTTGTTTAAACATGTACTTTGTATTATTTCTCGCTGCGTGAACTGTTGAGAGAGCTATGATCCAATATTACAATATTGCACGTATATGACTGAGCTTCTCTCGCTCTTACCACTTAACACATCACACACACACACATGGTGGTACTTTTTTCTTTTCTCTTGGGGGCTTAAAGAGTAGGTTTTATGTTTCATATAGCTTATTTTAAACTTGGATGAGCATCATGTGTTCTAACGACATTATTTTGCAAACAAGAAGGTTTTTTCTATAAAGTTTAATTTTAGACATCGAGTTAGGCAGAAAATAAGAACAGGGGGGGGGGGGGGGAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGAGAAGCCTTGATAGTTGATACTTCTTTAAGCATGCAGAGTATCGTCGAGTTCTTGAATGATCCTGAAATTTTACTTTGTTCAATCAATCCATATGTTTTTAGATGTGTATTTCAAAGCTTAAATTTTTGTGAATGACCCTGTCATGTATCCAAGGGGACCAGAGGATGACAGATGAATATTTTTTTTCTTTTAATGAATTGAAGGTTTCCTCTTTCTTTTTCTTTTCTTTTCTTTTACT
TTTTTTGATAAAAAAATGTGTTTTGCTAGTCTGATAAACATTTTGTTGTCCAGTCTTATCCTTGACTACTTTTGTCTTATATAATCCGGTCAAATCACCATTTATGCTTTAGCTAACTACTTTTGCCCAACTTTTATTTCCATTGGCGACCATTCCATGCCTTAAGGAAAAGACGGGGAAAGGTGAAGCATTAGTAAGGGTAAATAGCGAGGTATGGGATTGATGGTGTTTGGATGAGGAGAACTTTATTAATAATGGAGTTTTAACATTTGATCTCACATTGCTAAGGCTTGCTCAGCCTGTTCTCATGCAAATTGTGACATCTCCTATAGGAATGAAGGAATATAATTCTTCCTCAGAAGAAAGCCACAAAGTAGCGGTTGGATGAATGATGACTAGTATTATCTTTTCACGTTTGTGGTTAGTGAAGGTCAGTCTTTTGAGAGTTATGTTAGATAGACCAAAAGGGCCAGTGCCATTTTGTAGCCTACAAAATAAAAACTTTCATCCAAGAACCTACGCCTCATTGTTAGTTAGGATGTTATTTCACTCTTCTGATTGCCGTTCATAGCAGAATAGTGGAGACGGCAGTTATGGTAGAGACATGGTTTTGAGAACCATCTTTTTCCTGATAGATATTTTCTTAAAATGTTCTTGACCCTAATCAAGGCTTCTGAAGAACATGTGTTAAATTTTGTACTCTGTATTCATCTAGGAGGGGTTTAGGCATTGACAAGCATTTACTATTGGCTCTCTGCAGGTTTTGATGCCCTCTTTTAACCAAGTTTGTTTAACTATTAGACGCGCCTTTTACTTGCTTAATATTCCTTTGGTCACAACTATAGGTGATGTTTGAGATCCCGTGATTGTATCTGAAGATGGTGGGGTGATGTTCTGATGATATTTGTCTTTGGAAAGATAATATAAATTGAGTCACTTTTCTTTTTTGTTTTCATTGTGTGTACATCCATGTTCGGATTCCTCAACATGGTGAAGAAAGCGGGGGAAGGTTATTATAGGACTTCTTATGGTTAGGTGTTCTGCAATATTTGTGGATCATTTCAAGGGGTGGGAGAACGGGAAGGCTCATAGACCTTAGAAAAATAGTGTCTTTCTGAGAATTACTATGGTTGTTCGATTTGGCTTAAGTAGTTTGGCCATGTAAGACGCAGGTCAAATGATGCGCCGTTTAAAAAGGATTGAAGAATGAAGAGTTGCAAACGGATGGAATTGTGAGGGGTAGGGGAAGACCAAAGTAAAAGTGGAAGAGGGTGATTGAGAGTGATATGAGTTTATTGGAGATTGAGGAAAATACAGCTTTGGATCGGGCAGAGCGGAGGGAGATAATATATGCCGACACTCAACATGACCTCCTTCTCCTCTCTTTTTTTTCTTGTTTTCTATTTTCTTTGCAAATCACTCTCTTTTTCTTTTTTTTCTTTATTATTTTCTTTTTTTTTCAAAACATATTTGCGCCTCTTCCTTAAACCAACTATCTTGTTTCTTGTTCCCTGTTAACCTCATCTTCACTTTCCCTTTTTTGCACGTTAACTCTCTTTAACATAACCTTTTTTCTTAATGACGGGTAATGTTAGCCGACCCCAATCTTTTGGGACTAAGGCTTGGTTTTTGTTGTTGTTCGATTTTGCTCAAGTGTATTACTTAGAAAATAGGAATTACAATGCAAAATCAATAAGAACAACTCACAAGGATTTCATTACTGTGGTTATATTAATCCACCTCTTCTTAAATTAATAATACTTCAAATAACAGCTACCATGTGTTGGCCTTCCTACAACTACATAGTCTGCATAGCTTGCCTATCACAACTTCATAAAAATACCCCTTATTATCTTGTATTTAAAGGCCTACATGATCTAATTCTACTAAGATATGTAAATTACCTTGACACTTCTAATGTAACTAGGAAATAATATACTTGTGGAATAAGTAAATATTTCTTCGTTTTTTTCCTAAACTTCTCAAAAAATATACTTTCA
TAATATTCCTAGTAAACTAGGAGACTTCACATGGTTCCAATAACGTTGACTACTTCTCAGTCACTTTGCCGATGCGAAAGACTTGCGTACCAAACTTCAACCTCAATACTTTTAGTCAATGGAAACTTCTCGTGTGGCTAGTACTTGGGATGCAAACCAACAAAGTGCTCCTCAGCAAGTAGGAATGATGTTGCTTACGCACAATTCTCATAACTCTCTTTTGTTGACAATCTGTTGGAGGGGGTAATTTTTTGGTGTTGTCTGGATAGGAAAATGGGTTTCCGTAGAAACTTTCCCTTTCTTATGTAATTTGTCTCACAATTCTTCTGTCAATATTAAATAACATTTAGAAATTTTATAAAAACACTATGATTCAAATTGATTATCTAGTTCTGATAATGTTAACAGTACTTGTATATTTGATGTGTTGTTTTTACTTAGTTATCTTACTATAGATGTAAGGTTCTGTTCACCTTAAATCCACTTATTTCAGGAAAAATAAGTTCGGTTAAGTTCAGATAAGTTTAGATAAGATAAGTTCAGAAAAAATAAGGATTACCCAAGTACAATTTATAACTAAAATAAGTTCGATAAGTTTAGGTAAAATTCGGGAAAAAAATTGAAATCAGGTGAATAGAACGCACCCTAAATCAAGTTTTTTCAATCTACTTATGCGAACTCAACTCAAGTCGAACTCGTCAAAAATTCTAGTCAAGCTCAAGTTGTGTTGGGCTTGATGGCTTAATTCTACTCAAGTTTAACCTAGTTTTGATAAAGTTTGTCTCTATAAATCTTCAAGCTGTCTTATCCTGTTACCCCCTTCCCCCCCCAGTATAATGGGAGGATAGAAGAGCTCATCGCTCTGCCAAGATATTTTGTAAGGTTTGGGCGTGCTGATTATATTGAATATACTCTTGTGAATCCTTCTGCAGCTATCCCACATTCCACTTTCTTTCACCTTGATGTCTTTCCCTTTGCGTCCTGTCTGAATAGAACTTACGCTGCCTGTCCTAGAGTTAAGAATTTTTAGAAGGCTCTTTCTCATGTTTATGGGTGTTGCTTCTATAATTAGAATTCTTTCCTCCCTTGCTTTATAGTATCTGGAATTCGAGAGTTGGCTCTTTGATGTCTTATTTGAAGGCCATGTCATCCCGAAAACTTTGTTCTTCATTCAGAAGTTTGTGGTGTTGAAGACTAAAAGGTAAGGCAGAGTATTGTGGAAGCGTGTTAAGGTTTGTCTAATGAATTTTGGAAGTCTCGCCTCAGGGTTGATTTTGGTGGGAAGAGACTGTAGTTCACCTGTTGGCTCTTGGTGACATTGCTCCTCTACTCTTGTTCGGAAATCCAACGCAGTTGAAAAACTGTATGGGTTAACTTTTTAAATACTATGATACTGAAAGAATGGCTTTCTATCTTTCTATTTTCATTGGTTAATTAATGAACAGTTTTGTTCGTAAAAAATCTAGAAAAATAGATTTGAGGGCCAGGTGCATAGTCATTTAAGCTTTCTAGAAATGAAAGTACGACTTCCATGTTAGAGAGGTCAAGTCAAGCTTTATGCTTAAATGCTCTCTTATCCTTTTCCTTAATTTTTTTTTATCATAGTTTAATTGTAGGATCTCTTATTATGAACAATGGAACTAGATTTCAAATTTGCTCAAATCCCAGTCCCTCTTTTTCTTACTGATTGGAATGGAAATGGAATTTAATTTTAACAAAGATGAAAGTTCCTCACTTTGTGATGAAATAGAGCTAAATATGTAATAATCTTCCTTATTGCAACTTTGTTTGATCTTTGGGTGGATAGGAGGTCAGTTTTTTTTTTTTTTTTGGCAGCTCCACACACCCCTACGCCCGGGTTGTCCACAGCAAGCCCATGGGCTTCACCAACTACTGCTCTACTTACCTTATTACACTGGACATAAGTACACTGATTCACTACTTGCCTAAAGTCTTGTAATATGTTACTAATATCCTTTGGACTAGTAACTGGACAATTT
AGTCAATCCTGCATTCTTTTGAAGAAATTCCAGTTTTTTTTATAACAAAAGTCCTTTTCTAGAAAGCTGGACCTTGTCAATCTGCATCATATTGTTTTCTGATCTGTATTACATCAGTGTTTCACTTCAGTTTTCAGTGTAAAATATCTGTAATACACATTTAGTACTTGCAGGTTACAGTCTTTTATAGTAACATGAGATTGTGGACCTCTTATTCAGATTGAAAACATTCTATTTAATTGTTGTATCGATTGAATTGTTTATGGTTGTTTTTGTATGTATGTGAGTAGAAGGAAAAGTTGTTGAAGATGAATACATTCTAATGTTCAAATTTGCACCATGCAACAGGTCTAACTTATATCCCTGAGAATGCGCTGAAGATCAATGCAACTTCTAGCATTGCTACTGGAATCATCTAAGCTAGCTACCGGCGGTTGGTCATTCACATTCTTTGTATTTGTATGTTCTCTCCTTGATGAAGAAACCCAAAATTAAGAGGGGTGTTTCCCTCCCGTTGTACATCACCACCTTTTTTTTTTGCATTAATGTAGGATATAAATTCGAGGTGCTCGTATTTCTACCTCCAGAATTAGAGGCTGTATTTATGTCTCGAGGTTTGTTTCTGAGGCACTGTTATATCTATCATCAGCATGATGGAGGTATGGCATTTGTTAATACCTCTGCTATATAGATTTTGCTGCTTTATGTAGATGTATTTGTGTAAAATATCATTGAGTTTTGAGTTGAATCGTAAAATATACGAACTAGATGTTACAGGAATTGCGAGTTTTCTGAGATAATCCAGGTGCTTTGCATGGCAGTCTACCTCACTGTACTTTCCAATCTTCTCTTGTCAGTTGTCACTAAGTACTCGATACTGGCATATCGGGCCGTTAAACCCAGTCCATTATACGTGAAAGCGTATCCCAGCACCTAATCGCCAACGATTTGATTTACCCTCCTAATCGCCAACGATTCGATTTACCCTCTGTTATTGGCAAGATATTGTTTAGGAACTTACAAAAAAAAAGACATTATATTAGTGTGAGGTGATTTATTATAAGAAAGTGTATAATTGGATAGTTTTCCCAAAATATTTTTGCTGGTCACTGAAATGGAATTAAGTGTATGTTCTACTCC
mRNA sequence
ATGATTTCTTTTTTCCCGGGTTTCGAAATTGGGGCTCAGATCTCGCGGCACTTTGCCCTTCACCTTCTCTCTCTCCCTCTATCTCTCTTCGCGATCAATGTAATTGAATCAAGCCCTAATCAGACCTCTACACTGTCAACAAATCCATCAATTTCTTACCAAATTCCCCTTTCTAATCACAATCAAATCCCTGAAATCACCAGAATTCAGAGAAATTGGATCGAGGATAGTGAAAGTGGGACGATGAACTTTTTGCTGCGGCAAAGCCCTAACGCCGCCGCAGCAGCAGCAACGCCAGCAACAACACCAGCAAACGCCGATCAACTTCCACCGGTTTCGCCGTGGTTGAAGAAAACTCCGTCGCAGAGTGTGACGACACTGGAGGGATTGATTGCGGAAGATAAGTCGCATTTAGTGGATCATTCCACTGCAAGTAATCGTGATAGTTTAACTGGTGAGAATGGTATTATGAGTTCCGCAAGTATTAAGGATTTCGCTCCGCTTGCTGATAAGCATGTAGACGTTGCCGAGGATGAAGGGTGGATCACCATTCCTCATGAGAAACTTCCTGACGATTGGGCTGGTGCTCCAGATATTCAGTCGTTCCACTCTCTTGACCGTTCTTTTGTTTTTCCTGGGGAACAGGTTCATATCTTAGCATGTTTATCTGCATCCAAACAAGACACTGAAATCATAACCCCATTCAAAGTTGCTGCTGTGATGAATAAAAATGGTTACACACAAGGTTCTAAGGACCAAAATGGAAATGTTGCAGAACTAGAATCACCTTCTGGAGCTCAGCCAGCTGGGTTTGATGATAATCTAGATGCAAAAAATGTAAATAAGCATCAAGTAGACGTATCTGCGAGTGAGTCTTACCTTAGAATGGAGAATCATAGAAGACAAGTGCAAACGTTGCTGCAGAGGTTTAAGGACTCCCATTTTTTTGTGCGAATTTCTGAGGCAGGAGAGCTTCTATGGTCAGACAGAAGCACATCAGAAGCACCTGAGACGGAAGATGACGCTGATAGTATGGAGAACAATAAAGCTGCTGGCTCAAGAACCTCAATAAGTGCAGTTATTGATAGTGGAAGGTTTGATGCTCGGGCTTCTGGTGGTGTGGCGAGAAATTCTGTTAAGTGTTATTCTCTTGCTAATGGAGACATAGTGGTTCTTTTACAAGTGAATGTTGGTGTTGATTCTTTGAGAGATCCTATACTAGAGATTCTTCAGTTTGAGAAACATAATGACAGAACTACGGCTTTAGAAAGTGATTTACCTGTCAAGGAAGAACATAGCACATGTGGGGAACTGTTGAAATGGTTGCTTCCTTTGGACAACTCTCAGCTTCCTGTAAACCGACCTTTATCCCCACCCTTACTCAATTCATCCCATAAAACAAGCTTTTCAGGTTCGAGTGGTTCACAGCTTTTTTCTTTTGGTCACTTTAGAAGTTATTCTATGTCGTCACTACCACAAAATACTGCACCTTCTGCATCACCGTCATCAGTTTCTTCTGCTTCTTCGAAGCCAAATTTGGATTTAGAAGACTGGAATCGTTTTGGGTCACAAAAATCCCTGAAGAGCCCGAGAAGTGGAAGCCAGGGTTTGCTGTCCTTTCGTGGAGTTACATTAGAGCCTGAAAGATTCTCTGTTTGCTGTGGACTGGAGGGAATTTACATACCAGGAAGAAGGTGGAGGAAGCAGCTTGAAATTATTCAACCTGTTGACATTCACTCATTTGCTGCAAACTGCAATACGGATGACATGCTTTGTGTTCAGATAAAGAATATATGTCCAGCGCATGTTCCAGATATTGTTGTATACATAGATGCTATAACTGTTATTTTAGAAGAGGCATCAAAGGAGGCAGACCGGCCTTTGTCTGTTCCAATTGCATGTGTTGAAGCTGGAACCGGCCAAACTCTGCCAAATTTGGCTCTGAGGAGAGGTGAAGAGCACTCGTTTATTCTCAAGCCAGCAATTTCAGCCTTAAAGAA
GTCGAAGATACCTGATGATAAACTTACTAAGTTGCATTCACGAGCTGGTAGCAAACCAGCAAGTTTTCGGCTGCCTCCCGTAGCTGTTGAAGAAACAAAGAATATATCTACCCCTCATAAGTATGCCATATTGATATCATGTCGTTGTAACTACTCTGAATCGAGACTTTTTTTCAAGAAGCCCACTGATTGGCGACCACGCATTTCAAGGGACTTTATGATATCAGTTGCATCTGAAACATCAAAACAAACTCTGGATTGTACTGGAAAAGTCTCACAGCTGCCGGTTCAGGCGTTGACTCTTCAGGCATCAAACCTTACGTCTGAAGATCTTACATTGATCATACTTGCTCCAGCATCAGTGACTCCTGCTCCTTCAGTGCTGTCACTAAATTCTACACCATCATCACCAATGAGCCCATTTAGGGGTTTTGCTACAACACAAAGATTAGGCACAGTAACTCGTTCTACAGAGGATAAATCACCAAGTGGCGCACCGTCTCTTTCTTCTGATCAGCAGGGTATTTCAGTTGCTGATGTCATTCCAACAACTGCTTCGAGCTGTACTCACTTGTGGTTGCATAGTAGGGTTCCATTAGGATGTGTTCCTTCTCAATCTACTGCCAAGATCAAGCTTGAGCTGCTGCCACTGACAGATGGCATAATTACGCTTGATACTTTGCAGATTTATGTTAAGGAGACAGGTCTAACTTATATCCCTGAGAATGCGCTGAAGATCAATGCAACTTCTAGCATTGCTACTGGAATCATCTAAGCTAGCTACCGGCGGTTGGTCATTCACATTCTTTGTATTTGATATAAATTCGAGGTGCTCGTATTTCTACCTCCAGAATTAGAGGCTGTATTTATGTCTCGAGGTTTGTTTCTGAGGCACTGTTATATCTATCATCAGCATGATGGAGGAATTGCGAGTTTTCTGAGATAATCCAGGTGCTTTGCATGGCAGTCTACCTCACTGTACTTTCCAATCTTCTCTTGTCAGTTGTCACTAAGTACTCGATACTGGCATATCGGGCCGTTAAACCCAGTCCATTATACGTGAAAGCGTATCCCAGCACCTAATCGCCAACGATTTGATTTACCCTCCTAATCGCCAACGATTCGATTTACCCTCTGTTATTGGCAAGATATTGTTTAGGAACTTACAAAAAAAAAGACATTATATTAGTGTGAGGTGATTTATTATAAGAAAGTGTATAATTGGATAGTTTTCCCAAAATATTTTTGCTGGTCACTGAAATGGAATTAAGTGTATGTTCTACTCC
Coding sequence (CDS)
ATGATTTCTTTTTTCCCGGGTTTCGAAATTGGGGCTCAGATCTCGCGGCACTTTGCCCTTCACCTTCTCTCTCTCCCTCTATCTCTCTTCGCGATCAATGTAATTGAATCAAGCCCTAATCAGACCTCTACACTGTCAACAAATCCATCAATTTCTTACCAAATTCCCCTTTCTAATCACAATCAAATCCCTGAAATCACCAGAATTCAGAGAAATTGGATCGAGGATAGTGAAAGTGGGACGATGAACTTTTTGCTGCGGCAAAGCCCTAACGCCGCCGCAGCAGCAGCAACGCCAGCAACAACACCAGCAAACGCCGATCAACTTCCACCGGTTTCGCCGTGGTTGAAGAAAACTCCGTCGCAGAGTGTGACGACACTGGAGGGATTGATTGCGGAAGATAAGTCGCATTTAGTGGATCATTCCACTGCAAGTAATCGTGATAGTTTAACTGGTGAGAATGGTATTATGAGTTCCGCAAGTATTAAGGATTTCGCTCCGCTTGCTGATAAGCATGTAGACGTTGCCGAGGATGAAGGGTGGATCACCATTCCTCATGAGAAACTTCCTGACGATTGGGCTGGTGCTCCAGATATTCAGTCGTTCCACTCTCTTGACCGTTCTTTTGTTTTTCCTGGGGAACAGGTTCATATCTTAGCATGTTTATCTGCATCCAAACAAGACACTGAAATCATAACCCCATTCAAAGTTGCTGCTGTGATGAATAAAAATGGTTACACACAAGGTTCTAAGGACCAAAATGGAAATGTTGCAGAACTAGAATCACCTTCTGGAGCTCAGCCAGCTGGGTTTGATGATAATCTAGATGCAAAAAATGTAAATAAGCATCAAGTAGACGTATCTGCGAGTGAGTCTTACCTTAGAATGGAGAATCATAGAAGACAAGTGCAAACGTTGCTGCAGAGGTTTAAGGACTCCCATTTTTTTGTGCGAATTTCTGAGGCAGGAGAGCTTCTATGGTCAGACAGAAGCACATCAGAAGCACCTGAGACGGAAGATGACGCTGATAGTATGGAGAACAATAAAGCTGCTGGCTCAAGAACCTCAATAAGTGCAGTTATTGATAGTGGAAGGTTTGATGCTCGGGCTTCTGGTGGTGTGGCGAGAAATTCTGTTAAGTGTTATTCTCTTGCTAATGGAGACATAGTGGTTCTTTTACAAGTGAATGTTGGTGTTGATTCTTTGAGAGATCCTATACTAGAGATTCTTCAGTTTGAGAAACATAATGACAGAACTACGGCTTTAGAAAGTGATTTACCTGTCAAGGAAGAACATAGCACATGTGGGGAACTGTTGAAATGGTTGCTTCCTTTGGACAACTCTCAGCTTCCTGTAAACCGACCTTTATCCCCACCCTTACTCAATTCATCCCATAAAACAAGCTTTTCAGGTTCGAGTGGTTCACAGCTTTTTTCTTTTGGTCACTTTAGAAGTTATTCTATGTCGTCACTACCACAAAATACTGCACCTTCTGCATCACCGTCATCAGTTTCTTCTGCTTCTTCGAAGCCAAATTTGGATTTAGAAGACTGGAATCGTTTTGGGTCACAAAAATCCCTGAAGAGCCCGAGAAGTGGAAGCCAGGGTTTGCTGTCCTTTCGTGGAGTTACATTAGAGCCTGAAAGATTCTCTGTTTGCTGTGGACTGGAGGGAATTTACATACCAGGAAGAAGGTGGAGGAAGCAGCTTGAAATTATTCAACCTGTTGACATTCACTCATTTGCTGCAAACTGCAATACGGATGACATGCTTTGTGTTCAGATAAAGAATATATGTCCAGCGCATGTTCCAGATATTGTTGTATACATAGATGCTATAACTGTTATTTTAGAAGAGGCATCAAAGGAGGCAGACCGGCCTTTGTCTGTTCCAATTGCATGTGTTGAAGCTGGAACCGGCCAAACTCTGCCAAATTTGGCTCTGAGGAGAGGTGAAGAGCACTCGTTTATTCTCAAGCCAGCAATTTCAGCCTTAAAGAA
GTCGAAGATACCTGATGATAAACTTACTAAGTTGCATTCACGAGCTGGTAGCAAACCAGCAAGTTTTCGGCTGCCTCCCGTAGCTGTTGAAGAAACAAAGAATATATCTACCCCTCATAAGTATGCCATATTGATATCATGTCGTTGTAACTACTCTGAATCGAGACTTTTTTTCAAGAAGCCCACTGATTGGCGACCACGCATTTCAAGGGACTTTATGATATCAGTTGCATCTGAAACATCAAAACAAACTCTGGATTGTACTGGAAAAGTCTCACAGCTGCCGGTTCAGGCGTTGACTCTTCAGGCATCAAACCTTACGTCTGAAGATCTTACATTGATCATACTTGCTCCAGCATCAGTGACTCCTGCTCCTTCAGTGCTGTCACTAAATTCTACACCATCATCACCAATGAGCCCATTTAGGGGTTTTGCTACAACACAAAGATTAGGCACAGTAACTCGTTCTACAGAGGATAAATCACCAAGTGGCGCACCGTCTCTTTCTTCTGATCAGCAGGGTATTTCAGTTGCTGATGTCATTCCAACAACTGCTTCGAGCTGTACTCACTTGTGGTTGCATAGTAGGGTTCCATTAGGATGTGTTCCTTCTCAATCTACTGCCAAGATCAAGCTTGAGCTGCTGCCACTGACAGATGGCATAATTACGCTTGATACTTTGCAGATTTATGTTAAGGAGACAGGTCTAACTTATATCCCTGAGAATGCGCTGAAGATCAATGCAACTTCTAGCATTGCTACTGGAATCATCTAA
Protein sequence
MISFFPGFEIGAQISRHFALHLLSLPLSLFAINVIESSPNQTSTLSTNPSISYQIPLSNHNQIPEITRIQRNWIEDSESGTMNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDHSTASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQSFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAELESPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRISEAGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVKCYSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLKWLLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFGHFRSYSMSSLPQNTAPSASPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGIYIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEEASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLHSRAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDFMISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNSTPSSPMSPFRGFATTQRLGTVTRSTEDKSPSGAPSLSSDQQGISVADVIPTTASSCTHLWLHSRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIATGII
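The protein sequence above is the frame-1 translation of the CDS. The following is a minimal sketch of that step, assuming the standard genetic code; the function name `translate` and the variable `cds_prefix` are illustrative and not part of the database. It uses only the first 13 codons of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore a trailing partial codon, if any.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing N or other ambiguity symbols become X.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 13 codons of the CDS shown above (assumed here to be in frame 1).
cds_prefix = "ATGATTTCTTTTTTCCCGGGTTTCGAAATTGGGGCTCAG"
print(translate(cds_prefix))  # MISFFPGFEIGAQ
```

Running the same function on the full CDS should reproduce the listed protein, provided the CDS is pasted as one uninterrupted string.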
Homology
BLAST of Spo25780.1 vs. NCBI nr
Match:
gi|902232442|gb|KNA22632.1| (hypothetical protein SOVF_032600 [Spinacia oleracea])
HSP 1 Score: 1630.5 bits (4221), Expect = 0.000e+0
Identity = 842/843 (99.88%), Positives = 842/843 (99.88%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH 141
MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH
Sbjct: 1 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH 60
Query: 142 STASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQS 201
STASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQS
Sbjct: 61 STASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQS 120
Query: 202 FHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAELE 261
FHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAELE
Sbjct: 121 FHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAELE 180
Query: 262 SPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRISE 321
SPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRISE
Sbjct: 181 SPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRISE 240
Query: 322 AGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVKC 381
AGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVKC
Sbjct: 241 AGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVKC 300
Query: 382 YSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLKW 441
YSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLKW
Sbjct: 301 YSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLKW 360
Query: 442 LLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFGHFRSYSMSSLPQNTAPSASP 501
LLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFGHFRSYSMSSLPQNTAPSASP
Sbjct: 361 LLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFGHFRSYSMSSLPQNTAPSASP 420
Query: 502 SSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGIYI 561
SSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGIYI
Sbjct: 421 SSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGIYI 480
Query: 562 PGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEEAS 621
PGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEEAS
Sbjct: 481 PGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEEAS 540
Query: 622 KEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLHSR 681
KEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLHSR
Sbjct: 541 KEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLHSR 600
Query: 682 AGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDFMI 741
AGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDFMI
Sbjct: 601 AGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDFMI 660
Query: 742 SVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNSTP 801
SVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNSTP
Sbjct: 661 SVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNSTP 720
Query: 802 SSPMSPFRGFATTQRLGTVTRSTEDKSPSGAPSLSSDQQGISVADVIPTTASSCTHLWLH 861
SSPMSPFRGFATTQRLGTVTRSTEDKSPSGA SLSSDQQGISVADVIPTTASSCTHLWLH
Sbjct: 721 SSPMSPFRGFATTQRLGTVTRSTEDKSPSGAQSLSSDQQGISVADVIPTTASSCTHLWLH 780
Query: 862 SRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIAT 921
SRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIAT
Sbjct: 781 SRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIAT 840
Query: 922 GII 925
GII
Sbjct: 841 GII 843
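The percentages reported with each HSP are the match counts divided by the alignment length. The following is a small sketch of that arithmetic, using the Identity figures of the HSP above; the helper name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Identity = 842/843 for the HSP above.
print(percent(842, 843))  # 99.88
```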
BLAST of Spo25780.1 vs. NCBI nr
Match:
gi|731344163|ref|XP_010683264.1| (PREDICTED: uncharacterized protein LOC104897980 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 1331.2 bits (3444), Expect = 0.000e+0
Identity = 703/845 (83.20%), Positives = 752/845 (88.99%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH 141
MNFLLRQSP+AAAAAATPA PA ADQLPP+SPWL+KTPSQSVTTLEGLIAED S++VD+
Sbjct: 1 MNFLLRQSPSAAAAAATPAAAPATADQLPPISPWLRKTPSQSVTTLEGLIAEDHSNVVDY 60
Query: 142 STASNRDSLTGEN-GIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQ 201
+T NRDS T +N G +SSAS KDF+PL DKHVDV EDEGWITIP + LPDDW+GAPDI
Sbjct: 61 TTEINRDSFTSDNSGKLSSASSKDFSPLVDKHVDVTEDEGWITIPCKNLPDDWSGAPDIH 120
Query: 202 SFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAEL 261
F S DRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGY G+K +NGN AE
Sbjct: 121 EFQSFDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYGPGAKARNGNAAEQ 180
Query: 262 ESPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRIS 321
ES SG QP DDN+D+KNVNKHQ DVSASESYLRMENHRR QTLLQRFKDSHFF RI+
Sbjct: 181 ESLSGTQPIELDDNVDSKNVNKHQ-DVSASESYLRMENHRRITQTLLQRFKDSHFFARIA 240
Query: 322 EAGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVK 381
E+GE LWSDRST EA + DD DS ENN GS TSISAVIDSGRFDARASGG+ARN VK
Sbjct: 241 ESGEQLWSDRSTLEALDGVDDTDSTENNNMNGSSTSISAVIDSGRFDARASGGLARNFVK 300
Query: 382 CYSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLK 441
C SLANGDIVVLLQ+NVGVDSLRDPILEILQFEKH DRTTALESD+ VKEEHS CGELLK
Sbjct: 301 CCSLANGDIVVLLQMNVGVDSLRDPILEILQFEKHLDRTTALESDI-VKEEHSPCGELLK 360
Query: 442 WLLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFG-HFRSYSMSSLPQNTAPSA 501
WLLPLDNSQLP++RPLSPPLLNSSHKTSFS SSGSQLFSFG HFRSYSMSSLPQNTAP+A
Sbjct: 361 WLLPLDNSQLPISRPLSPPLLNSSHKTSFSASSGSQLFSFGNHFRSYSMSSLPQNTAPAA 420
Query: 502 SPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGI 561
SP SVS+ASSKPN+ +EDW+RF SQKSLKSPRSGSQGLLSFRGV LEPERFSVCCGLEGI
Sbjct: 421 SP-SVSTASSKPNVGVEDWDRFASQKSLKSPRSGSQGLLSFRGVALEPERFSVCCGLEGI 480
Query: 562 YIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEE 621
YIPGRRWRKQLEIIQPVDIHSFAANCNTDD+LCVQIKNICP H PDI VYIDAIT+ILEE
Sbjct: 481 YIPGRRWRKQLEIIQPVDIHSFAANCNTDDLLCVQIKNICPPHFPDIAVYIDAITIILEE 540
Query: 622 ASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLH 681
ASK A PLSVPIACVEAGT +LPNL LRRGEEHSFILKPAISALKKS+IPDDK++KLH
Sbjct: 541 ASK-AGPPLSVPIACVEAGTDHSLPNLVLRRGEEHSFILKPAISALKKSRIPDDKISKLH 600
Query: 682 SRAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDF 741
+R GSK S RLPPVAVE + + S HKYAIL+SCRCNYSESRLFFKKPTDWRPRISRDF
Sbjct: 601 TRTGSKATSTRLPPVAVEGSMDESASHKYAILVSCRCNYSESRLFFKKPTDWRPRISRDF 660
Query: 742 MISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNS 801
MISVASE SK+T D TGKVSQLPVQ LTLQASNLTSEDLTL ILAPAS TP PSVLSLNS
Sbjct: 661 MISVASEISKETPDFTGKVSQLPVQVLTLQASNLTSEDLTLTILAPASFTPPPSVLSLNS 720
Query: 802 TPSSPMSPFRGFATTQRLGTVTRSTEDKSPSGAPSLSSDQQGISVADVIPTTASSCTHLW 861
TPSSPMSPF GFATTQ + TVTR +E++ S SLSSDQ+ + VADVIPTTA SCTHLW
Sbjct: 721 TPSSPMSPFGGFATTQ-ISTVTRLSENQIQSAPQSLSSDQKAVPVADVIPTTALSCTHLW 780
Query: 862 LHSRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSI 921
LHSR+PLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKE G+TY+PENALKINATSSI
Sbjct: 781 LHSRIPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKEKGITYVPENALKINATSSI 840
Query: 922 ATGII 925
ATGII
Sbjct: 841 ATGII 840
BLAST of Spo25780.1 vs. NCBI nr
Match:
gi|694361379|ref|XP_009360399.1| (PREDICTED: uncharacterized protein LOC103950874 [Pyrus x bretschneideri])
HSP 1 Score: 891.0 bits (2301), Expect = 1.800e-255
Identity = 510/875 (58.29%), Positives = 619/875 (70.74%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQ------SVTTLEGLIAEDK 141
MNFL+R + + A + P+ +PPVSP + + P++ S TTLEGLIAED
Sbjct: 1 MNFLMRSTHHVQRVTAEQPSVPS-IPSVPPVSP-VHEPPAETYPTPKSATTLEGLIAEDS 60
Query: 142 --SHLVDHSTASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDW 201
+ A+ +S +GENGI K + KH DV+++EGWI IP+++LPD+W
Sbjct: 61 YPQYSTTEDNAAESES-SGENGI----GAKKETSVIAKHYDVSDEEGWIAIPYKELPDNW 120
Query: 202 AGAPDIQSFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQ 261
APDIQS LDRSFVFPGEQVHILACLSA KQDTEIITPFK+AA M+KNG K Q
Sbjct: 121 NDAPDIQSLRPLDRSFVFPGEQVHILACLSACKQDTEIITPFKLAAAMSKNGIRLSPKKQ 180
Query: 262 NGNVAELESPSGAQPAGFDDNLDAKNVNKH-----------QVDVSASESYLRMENHRRQ 321
N N LE +G D + D++ +++ Q DVSASES LRME+H+RQ
Sbjct: 181 NRN---LEDSNGTLLGKGDMSPDSQGADRNGETLSKERTDSQKDVSASESLLRMEDHKRQ 240
Query: 322 VQTLLQRFKDSHFFVRISEAGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVID 381
+ LLQRF+ SHFFVRI+E+ E LW+ +STS+ + D E + +T+++A+ID
Sbjct: 241 TEILLQRFERSHFFVRIAESSEALWAKKSTSKKSSESVEVDGQEYTENGTQKTAVNAIID 300
Query: 382 SGRFDARASGGVARNSVKCYSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTT-A 441
G FD SGGVARN+VKC SL+NGDIVVLLQVNVGVD L+DP++EILQFEK+++R+ A
Sbjct: 301 KGNFDPNVSGGVARNNVKCCSLSNGDIVVLLQVNVGVDFLKDPVIEILQFEKYHERSLFA 360
Query: 442 LESDLPVKEEHSTCGELLKWLLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFG 501
D V CGELLKWLLPLDN+ P RPLSPPL ++S S S SGSQL S
Sbjct: 361 QTQDSLVDANQDPCGELLKWLLPLDNTLPPPARPLSPPLTSNSGIGSTSQKSGSQLLS-- 420
Query: 502 HFRSYSMSSLPQNTAPSASPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFR 561
HFRSYSMSSLPQNT P P + +ASSKP+ DLEDW+++ SQK LK+ ++G +GLLSFR
Sbjct: 421 HFRSYSMSSLPQNTTPPLGP--IKAASSKPSFDLEDWDQYSSQKFLKNQKTGGEGLLSFR 480
Query: 562 GVTLEPERFSVCCGLEGIYIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPA 621
GV+LE ERFSVCCGLEGIYIPGRRWR++LEIIQPV+IHSFAA+CNTDD+LCVQIKN+ PA
Sbjct: 481 GVSLERERFSVCCGLEGIYIPGRRWRRKLEIIQPVEIHSFAADCNTDDLLCVQIKNVSPA 540
Query: 622 HVPDIVVYIDAITVILEEASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPA 681
H P+IVVYIDAIT++ EEASK + LS+PIAC+EAG +LPNLALRRGEEHSFILKPA
Sbjct: 541 HAPNIVVYIDAITIVFEEASK-GGQSLSLPIACIEAGNDHSLPNLALRRGEEHSFILKPA 600
Query: 682 ISALKKSKIPDDKLTKLHS---RAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNY 741
S K K D+ + HS +AG+ S R PP VE K+ ST +YAI++SCRCNY
Sbjct: 601 TSLWKNFKAGGDR--RNHSSQLQAGNAAPSLRPPPKTVEGKKSASTADQYAIMVSCRCNY 660
Query: 742 SESRLFFKKPTDWRPRISRDFMISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDL 801
+ESRLFFK+PT WRPR+SRD MISVASE S+Q+ G VSQLPVQ LTLQ SNL SEDL
Sbjct: 661 TESRLFFKQPTSWRPRVSRDLMISVASEMSEQSSAPNGGVSQLPVQVLTLQVSNLMSEDL 720
Query: 802 TLIILAPASVTPAPSVLSLNSTPSSPMSPFRGF-------ATTQRLGTVTRSTEDKS--P 861
L +LAPAS T PSV+SLNS+P+SPMSPF F T QRL + S K
Sbjct: 721 NLTVLAPASFTSPPSVVSLNSSPASPMSPFLSFPDYTGKSPTIQRLSSPLLSDNQKQNVK 780
Query: 862 SGAPSLSSDQQGISVADVIPTTASSCTHLWLHSRVPLGCVPSQSTAKIKLELLPLTDGII 921
G S +Q ++D IP+ CTHLWL SRVPLGCVPSQSTA IKLELLPLTDGII
Sbjct: 781 GGVWPASFSEQTSPLSDAIPSAGLCCTHLWLQSRVPLGCVPSQSTATIKLELLPLTDGII 840
Query: 922 TLDTLQIYVKETGLTYIPENALKINATSSIATGII 925
TLDTLQI VKE G+TYIPE +LKINATSSI+TGI+
Sbjct: 841 TLDTLQIDVKEKGVTYIPEFSLKINATSSISTGIL 858
BLAST of Spo25780.1 vs. NCBI nr
Match:
gi|590700722|ref|XP_007046232.1| (Uncharacterized protein isoform 2 [Theobroma cacao])
HSP 1 Score: 880.6 bits (2274), Expect = 2.500e-252
Identity = 508/874 (58.12%), Positives = 627/874 (71.74%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTP--SQSVTTLEGLIAEDKSHLV 141
MNFLL P + TP + PPV + ++P S+S TTLEGLIAED
Sbjct: 1 MNFLL---PLRSNQQGTP--------EPPPVPEEVAESPYVSKSATTLEGLIAEDP--YP 60
Query: 142 DHSTASNRDSLT-GENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPD 201
++ST N T G G + + A + + H DV+E++GWITIP++ LPDDW APD
Sbjct: 61 EYSTVENHGGETNGFEGESTDVVSEKNASVLENHTDVSEEDGWITIPYKDLPDDWNQAPD 120
Query: 202 IQSFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVA 261
I S SLDRSFVFPGEQVHILACLSA Q+TEIITPFKVAAVM+KNG +G + QNGN+
Sbjct: 121 IHSLRSLDRSFVFPGEQVHILACLSACNQETEIITPFKVAAVMSKNGMRKGIEKQNGNM- 180
Query: 262 ELES---PSGAQ--PAGFDDNLDAKNVNKHQVD----VSASESYLRMENHRRQVQTLLQR 321
E+E+ P G + P G + + +N+ K ++D VSASES+LRME+HRRQ + LL+R
Sbjct: 181 EVETNSVPGGVEVSPNGTVIDQNGENLEKERIDAAKDVSASESFLRMEDHRRQTEILLKR 240
Query: 322 FKDSHFFVRISEAGELLWSDRSTSEAPETEDDAD-SMENNKAAGSRTSISAVIDSGRFDA 381
FK+SHFFVRI+E+GE LWS + S++ + + + E A + +S++AVID G FDA
Sbjct: 241 FKNSHFFVRIAESGEPLWSKKGASDSSQMDSQQSIANETKSTAKNISSLNAVIDRGNFDA 300
Query: 382 RASGGVARNSVKCYSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALES-DLP 441
SGGVAR++VKC SL+NGDIVVLLQVNVGVD LRDP++EILQFEK+ D+ + E+ +
Sbjct: 301 NVSGGVARDTVKCCSLSNGDIVVLLQVNVGVDFLRDPVIEILQFEKYQDKNLSSENQENL 360
Query: 442 VKEEHSTCGELLKWLLPLDNSQLPVNRPLSPPLLNS-------SHKTSFSGSSGSQLFSF 501
V E CGELLKWLLPLDN+ LP R LSPP L S S +++FS SSGSQLFSF
Sbjct: 361 VYENQDPCGELLKWLLPLDNT-LPPPRTLSPPPLGSGSGIGSTSQRSAFSASSGSQLFSF 420
Query: 502 GHFRSYSMSSLPQNTAPSASPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSF 561
GHFRS+SMSSLPQN A P V + SSKP+ DL++ + + SQK LKS R+G++GLLSF
Sbjct: 421 GHFRSHSMSSLPQNVA--TPPGPVKAQSSKPSFDLDELDHYSSQKILKSQRTGTEGLLSF 480
Query: 562 RGVTLEPERFSVCCGLEGIYIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICP 621
RGV+LE ERFSV CGLEGI+IPGRRWR++LEIIQPV+IHS+AA+CNT+D+LCVQIKN+ P
Sbjct: 481 RGVSLERERFSVRCGLEGIHIPGRRWRRKLEIIQPVEIHSYAADCNTNDLLCVQIKNVAP 540
Query: 622 AHVPDIVVYIDAITVILEEASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKP 681
AH+PDIVVYIDAITV+LEEASK P S+PIAC+EAG +LPNLALRRGEEHSFILKP
Sbjct: 541 AHIPDIVVYIDAITVVLEEASK-GGPPTSLPIACIEAGDDHSLPNLALRRGEEHSFILKP 600
Query: 682 AISALKKSKIPDDKLTKLHSRAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSE 741
A S K K +K SK +S R P + + ST ++YAI++SC CNY+
Sbjct: 601 ATSMWKDLKTYGEK---------SKLSSLRPPSKTFDRKGSASTVNQYAIMVSCHCNYTA 660
Query: 742 SRLFFKKPTDWRPRISRDFMISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTL 801
SRLFFK+PT WRPRISRD MISVASE S Q +V+QLPVQ LTLQASNLT EDLT+
Sbjct: 661 SRLFFKQPTSWRPRISRDLMISVASEMSGQYCGPNERVTQLPVQVLTLQASNLTPEDLTM 720
Query: 802 IILAPASVTPAPSVLSLNSTPSSPMSPFRGFA----------TTQRLGTVTRSTEDKSPS 861
+LAPAS T PSV+SLNS+P+SPMSPF GF+ + T + + + +
Sbjct: 721 TVLAPASFTSPPSVVSLNSSPTSPMSPFVGFSELAGKASSVHKLSSMSTASENLKQNGDA 780
Query: 862 GAPSLSSDQQGISVADVIPTTASSCTHLWLHSRVPLGCVPSQSTAKIKLELLPLTDGIIT 921
GA S ++Q +ADVIPT+ CTHLWL SRVPLGCVP+QS A IKLELLPLTDGIIT
Sbjct: 781 GARFTSFNEQLTPIADVIPTSGLGCTHLWLQSRVPLGCVPAQSMATIKLELLPLTDGIIT 840
Query: 922 LDTLQIYVKETGLTYIPENALKINATSSIATGII 925
LDTLQI VKE GLTYIPE++LKINATSS++TGII
Sbjct: 841 LDTLQIDVKEKGLTYIPEHSLKINATSSVSTGII 847
BLAST of Spo25780.1 vs. NCBI nr
Match:
gi|1009154003|ref|XP_015894933.1| (PREDICTED: uncharacterized protein LOC107428854 [Ziziphus jujuba])
HSP 1 Score: 879.4 bits (2271), Expect = 5.500e-252
Identity = 510/883 (57.76%), Positives = 621/883 (70.33%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH 141
MNFLLR + + AA + PA Q S+ TLEGLIAED
Sbjct: 1 MNFLLRSTHHVAAEQPSLQEAPAETPQT-----------SKPAVTLEGLIAEDP--YPQF 60
Query: 142 STASNRDSLT----GENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAP 201
ST RD T ENG ++ A K+ + + KH DV+E+EGWITIP++KLP +W
Sbjct: 61 STVEERDEETDGIVAENGSIAGAEAKNESSVVAKHSDVSEEEGWITIPYKKLPGNWNDVA 120
Query: 202 DIQSFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNV 261
DI S SLDR FVFPGEQVHILACL+A KQDTEIITPFKVAAVM+KNG + +NGNV
Sbjct: 121 DINSLRSLDRPFVFPGEQVHILACLAACKQDTEIITPFKVAAVMSKNGIGKSPDKRNGNV 180
Query: 262 AELESPSGAQ----PAGFDDNLDAKNV---NKHQVDVSASESYLRMENHRRQVQTLLQRF 321
+ +P + P G + + +N+ N+H+ +V ES LRME+H+RQ + LL RF
Sbjct: 181 EDDSNPHSRKEEMSPGGQSVHQNGENLSEENQHK-NVPTGESLLRMEDHKRQTEILLDRF 240
Query: 322 KDSHFFVRISEAGELLWSDRST---SEAPETEDDADSMENN--KAAGSRTSISAVIDSGR 381
+ SHFFVRI+E+GE LWS ++T S P D S+EN K A + I+AVID G
Sbjct: 241 ERSHFFVRIAESGEPLWSKKNTKKKSTEPSVMDGQKSIENETLKTAKDTSHINAVIDKGN 300
Query: 382 FDARASGGVARNSVKCYSL-ANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALES 441
FD SGG ARN+ +++ G + VLLQVNVGVD L DP++E+LQFEKH +R L S
Sbjct: 301 FDPNLSGGAARNTADSHAIFLFGRVQVLLQVNVGVDFLNDPVIEVLQFEKHRERN--LTS 360
Query: 442 DLPVKEEHSTCGELLKWLLPLDNSQLPVNRPLSPPLL------NSSHKTSFSGSSGSQLF 501
+ CGELLKWLLPLDN+ RPLSPPL N+++K+SFS SSGSQLF
Sbjct: 361 ENLESANQDPCGELLKWLLPLDNTVPSPARPLSPPLSSNSGYGNTTYKSSFSASSGSQLF 420
Query: 502 SFGHFRSYSMSSLPQNTAPSASPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLL 561
SFGHFRSYSMS+LPQNT P A+P V +ASSKP+ +LEDW+++ SQK K+ ++G +GLL
Sbjct: 421 SFGHFRSYSMSALPQNTTPPAAP--VKAASSKPSFNLEDWDQYESQKIWKTQKAGPEGLL 480
Query: 562 SFRGVTLEPERFSVCCGLEGIYIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNI 621
SFRGV+LE ERFSVCCGLEGIYIPG+RWR++LEIIQPV+IHSFAA+CNTDD+LCVQIKNI
Sbjct: 481 SFRGVSLERERFSVCCGLEGIYIPGKRWRRKLEIIQPVEIHSFAADCNTDDLLCVQIKNI 540
Query: 622 CPAHVPDIVVYIDAITVILEEASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFIL 681
CPAH PDIVVYIDAIT++ EEASK +PLS+PIAC+EAG LPNLALR+GEEHSFIL
Sbjct: 541 CPAHAPDIVVYIDAITIVFEEASK-GGQPLSLPIACIEAGDDHNLPNLALRQGEEHSFIL 600
Query: 682 KPAISALKKSKIPDDKLTK-LHSRAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCN 741
KPA S K K+ ++K T+ L S+AG+ +S RLP V E K ++ +YAI++SCRCN
Sbjct: 601 KPATSMWKNLKVNNEKKTQPLQSQAGNVASSLRLPSKTV-EGKRSASGEQYAIMVSCRCN 660
Query: 742 YSESRLFFKKPTDWRPRISRDFMISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSED 801
Y+ESRLFFK+PT W+PRISRD MISVASE S Q + G SQLPVQ LTLQASNLTS+D
Sbjct: 661 YTESRLFFKQPTSWQPRISRDLMISVASEISGQHMSNEG-ASQLPVQVLTLQASNLTSQD 720
Query: 802 LTLIILAPASVTPAPSVLSLNSTPSSPMSPFRGFAT-TQRLGTVTRSTEDKSPSGAP--- 861
LTL +LAPAS T PSV+S NS+P+SPMSPF GF T R + RST + AP
Sbjct: 721 LTLTVLAPASFTSPPSVVSFNSSPTSPMSPFVGFPEFTGRFSSDKRSTAIQRMGSAPLAS 780
Query: 862 ------------SLSSDQQGISVADVIPTTASSCTHLWLHSRVPLGCVPSQSTAKIKLEL 921
S S D+Q ++DV+P++ CTHLWLHSRVPLGCVPSQSTA IKLEL
Sbjct: 781 NKKKQNDNGRSQSASFDEQVSPLSDVLPSSGLGCTHLWLHSRVPLGCVPSQSTATIKLEL 840
Query: 922 LPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIATGII 925
LPLTDGIITLDTLQI VKE GLTYIPE++L INATSSI+TGII
Sbjct: 841 LPLTDGIITLDTLQIDVKEKGLTYIPEHSLMINATSSISTGII 862
BLAST of Spo25780.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9RUZ9_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_032600 PE=4 SV=1)
HSP 1 Score: 1630.5 bits (4221), Expect = 0.000e+0
Identity = 842/843 (99.88%), Positives = 842/843 (99.88%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH 141
MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH
Sbjct: 1 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH 60
Query: 142 STASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQS 201
STASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQS
Sbjct: 61 STASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQS 120
Query: 202 FHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAELE 261
FHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAELE
Sbjct: 121 FHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAELE 180
Query: 262 SPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRISE 321
SPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRISE
Sbjct: 181 SPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRISE 240
Query: 322 AGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVKC 381
AGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVKC
Sbjct: 241 AGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVKC 300
Query: 382 YSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLKW 441
YSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLKW
Sbjct: 301 YSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLKW 360
Query: 442 LLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFGHFRSYSMSSLPQNTAPSASP 501
LLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFGHFRSYSMSSLPQNTAPSASP
Sbjct: 361 LLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFGHFRSYSMSSLPQNTAPSASP 420
Query: 502 SSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGIYI 561
SSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGIYI
Sbjct: 421 SSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGIYI 480
Query: 562 PGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEEAS 621
PGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEEAS
Sbjct: 481 PGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEEAS 540
Query: 622 KEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLHSR 681
KEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLHSR
Sbjct: 541 KEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLHSR 600
Query: 682 AGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDFMI 741
AGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDFMI
Sbjct: 601 AGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDFMI 660
Query: 742 SVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNSTP 801
SVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNSTP
Sbjct: 661 SVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNSTP 720
Query: 802 SSPMSPFRGFATTQRLGTVTRSTEDKSPSGAPSLSSDQQGISVADVIPTTASSCTHLWLH 861
SSPMSPFRGFATTQRLGTVTRSTEDKSPSGA SLSSDQQGISVADVIPTTASSCTHLWLH
Sbjct: 721 SSPMSPFRGFATTQRLGTVTRSTEDKSPSGAQSLSSDQQGISVADVIPTTASSCTHLWLH 780
Query: 862 SRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIAT 921
SRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIAT
Sbjct: 781 SRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIAT 840
Query: 922 GII 925
GII
Sbjct: 841 GII 843
BLAST of Spo25780.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8C524_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_6g154660 PE=4 SV=1)
HSP 1 Score: 1331.2 bits (3444), Expect = 0.000e+0
Identity = 703/845 (83.20%), Positives = 752/845 (88.99%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH 141
MNFLLRQSP+AAAAAATPA PA ADQLPP+SPWL+KTPSQSVTTLEGLIAED S++VD+
Sbjct: 1 MNFLLRQSPSAAAAAATPAAAPATADQLPPISPWLRKTPSQSVTTLEGLIAEDHSNVVDY 60
Query: 142 STASNRDSLTGEN-GIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPDIQ 201
+T NRDS T +N G +SSAS KDF+PL DKHVDV EDEGWITIP + LPDDW+GAPDI
Sbjct: 61 TTEINRDSFTSDNSGKLSSASSKDFSPLVDKHVDVTEDEGWITIPCKNLPDDWSGAPDIH 120
Query: 202 SFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVAEL 261
F S DRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGY G+K +NGN AE
Sbjct: 121 EFQSFDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYGPGAKARNGNAAEQ 180
Query: 262 ESPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFVRIS 321
ES SG QP DDN+D+KNVNKHQ DVSASESYLRMENHRR QTLLQRFKDSHFF RI+
Sbjct: 181 ESLSGTQPIELDDNVDSKNVNKHQ-DVSASESYLRMENHRRITQTLLQRFKDSHFFARIA 240
Query: 322 EAGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDARASGGVARNSVK 381
E+GE LWSDRST EA + DD DS ENN GS TSISAVIDSGRFDARASGG+ARN VK
Sbjct: 241 ESGEQLWSDRSTLEALDGVDDTDSTENNNMNGSSTSISAVIDSGRFDARASGGLARNFVK 300
Query: 382 CYSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVKEEHSTCGELLK 441
C SLANGDIVVLLQ+NVGVDSLRDPILEILQFEKH DRTTALESD+ VKEEHS CGELLK
Sbjct: 301 CCSLANGDIVVLLQMNVGVDSLRDPILEILQFEKHLDRTTALESDI-VKEEHSPCGELLK 360
Query: 442 WLLPLDNSQLPVNRPLSPPLLNSSHKTSFSGSSGSQLFSFG-HFRSYSMSSLPQNTAPSA 501
WLLPLDNSQLP++RPLSPPLLNSSHKTSFS SSGSQLFSFG HFRSYSMSSLPQNTAP+A
Sbjct: 361 WLLPLDNSQLPISRPLSPPLLNSSHKTSFSASSGSQLFSFGNHFRSYSMSSLPQNTAPAA 420
Query: 502 SPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGI 561
SP SVS+ASSKPN+ +EDW+RF SQKSLKSPRSGSQGLLSFRGV LEPERFSVCCGLEGI
Sbjct: 421 SP-SVSTASSKPNVGVEDWDRFASQKSLKSPRSGSQGLLSFRGVALEPERFSVCCGLEGI 480
Query: 562 YIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHVPDIVVYIDAITVILEE 621
YIPGRRWRKQLEIIQPVDIHSFAANCNTDD+LCVQIKNICP H PDI VYIDAIT+ILEE
Sbjct: 481 YIPGRRWRKQLEIIQPVDIHSFAANCNTDDLLCVQIKNICPPHFPDIAVYIDAITIILEE 540
Query: 622 ASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAISALKKSKIPDDKLTKLH 681
ASK A PLSVPIACVEAGT +LPNL LRRGEEHSFILKPAISALKKS+IPDDK++KLH
Sbjct: 541 ASK-AGPPLSVPIACVEAGTDHSLPNLVLRRGEEHSFILKPAISALKKSRIPDDKISKLH 600
Query: 682 SRAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRLFFKKPTDWRPRISRDF 741
+R GSK S RLPPVAVE + + S HKYAIL+SCRCNYSESRLFFKKPTDWRPRISRDF
Sbjct: 601 TRTGSKATSTRLPPVAVEGSMDESASHKYAILVSCRCNYSESRLFFKKPTDWRPRISRDF 660
Query: 742 MISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIILAPASVTPAPSVLSLNS 801
MISVASE SK+T D TGKVSQLPVQ LTLQASNLTSEDLTL ILAPAS TP PSVLSLNS
Sbjct: 661 MISVASEISKETPDFTGKVSQLPVQVLTLQASNLTSEDLTLTILAPASFTPPPSVLSLNS 720
Query: 802 TPSSPMSPFRGFATTQRLGTVTRSTEDKSPSGAPSLSSDQQGISVADVIPTTASSCTHLW 861
TPSSPMSPF GFATTQ + TVTR +E++ S SLSSDQ+ + VADVIPTTA SCTHLW
Sbjct: 721 TPSSPMSPFGGFATTQ-ISTVTRLSENQIQSAPQSLSSDQKAVPVADVIPTTALSCTHLW 780
Query: 862 LHSRVPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSI 921
LHSR+PLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKE G+TY+PENALKINATSSI
Sbjct: 781 LHSRIPLGCVPSQSTAKIKLELLPLTDGIITLDTLQIYVKEKGITYVPENALKINATSSI 840
Query: 922 ATGII 925
ATGII
Sbjct: 841 ATGII 840
BLAST of Spo25780.1 vs. UniProtKB/TrEMBL
Match:
A0A061EID2_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_011806 PE=4 SV=1)
HSP 1 Score: 880.6 bits (2274), Expect = 1.700e-252
Identity = 508/874 (58.12%), Positives = 627/874 (71.74%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTP--SQSVTTLEGLIAEDKSHLV 141
MNFLL P + TP + PPV + ++P S+S TTLEGLIAED
Sbjct: 1 MNFLL---PLRSNQQGTP--------EPPPVPEEVAESPYVSKSATTLEGLIAEDP--YP 60
Query: 142 DHSTASNRDSLT-GENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPD 201
++ST N T G G + + A + + H DV+E++GWITIP++ LPDDW APD
Sbjct: 61 EYSTVENHGGETNGFEGESTDVVSEKNASVLENHTDVSEEDGWITIPYKDLPDDWNQAPD 120
Query: 202 IQSFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVA 261
I S SLDRSFVFPGEQVHILACLSA Q+TEIITPFKVAAVM+KNG +G + QNGN+
Sbjct: 121 IHSLRSLDRSFVFPGEQVHILACLSACNQETEIITPFKVAAVMSKNGMRKGIEKQNGNM- 180
Query: 262 ELES---PSGAQ--PAGFDDNLDAKNVNKHQVD----VSASESYLRMENHRRQVQTLLQR 321
E+E+ P G + P G + + +N+ K ++D VSASES+LRME+HRRQ + LL+R
Sbjct: 181 EVETNSVPGGVEVSPNGTVIDQNGENLEKERIDAAKDVSASESFLRMEDHRRQTEILLKR 240
Query: 322 FKDSHFFVRISEAGELLWSDRSTSEAPETEDDAD-SMENNKAAGSRTSISAVIDSGRFDA 381
FK+SHFFVRI+E+GE LWS + S++ + + + E A + +S++AVID G FDA
Sbjct: 241 FKNSHFFVRIAESGEPLWSKKGASDSSQMDSQQSIANETKSTAKNISSLNAVIDRGNFDA 300
Query: 382 RASGGVARNSVKCYSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALES-DLP 441
SGGVAR++VKC SL+NGDIVVLLQVNVGVD LRDP++EILQFEK+ D+ + E+ +
Sbjct: 301 NVSGGVARDTVKCCSLSNGDIVVLLQVNVGVDFLRDPVIEILQFEKYQDKNLSSENQENL 360
Query: 442 VKEEHSTCGELLKWLLPLDNSQLPVNRPLSPPLLNS-------SHKTSFSGSSGSQLFSF 501
V E CGELLKWLLPLDN+ LP R LSPP L S S +++FS SSGSQLFSF
Sbjct: 361 VYENQDPCGELLKWLLPLDNT-LPPPRTLSPPPLGSGSGIGSTSQRSAFSASSGSQLFSF 420
Query: 502 GHFRSYSMSSLPQNTAPSASPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSF 561
GHFRS+SMSSLPQN A P V + SSKP+ DL++ + + SQK LKS R+G++GLLSF
Sbjct: 421 GHFRSHSMSSLPQNVA--TPPGPVKAQSSKPSFDLDELDHYSSQKILKSQRTGTEGLLSF 480
Query: 562 RGVTLEPERFSVCCGLEGIYIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICP 621
RGV+LE ERFSV CGLEGI+IPGRRWR++LEIIQPV+IHS+AA+CNT+D+LCVQIKN+ P
Sbjct: 481 RGVSLERERFSVRCGLEGIHIPGRRWRRKLEIIQPVEIHSYAADCNTNDLLCVQIKNVAP 540
Query: 622 AHVPDIVVYIDAITVILEEASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKP 681
AH+PDIVVYIDAITV+LEEASK P S+PIAC+EAG +LPNLALRRGEEHSFILKP
Sbjct: 541 AHIPDIVVYIDAITVVLEEASK-GGPPTSLPIACIEAGDDHSLPNLALRRGEEHSFILKP 600
Query: 682 AISALKKSKIPDDKLTKLHSRAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSE 741
A S K K +K SK +S R P + + ST ++YAI++SC CNY+
Sbjct: 601 ATSMWKDLKTYGEK---------SKLSSLRPPSKTFDRKGSASTVNQYAIMVSCHCNYTA 660
Query: 742 SRLFFKKPTDWRPRISRDFMISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTL 801
SRLFFK+PT WRPRISRD MISVASE S Q +V+QLPVQ LTLQASNLT EDLT+
Sbjct: 661 SRLFFKQPTSWRPRISRDLMISVASEMSGQYCGPNERVTQLPVQVLTLQASNLTPEDLTM 720
Query: 802 IILAPASVTPAPSVLSLNSTPSSPMSPFRGFA----------TTQRLGTVTRSTEDKSPS 861
+LAPAS T PSV+SLNS+P+SPMSPF GF+ + T + + + +
Sbjct: 721 TVLAPASFTSPPSVVSLNSSPTSPMSPFVGFSELAGKASSVHKLSSMSTASENLKQNGDA 780
Query: 862 GAPSLSSDQQGISVADVIPTTASSCTHLWLHSRVPLGCVPSQSTAKIKLELLPLTDGIIT 921
GA S ++Q +ADVIPT+ CTHLWL SRVPLGCVP+QS A IKLELLPLTDGIIT
Sbjct: 781 GARFTSFNEQLTPIADVIPTSGLGCTHLWLQSRVPLGCVPAQSMATIKLELLPLTDGIIT 840
Query: 922 LDTLQIYVKETGLTYIPENALKINATSSIATGII 925
LDTLQI VKE GLTYIPE++LKINATSS++TGII
Sbjct: 841 LDTLQIDVKEKGLTYIPEHSLKINATSSVSTGII 847
BLAST of Spo25780.1 vs. UniProtKB/TrEMBL
Match:
W9RC09_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_021086 PE=4 SV=1)
HSP 1 Score: 876.3 bits (2263), Expect = 3.200e-251
Identity = 502/878 (57.18%), Positives = 606/878 (69.02%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDK----SH 141
MNFL+R + + A+ A P K T S LE LIAED S
Sbjct: 1 MNFLMRSTQSVTTEQASVPEPVAETHHDP------KPTAS-----LESLIAEDPYPQYSR 60
Query: 142 LVDHSTASNRDSLTGENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAP 201
+ H D GEN ++ K + KH DV+E+EGWITIP+++LPDDW AP
Sbjct: 61 VELHD--GENDGFAGENASIAVPDAKKDSSTIAKHSDVSEEEGWITIPYKELPDDWKDAP 120
Query: 202 DIQSFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNV 261
DI+S +LDRSFVFPGEQVHILACL+A KQD EIITPFKVAA+M+KNG + + QNG+
Sbjct: 121 DIKSLRTLDRSFVFPGEQVHILACLAACKQDAEIITPFKVAALMSKNGIGKSPEKQNGST 180
Query: 262 AELESPSGAQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQRFKDSHFFV 321
+ + D N + + DVSA ES RME+H+RQ + LLQRF+ SH+FV
Sbjct: 181 EDGKGEMSPGGQNIDKNAEILLNVDLKKDVSAGESLFRMEDHKRQTEMLLQRFEKSHYFV 240
Query: 322 RISEAGELLWSDRSTSEAPETEDDADSME--------NNKAAGSRTSISAVIDSGRFDAR 381
RI+E+ E LWS +S DA M+ K A + +AVID G FD
Sbjct: 241 RIAESTEPLWSKKSAPNPSSESSDAHEMDGQNSIPNGTQKTAKDASCFNAVIDKGIFDPT 300
Query: 382 ASGGVARNSVKCYSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVK 441
SGG ARN+VKC SL NGDIVVLLQVNVGVD L DPI+EILQFEK+++R E+ V
Sbjct: 301 ISGGAARNTVKCCSLPNGDIVVLLQVNVGVDVLNDPIIEILQFEKYHERNLGSENQRNVA 360
Query: 442 -EEHSTCGELLKWLLPLDNSQLPVNRPLSPPL------LNSSHKTSFSGSSGSQLFSFGH 501
+ CGELLKWLLPLDN+ P RPLSPPL N+S K++F+ SSGSQLFSFGH
Sbjct: 361 FTDQDPCGELLKWLLPLDNTLPPPARPLSPPLGSTSGFGNTSQKSNFTSSSGSQLFSFGH 420
Query: 502 FRSYSMSSLPQNTAPSASPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRG 561
FRSYSMSSLPQN P P+SV + SSKP+ +LE W+++ SQK KS ++GS+ LLSFRG
Sbjct: 421 FRSYSMSSLPQNNTP--PPASVKAISSKPSFELEGWDQYSSQKLWKSQKTGSEALLSFRG 480
Query: 562 VTLEPERFSVCCGLEGIYIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAH 621
V+LE ERFSVCCGLEGIY+PGRRWR++LEIIQPV+IHSFAA+CNTDD+LCVQIKN+ PAH
Sbjct: 481 VSLERERFSVCCGLEGIYMPGRRWRRKLEIIQPVEIHSFAADCNTDDLLCVQIKNVSPAH 540
Query: 622 VPDIVVYIDAITVILEEASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAI 681
PDIVVYIDAIT++ EEASK +PLS+PIAC+EAG +LPNL LRRGEEHSFILKPA
Sbjct: 541 TPDIVVYIDAITIVFEEASK-GGQPLSLPIACIEAGIDHSLPNLVLRRGEEHSFILKPAT 600
Query: 682 SALKKSKIPDDKLTKLHSRAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESR 741
S K K +K T+ H A + +S RLPP E K++S+ +Y+I++SCRCNY+ESR
Sbjct: 601 SLWKNVKATGEKSTRSHLPAVNAASSLRLPPTV--EGKSVSSAGQYSIMVSCRCNYTESR 660
Query: 742 LFFKKPTDWRPRISRDFMISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLII 801
LFFK+PT WRPRISRD MISVASE S Q G V QLPVQ LTLQASNLTSEDLTL +
Sbjct: 661 LFFKQPTSWRPRISRDLMISVASEISGQH-GANGGVYQLPVQVLTLQASNLTSEDLTLTV 720
Query: 802 LAPASVTPAPSVLSLNSTPSSPMSPFRGFA-------------TTQRLGTVTRSTEDKSP 861
LAPAS T PSV+SLNS+P+SPMSPF GFA RL + S+ ++
Sbjct: 721 LAPASFTSPPSVVSLNSSPTSPMSPFVGFAEFTGSISGDKRSSAIHRLNSAPVSSGNQKQ 780
Query: 862 S---GAPSLSSDQQGISVADVIPTTASSCTHLWLHSRVPLGCVPSQSTAKIKLELLPLTD 921
+ GA S+S +QG S++DVIP++ CTHLWL SRVPLGCVPS S A IKLELLPLTD
Sbjct: 781 NGNGGARSVSFTEQGSSISDVIPSSGLGCTHLWLQSRVPLGCVPSHSAATIKLELLPLTD 840
Query: 922 GIITLDTLQIYVKETGLTYIPENALKINATSSIATGII 925
GIITLDTLQI VKE GLTYIPE++LKINATSSI+T I+
Sbjct: 841 GIITLDTLQIDVKEKGLTYIPEHSLKINATSSISTAIV 859
BLAST of Spo25780.1 vs. UniProtKB/TrEMBL
Match:
A0A061EBP5_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_011806 PE=4 SV=1)
HSP 1 Score: 870.9 bits (2249), Expect = 1.400e-249
Identity = 508/888 (57.21%), Positives = 627/888 (70.61%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTP--SQSVTTLEGLIAEDKSHLV 141
MNFLL P + TP + PPV + ++P S+S TTLEGLIAED
Sbjct: 1 MNFLL---PLRSNQQGTP--------EPPPVPEEVAESPYVSKSATTLEGLIAEDP--YP 60
Query: 142 DHSTASNRDSLT-GENGIMSSASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAPD 201
++ST N T G G + + A + + H DV+E++GWITIP++ LPDDW APD
Sbjct: 61 EYSTVENHGGETNGFEGESTDVVSEKNASVLENHTDVSEEDGWITIPYKDLPDDWNQAPD 120
Query: 202 IQSFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNVA 261
I S SLDRSFVFPGEQVHILACLSA Q+TEIITPFKVAAVM+KNG +G + QNGN+
Sbjct: 121 IHSLRSLDRSFVFPGEQVHILACLSACNQETEIITPFKVAAVMSKNGMRKGIEKQNGNM- 180
Query: 262 ELES---PSGAQ--PAGFDDNLDAKNVNKHQVD----VSASESYLRMENHRRQVQTLLQR 321
E+E+ P G + P G + + +N+ K ++D VSASES+LRME+HRRQ + LL+R
Sbjct: 181 EVETNSVPGGVEVSPNGTVIDQNGENLEKERIDAAKDVSASESFLRMEDHRRQTEILLKR 240
Query: 322 FKDSHFFVRISEAGELLWSDRSTSEAPETEDDAD-SMENNKAAGSRTSISAVIDSGRFDA 381
FK+SHFFVRI+E+GE LWS + S++ + + + E A + +S++AVID G FDA
Sbjct: 241 FKNSHFFVRIAESGEPLWSKKGASDSSQMDSQQSIANETKSTAKNISSLNAVIDRGNFDA 300
Query: 382 RASGGVARNSVKCYSLANGDIV--------------VLLQVNVGVDSLRDPILEILQFEK 441
SGGVAR++VKC SL+NGDIV VLLQVNVGVD LRDP++EILQFEK
Sbjct: 301 NVSGGVARDTVKCCSLSNGDIVTTDSHTTSLFGRMQVLLQVNVGVDFLRDPVIEILQFEK 360
Query: 442 HNDRTTALES-DLPVKEEHSTCGELLKWLLPLDNSQLPVNRPLSPPLLNS-------SHK 501
+ D+ + E+ + V E CGELLKWLLPLDN+ LP R LSPP L S S +
Sbjct: 361 YQDKNLSSENQENLVYENQDPCGELLKWLLPLDNT-LPPPRTLSPPPLGSGSGIGSTSQR 420
Query: 502 TSFSGSSGSQLFSFGHFRSYSMSSLPQNTAPSASPSSVSSASSKPNLDLEDWNRFGSQKS 561
++FS SSGSQLFSFGHFRS+SMSSLPQN A P V + SSKP+ DL++ + + SQK
Sbjct: 421 SAFSASSGSQLFSFGHFRSHSMSSLPQNVA--TPPGPVKAQSSKPSFDLDELDHYSSQKI 480
Query: 562 LKSPRSGSQGLLSFRGVTLEPERFSVCCGLEGIYIPGRRWRKQLEIIQPVDIHSFAANCN 621
LKS R+G++GLLSFRGV+LE ERFSV CGLEGI+IPGRRWR++LEIIQPV+IHS+AA+CN
Sbjct: 481 LKSQRTGTEGLLSFRGVSLERERFSVRCGLEGIHIPGRRWRRKLEIIQPVEIHSYAADCN 540
Query: 622 TDDMLCVQIKNICPAHVPDIVVYIDAITVILEEASKEADRPLSVPIACVEAGTGQTLPNL 681
T+D+LCVQIKN+ PAH+PDIVVYIDAITV+LEEASK P S+PIAC+EAG +LPNL
Sbjct: 541 TNDLLCVQIKNVAPAHIPDIVVYIDAITVVLEEASK-GGPPTSLPIACIEAGDDHSLPNL 600
Query: 682 ALRRGEEHSFILKPAISALKKSKIPDDKLTKLHSRAGSKPASFRLPPVAVEETKNISTPH 741
ALRRGEEHSFILKPA S K K +K SK +S R P + + ST +
Sbjct: 601 ALRRGEEHSFILKPATSMWKDLKTYGEK---------SKLSSLRPPSKTFDRKGSASTVN 660
Query: 742 KYAILISCRCNYSESRLFFKKPTDWRPRISRDFMISVASETSKQTLDCTGKVSQLPVQAL 801
+YAI++SC CNY+ SRLFFK+PT WRPRISRD MISVASE S Q +V+QLPVQ L
Sbjct: 661 QYAIMVSCHCNYTASRLFFKQPTSWRPRISRDLMISVASEMSGQYCGPNERVTQLPVQVL 720
Query: 802 TLQASNLTSEDLTLIILAPASVTPAPSVLSLNSTPSSPMSPFRGFA----------TTQR 861
TLQASNLT EDLT+ +LAPAS T PSV+SLNS+P+SPMSPF GF+
Sbjct: 721 TLQASNLTPEDLTMTVLAPASFTSPPSVVSLNSSPTSPMSPFVGFSELAGKASSVHKLSS 780
Query: 862 LGTVTRSTEDKSPSGAPSLSSDQQGISVADVIPTTASSCTHLWLHSRVPLGCVPSQSTAK 921
+ T + + + +GA S ++Q +ADVIPT+ CTHLWL SRVPLGCVP+QS A
Sbjct: 781 MSTASENLKQNGDAGARFTSFNEQLTPIADVIPTSGLGCTHLWLQSRVPLGCVPAQSMAT 840
Query: 922 IKLELLPLTDGIITLDTLQIYVKETGLTYIPENALKINATSSIATGII 925
IKLELLPLTDGIITLDTLQI VKE GLTYIPE++LKINATSS++TGII
Sbjct: 841 IKLELLPLTDGIITLDTLQIDVKEKGLTYIPEHSLKINATSSVSTGII 861
BLAST of Spo25780.1 vs. TAIR (Arabidopsis)
Match:
AT3G17900.1 (unknown protein)
HSP 1 Score: 767.3 bits (1980), Expect = 1.100e-221
Identity = 454/869 (52.24%), Positives = 585/869 (67.32%), Query Frame = 1
Query: 82 MNFLLRQSPNAAAAAATPATTPANADQLPPVSPWLKKTPSQSVTTLEGLIAEDKSHLVDH 141
MNFLLR S ++A PA Q PP + ++ TLEGLIAE+ H +
Sbjct: 1 MNFLLR-SASSATHRPPVIEPPATPPQPPPET-------AKPGVTLEGLIAEE--HFPQY 60
Query: 142 -STASNRDSLTGENGIMSS---ASIKDFAPLADKHVDVAEDEGWITIPHEKLPDDWAGAP 201
S + D + +G + ++ K ++ DV+E++GWI IP++++PD+W+ +
Sbjct: 61 PSVDEDLDRVGDGSGDLDGNGESNAKSGGSGMERFSDVSEEQGWIAIPYKEIPDNWSESV 120
Query: 202 DIQSFHSLDRSFVFPGEQVHILACLSASKQDTEIITPFKVAAVMNKNGYTQGSKDQNGNV 261
DI S SLDRSFVFPGEQ+ ILACLS SK DTEIITPFKVA VM++ G + S QNG++
Sbjct: 121 DIHSLRSLDRSFVFPGEQIQILACLSESKGDTEIITPFKVAEVMSRTGQRKVSDKQNGDM 180
Query: 262 AE-LESPSG-------AQPAGFDDNLDAKNVNKHQVDVSASESYLRMENHRRQVQTLLQR 321
++ +PSG AQ A + + K Q D+S ES LRME+H+R+ + LL R
Sbjct: 181 SDGASTPSGDGEMSPDAQFATQNGDSPCKESLDSQKDLSDGESILRMEDHKRRTEDLLSR 240
Query: 322 FKDSHFFVRISEAGELLWSDRSTSEAPETEDDADSMENNKAAGSRTSISAVIDSGRFDAR 381
F+ SHFFVRI+E+GE LWS +S+ A D + E K SR +SA +D G FD
Sbjct: 241 FQKSHFFVRIAESGEPLWSKKSSLVA-----DTEMDEERKRTKSRPCVSAFVDRGDFDPN 300
Query: 382 ASGGVARNSVKCYSLANGDIVVLLQVNVGVDSLRDPILEILQFEKHNDRTTALESDLPVK 441
SGGVAR+ KC +L NGDIVV LQV + VD ++PI+EILQFEKH D+ E+D K
Sbjct: 301 VSGGVARSKAKCCALPNGDIVVSLQVYI-VDCPKEPIIEILQFEKHQDQDQNPEND---K 360
Query: 442 EEHSTCGELLKWLLPLDN--SQLPVNRP----LSPPLLNSSHKTSFSGSSGSQLFSFGHF 501
+ + G LLKWL+PLDN SQ P + P SP + +++HK + S +SGSQLFSFGHF
Sbjct: 361 DPY---GNLLKWLIPLDNTISQQPRSLPPPITPSPSISSTAHKPAISSTSGSQLFSFGHF 420
Query: 502 RSYSMSSLPQNTAPSASPSSVSSASSKPNLDLEDWNRFGSQKSLKSPRSGSQGLLSFRGV 561
RSYSMS+LP NTAP P + + SSKP+ D+EDW+ + Q +SG++ LLSFRGV
Sbjct: 421 RSYSMSALPPNTAPVTGP--IKTQSSKPSFDIEDWDSYSGQTVRNGQKSGTEELLSFRGV 480
Query: 562 TLEPERFSVCCGLEGIYIPGRRWRKQLEIIQPVDIHSFAANCNTDDMLCVQIKNICPAHV 621
LE +RFSV CGLEGI IPGRRWR++LEIIQP++I+SFAA+CNTDD+LCVQIKN+ P H
Sbjct: 481 ALERDRFSVRCGLEGICIPGRRWRRKLEIIQPIEINSFAADCNTDDLLCVQIKNVAPTHA 540
Query: 622 PDIVVYIDAITVILEEASKEADRPLSVPIACVEAGTGQTLPNLALRRGEEHSFILKPAIS 681
PDIV+YIDAIT++ EEA K A P SVPIAC+EAG +LPNL LR+GEEHSFI+KPA S
Sbjct: 541 PDIVIYIDAITIVFEEAGKNAS-PSSVPIACIEAGNEHSLPNLTLRKGEEHSFIVKPAFS 600
Query: 682 ALKKSKIPDDKLTKLHSRAGSKPASFRLPPVAVEETKNISTPHKYAILISCRCNYSESRL 741
K P KL K +S LP V E + + +YA+++SCRCNY+ESRL
Sbjct: 601 VGSNLK-PSAARNKL------KSSSLSLPTVNFERKGSGLSGDQYAVMVSCRCNYTESRL 660
Query: 742 FFKKPTDWRPRISRDFMISVASETSKQTLDCTGKVSQLPVQALTLQASNLTSEDLTLIIL 801
FFK+ T WRPR+SRD MISVASE S + G+ SQLPVQ LTLQASNLTSEDL+L +L
Sbjct: 661 FFKQRTKWRPRVSRDLMISVASEMSGEPCGPHGRASQLPVQILTLQASNLTSEDLSLTVL 720
Query: 802 APASVTPAPSVLSLNSTPSSPMSPFRGFAT-TQRLGTVTRSTEDKSPSGAPSL------- 861
APAS T P+V+SLNSTP++P+SPF GF+ T+R+ R+T + P +
Sbjct: 721 APASFTSPPTVVSLNSTPTTPISPFLGFSDFTERVQNEKRNTTVRKQQSLPPIPLETRTE 780
Query: 862 -SSDQQGISVADVIPTTASSCTHLWLHSRVPLGCVPSQSTAKIKLELLPLTDGIITLDTL 921
+++ + + +DV+P + CTHLWL SRVPLGCVPS+STA IKLELLPLTDGIITLDTL
Sbjct: 781 NNTNGESSNPSDVVPKSGLGCTHLWLQSRVPLGCVPSKSTATIKLELLPLTDGIITLDTL 837
Query: 922 QIYVKETGLTYIPENALKINATSSIATGI 924
QI+ KE G YIPE +LKINATSSI++GI
Sbjct: 841 QIHAKEKGRRYIPEQSLKINATSSISSGI 837
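Each HSP summary line above reports identity and positive counts over the alignment length, with the percentage in parentheses. As a minimal sketch for checking those figures, the Python below parses one summary line and recomputes both percentages from the raw counts. The function name `parse_hsp_summary` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool; the pattern accepts either spelling of the "Positives" label.

```python
import re

# Hypothetical helper, written for illustration only: it is not part of the
# source page or of BLAST itself. It matches the summary-line layout shown
# in the records above.
HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts, reported percentages, and recomputed percentages."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, reported_identity, positive, pos_length, reported_positive = match.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        "reported_identity_pct": float(reported_identity),
        "reported_positive_pct": float(reported_positive),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100.0 * identical / length, 2),
        "positive_pct": round(100.0 * positive / pos_length, 2),
    }


if __name__ == "__main__":
    # Summary line copied from the TAIR hit AT3G17900.1 above.
    summary = "Identity = 454/869 (52.24%), Positives = 585/869 (67.32%), Query Frame = 1"
    print(parse_hsp_summary(summary))
```

Comparing `identity_pct` with `reported_identity_pct` is a quick way to confirm that a record was copied without truncation.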
The following BLAST results are available for this feature: