Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
TGTTTTTTCGGATATCAGATATGTGTGGCAACACACATATCAGCTAAAAGACAAATAGCTAAAAGACAAAAAAGGAAGTACCATTTATGTTAAAAAAGTACCATTCTGTAAAAAAAAAGTACCATTCAGTTAAAAAAAGTACCATTTTTGTTTAAAAAAGTACCATTTAATTTCTTTTTTATCATTTTTCTTATTTTGTCTTTTACTATGATATGCATTTGCCCACACATATCTGATATNATAAACGCCGAAAATACTTCCTCCCAATATCTAATCGTCCTTCGAAAGCCTTTCCTTTGATTAAAAAAAAAAAAATGTAAATAGCGCGCAGCTAGGCACCCTTTAATGTATCATGCGAGTTTCGTGCACCACTGCACCGTCTCTTACCTAATCCTTTCTGCCTAGCAACCGCTCGAGGGCTCGAGGCTTGAAGTGATTGTGTCATCGTGTTTACATTCCAACGAAATAAACGCACAGAGAGAGAAATTGCTCTCTTCCTAAAATAGGCGACCAAAAACCCTAATTTTTTTTTTCCAATCGACAAATTCGATGTCTTCAATCTATCAAGTTTAAGCTCAGAGCAAAATTTAACTAACAATTTCCCCAATTTCCCCCCCAAATCGGTTAATTATGGATCTTTTGAAGGATTATCCTTCAGAAATTGAAATTGGAAGCTCCATAGACTCGTTCCAGAAGGGAATGGATTCACAGAAGCAGCTTTTCCATACACAGATCGATCAACTCCGCAATATCGTCGTCACTCAGTGCAAACTCACCGGCGTTAACCCTCTCTCTCAAGAAATGGTTATCTTTCTCTCTCCTTTAAATGTAACTTGTTGGGTTTTCAACTTTAGGGGAAATTTATGAACCCTAGGAAGTTGCGTTGTTGGGTTTATTTTACCTCATTTCTAGATTTGTATCGATTTCTCATGTTAGTTTTATTTTCTTGTATGTGTTTGGCAGGCAGCTGGTGCTCTATCTATAAATATTGGTAAGTTTGGTGAATATATTTACTTGGGTTCTTGGATTTTTTTGTTTTGCACGTAATTGGTTGTAGTGTAACTGTTCTGTATTCGTATTGTCTCGAACACTTGTTCATTGAGTGTATTTTTCTGAGTTCTTGTTTGATGAAAGAGCTGTATGACGTAAATTTTTATCTGTGATCATCGACCCACTTTTCCTTCGTAGTGTTGCAACTGTGTTTGAAGTTTAAGCTAACTGTAAATATTTTTTTTTGATGAATTTACAGGGAAGAAGCCTAGAGATTTGTTAAATCCGAAGGCTGCAAAATATATGCAAGCAGTTTTTTCTATGAAAGATGCAATCAGTAAAAAGGAATCTCGTGAGATTGGCGCTCTTTTTGGTCTAACAGTCACACAGGTATCGTCTGATAGAACTTATATTGGCATTAAGTGTGCTGCATTCTTTAGATGAAACAATGTGTAATTTGAACTGGCTTGTGATAGGTGAGGGAGTATTTTGCCAGTCAGCGGTCCAGGGTGAGGAAATTGACACGGTTATCTAGAGAAAGAGCGATTAGGGCAAGTGGTAATAAGGAATTACAAGATGGCGTACCTGCTGAGTCTGACCTTATGCTCCCCATTGAAACTGCTCCATTAAGCTCTGTTGGTCCTTCTAATGTCGAGGAAGCACCTTCTTGTTCAAACCAGGATGAAGTCTTGCCAGGACTAGGCGAGTCTGAAAGACATTTTGTTGATAATATTTTCAGTTCTATGTGTAAAGAAGAAACCTTTTCTGGTCAGGTGAAATTGATGGAGTGGATCTTGCAGATAGAAAATCCTACAATCTTGAGTTGGTAATTGCTTGTCAACCAACATCTACTTCCGTTTTTTGTGACTTTCATAGTTCTTTAGTGACTCGGCTTCTTTTCTGTATGGCTGTAGGTTTTTGATCAAAGGTGGTGTCATGATTCTAGCTACTTGGTTGAGTCAAGCTGCTAGTGAAGAACAAACAAGTGTTATTTCTGCTGCTCTCAAGGT
CTAAATTTATGTCGCTGCAGACCTTTTTGGTTTCATGTTAGGGGTTAAACTAATCTGCTATGATGTTTTGGCTATTATGTAGGTTTTTTGTCATTTACCCTTGAATAAAGCACTTCCTGCTCACATGTCGGCGTTACTCCAAGGTGTCAACAAATTGCGATTCTACCGAGTACGAGGTTTGTACTCGCTTTCTTCCTAATGATTGAAGCTCATATTGCATCTAATAGATTCTGAATTCATGCAAAATGCATTATAGTGTGTCTGAAGGCAGCATAAGTGGGTATCAACATCAAATTCTTATATGAACATGGTGATTACTTAGTGTATTTCTTTTTTAATGATGATGGTTGCCCATGAACAATGTGTTGATCTTCCAAATTATAGCGGTCAGTACTTTTATGGGGTAAGTTTTTGATGCCCAGAATCTGGTAACAAGGTTGGGTGTTGGGGCTCTGGGTAGTGGGTACTGATACTTTTTTGTTCGCGTGTATTATGGGTTATGACAAGGCTACAATCACACCTAGTATTTATGGAATATTACGTTTCTCACAATTCTATTGAATTAACTATCGTTTTTTTGTACAAAAAAAATAATTCCAAGTGGAAGCTTGTCATTCAGTGAATGGGACAGATAATTGACATAATTCATCGTTGATGTTCATATATATCATTGTATATCTCGTGTTTAACATCTGAACTTTTGTGACCTCCATAAGAATCTGCTCTGTTTTTTCTTTGGTGTACTTTAGATATTTTTTTTTCACTGCTTCCCTTTCTTTGCAATTCTTATTAATCTGATTTGAAGACCAGACATCTCAAACAGGGCAAGAATTCTCTTGTCAAAGTGGAGCAAAATGTTTGTCAAAATCCAAGCTTCAAAGAAACCTAATGGCATTAGAGCTGCCGGTGAGCAAGAGATAGACCTAACCCGAAGGCATGTTTCCTGCATAGCCTAAAACAATCAGTTGTTTCATATTGCCTTTTCCCTTCTATAACGTGGAACGTGAATTATGTTCAGGATTGGCGAAATTGTTGGAGATTTTTCCTGGCAGTCTTCTGTAGATAATTCTGTGAGTAGAATGTAATTTCAAATTATTACTTGCTTTGAAGGTTAATGTACTTACATAACATGTTTATGGCTTATTTCTTGTTTCACAGGATGACATGCTAGTTACCTCATACGTAGGCCCAGACGATATCAGGTGATTATGGATTTCCACTTAAATTCATTTTGTTAGTCGGCCTTACTTTTCATAGATTGACAGTTCTGATTTATGGTGGTTGTCAGGAAAGTGGAAACTCTAGAAGCAATGAAGCTGCTCACAGCATCTTCGGATGATTCTAATAAGAGGCTCTTAGGGACAACGGCGTCTCGTATCCTTGACTTCTTATTTATTTGCTCACAGCAGTGTTATAAAAAATATGTGGACTGTAGAGTTGACTATATGAATCCTGTCTTGATGTAGCTGAATAATTTTCCAATATCTTGAGTAGCCATTCCAGTTAAGCATAGGTACTATCTAGAAGATCTTACACATTCCTTAGCTATTTACAGATAATAAAGAACGCAGAAAGGTTCAGCTTGTTGAACAACCAGGCCAAAAAGCAGTTAGTAACAGCCAGTCTATAAAAGCAGTTCCTGCCAATCAAAGGCGTCCAATTACTGCTGATGAAATCCAAAAGGCAAAAATGCGTGCGCAATTTATGCAGAGCAACAAAAGGAAAACAGAAGGTCCCAGGAAACCTTCTTTACTCACTAATGATTTGCTTTCAGCATCAGAAGCTTACCTTCGACCTAAGCTTGAAGCGCAGAAAAAGGCAAGGTTGCTTCCTCCAAAGAGTATTATTAAACAGATGGATTGTCTTTCTGATCAGAAGCCAGCTTTTGATCAGAAGGAATCTTTGATGGACAAGTGCCGGAGGGTTCAAATCCCTTGGTGGGCACCCCCAGGTACAGTTTCTTCAAGTGTTTGTTGGATTCCTTTATTTCCTTTTCAGAT
TATTGCTCTTAAAGGGCACTCAGTATTGGACAATAACTGTTTATTTCTTCAATTAAAGGAAATAAGGATCACAATATTGGTCATGTGGGAAATTCTTTTAGATGACAAGCAACGGTTTTACGATTCCCCCCACCCCCCACCAATGCCCGCTATAAAATCGAAGGCAAATGTCCTGTGAAATAATAGATTTTCCTTAATTTAGTTGCGGTTCTGTATATATTGTATAGAACATAATCAGGGCAAAGAGTTATTGACTAAGTTTTTTCCCCCTTCTGAAACTTAAGCTCGGAATTGTAGAATGGCATAACCTCTGATGCCCTCACTCGTTTGTAGTTGCAATTCTAACATTCACCTTGATTTCTGTTTGGCGCGGTTGCTTGGTGAAAGATATACTAACAATATACTTTGCAAGGGACAAATGTCTATGTTTATCCGCTTCATTTGCTTTAAAGAGCATTGAAATTTACCATTAATTTATCAAAATTTTCCCTGCGTTACGGTGTATGAAGAATGCTATTACATACTCCATAATTCGTCTTAAAATTATTGTTGACGAGAAAACAATATTATAAGTTCATCATGTTTTGACTGTTTGATGTAAAAACCCTAAATACCATTTGCATGGTGTAAAACTGCCCCCTCAATGTTGGATTAACCTTTTGATGAACAAGTCTATTTTTATAGCAAGGTTCTTATTTTCTTGTGGGCTTCCTAAAAACGACAATGGCAGAGGAAATAGAATGGAAGCAAGTTCAATCAGAGATTATTGGTGCCTACTTTCCTGTATTTCATATGCAATTTCCTTTTGCTAAACTAAAACAAAGTTCACTAATTGTTGTAAGTTTTGACATGAAGGCCAGTAGAATTTTAGGGGTCCATAGTAGAGCTCCTAGAGACTTGAAGACCTCCATAGCGCAAGTAGGAAATAAGATTTGATTACATTTTCCAGAGAGGATTATATTAATTGTTACAAAAGTGGCATTCTCAAAATCTCTTAAGGTAGAGATATTCTTATTTACAATGGGACGATTTGGAGTGCATCTGCATGTGCATGAATGCGTAAATTGGTGTGTGTTGTATTCATGGTACCGCTCACATGATGCTGATCCCCATTCTTACTTAGGATGCAAGGGATAATGACTGATGTCCTTATTGCTTGTAATATTACTTGATAATGGAGATTGAACTGCAGAAGTAAAAACAGTTTGAATGTTGTTTTTACAGAGAATAAATATCATGTAACTATTGTTGTTTGTTAATAACGCCTCTTTAGTCTTTACCCTAGAGTAGTGATCATTAAAAGGCAATAGATTAGAACTGGTGATCAGTTGACCGTTTGATGCCGTTTATATCAAATATTCAAACTCCCTTGTTGCCAGCCGTCTTGGATTTGAAAGTTTCATTTCTGATATTTATCAGAAGTTGCATTATCGTTGATGGTATGATGTTATGAATGTTCTTGATTCACGCCTTTTGTTTTGAGGAGCTGTTAATGGCCTCGGAGGCCTAAGCTTCAATATTTTGTGCCTATATTCCAGAATTTGGTGATTGGCGTACCACTACCGCCATGTTGTTTGAGCTGAAGGGCATGGAGTGTCTTTTGCCAGCAAATGAAGACCTGGTATTGAATATTGATCTTAGGCATATCTCCTGGGGTGTTTTCATATAAATCCTCTGAGAAGCCAACCTTTGGGTGGTCGAAGATTCAATTTACATGGTTTAGTCTCATTACCAATATGGATGTTTTCTATTTTTGGATGGTCGGATTTTTGTTCGGTAAACAAGTAAGTGACCACTGTGGATATGAAACTGTCAGTTCTCTGTCAAAAAAACTATGTAATTTGCGCTGATGACTATGACATGTTGTAAGCATGAGCTTGGAACAACTGTTGTATATCATTGATGTAGCTTGGAACAACAGCATGTGCAGGGCAGGAATATGCGGTAATGGAGTTCATATCGAACTCGGAAAAGGTGTATTAAAAGTCAACTGTAGCATA
ACAATGGCGTGAATAATGGGGGGGGGGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGTGATGGCTCTGCATGAGAAGAAAACAAAACTTATACGTTAGTAAAAAAAAGTCAACCATCTCAACACGTACAAACAAGTCAGCAAAATAGAACTAAAGACGTTGAGTTTTTATTTGGTCAATTAGGCAACGCAATTTGCTTTCCTGTTTGGTTATGCTGCAGTTACTATTTCCTCTTGGGGTATATGATGATTTTGCGGTTTTGATATGTAAAATCATTGTTGTGAGTGATTTTTGGGTTTACACACTTGGAACTATATCCAACTTGTGTTTTCTGTATGGATTGTTGATCTATTGTTTGTCCATCCCTGCAGAAATTGATCTGGCAGTTGAAATTGGTAGTGGAGAAAGCAGCAAGGAAACAGAAGTTCAGAGAAACCGTATCCATCGGGAAAAGGAAACTACTTATCCAACACTTGAAGATGTACCACCAAACCCGAAGGAGCCATGGGATGTCGAGATGGACTATGATGACACCTTAACCCCAGAAATACCCGTTGAGCAGTTACCTGATGCTGATGGCCCGTGTTCGCCTCTTGCTTCAGCTGAAACTCATAATCCCTCAGCTGACCCTTCCACTTTATCACAAGCTGAGCCCGATCTTGAACTCCTGGCAGTTTTGCTGAAAAATCCAGAGTTAGTTTTTGCCCTGACTTCGGGTCAAGCAAGCGGGCTTTCAAGTGAGGATACTGTAAAACTTCTAGACTTGCTCAAGTCAAATGGAGGGGCAGCATTGTTAAACGGGGCTTCTGAGAAGCAAGAGGAAGTGAAAGTTGAAGTTTCTCTTCCTTCGCCAACTCCTACTCCGACCAGAAATGCTGAAGTCTCTCTTCCATCTCCAACCCCACCAAGCAGATTTGAAGTTTCTTTACCCTCCCCGACACCTTCCACCAATCCTGGGCCGGTGCGTGAAAGTTGTACTTATATCGCTAAACTCTCACATTTCTTTTATTTTGCAAATGGATTTTGCCAATTTTTTAGTGGTAATTTTGGTCTACCTCTAAATGTCATTTCCATCTTGTTACTGACAAATTTACCCATAAGCCTTCACCTAGCTCTTTTAGTGCTCAGAAAATTAGTCAAGAACACCACGCTTATCCCATTTTTTTTGCTTCAATTTAGGAAATTGGTTCTCGTTTGTTATGTTCTCTTTTATAAGATTATATATATTTGCTATTAAGGACCAAAACTCATGGAATTGGGAGCGTAGGATGTAGGTGGTGTAGGAGTTGATATTTCAGGAAGATGACAGCAAATTCCTAATGGTTCATATTTAACATAAAAGTCACCTATCCCATCGGCCAAACAGCTCCTTGAATCCAAAAAGAACGAAAAACAGAATCTGATAATTAAATAGCAATTACTATTTGATAAATTTTGGATCAATGTTTGTCACTTGGCTATAAATAATGTTGGCGGTGTACTGTTCATTACAACAGTTGGGGGGTGGGGGGGAGGTAGAAATCAAGCTATTTATTGTATTGATTAAATGTGTATTATTGTTCCTGAATTTGCCTAATTTTCTTGATTAAGCTTGTATTTAGAGATTGGATGCTAAAATGGTATGGTTTACTTAAATTTGCAGAATGGATGGACATCAACAGAGGCTGCAAAAAATCCATTTTCCCGGCGCGCGATGGGAGTCCAGGATACTATGACAGTAACAACTGCTGTTATTAGTTCAGATAGGTCTCAGAATAGCTGCAGTTTGCAGCCACAGTATCCATCAACAGGACCTGGTGTAATCACATCTTTGCCTCAGCACCTTCCTTGTCA
AACTCACATACCCGAGTATCAACCCGCATTTTATGACCCTGTTGTGAATCAAAATCCACCAACCATGAACTCTTCATATTCCCGGCCAACACCTGCACATTCACAGTCACAAATACATGCACATTCAAAGCAGTATCTGATGCCTGAAGATAATAACGCTAGGCAAGGCTTGTCTCAGTCTCACAATTCACGTTTCTCAAATCATGTTAGTCAAAATGTATATAATCAGACAATGACCGGCGGCCCCACATGGGAGAGGAACGATTATCCGACGGGTAGTCACAAGGGATATGAATCATGGAGTCCAGAGAATAGCCCTGGCCCATCAAGGTATCCACCTGAGCATCATCATGGGCATGGGCATGGGCATGGATGGAGTTACCCAGAGCAGAGGGAAAGGGGATATGCGCCCCCTGATAGAATAAGTAGGCACCAAAATTGGCGTGGTAGTGGTGGTGGTAATGGTGGTGGTGGTGGCCGTGGCGGTGTTAACCGACGATGGAATGGTGGTGACAGGAGACGATAATGACTCTGTATTGGCTTGTATGACTGTTGTTCTGTGCATAGGCAATTTTTTTATACCAAACGTACCCTGGAATGTTGGGTTGACCGACCGGATTTTGTTGGATAGTTGAGTCACGGACCTTCAACCGTGGATCCGGGAACACATTTGATATGGTAATTCATTTGATTAGGAAGTCTTCACACAAAACTTTTATTAGTCTTGTAGAGGAAATGTTATAACATTAATTAATTAATTGACCCGAAGAAACTGTTTTGAAGATTGGTAATTTTGTGTTTTGATAAAATATCCATCAAAAAACAACACTACCTAAATTTTAAAAAGCGGATCAAAGCCCAATAATAT
mRNA sequence
TGTTTTTTCGGATATCAGATATGTGTGGCAACACACATATCAGCTAAAAGACAAATAGCTAAAAGACAAAAAAGGAAGTACCATTTATGTTAAAAAAGTACCATTCTGTAAAAAAAAAGTACCATTCAGTTAAAAAAAGTACCATTTTTGTTTAAAAAAGTACCATTTAATTTCTTTTTTATCATTTTTCTTATTTTGTCTTTTACTATGATATGCATTTGCCCACACATATCTGATATNATAAACGCCGAAAATACTTCCTCCCAATATCTAATCGTCCTTCGAAAGCCTTTCCTTTGATTAAAAAAAAAAAAATGTAAATAGCGCGCAGCTAGGCACCCTTTAATGTATCATGCGAGTTTCGTGCACCACTGCACCGTCTCTTACCTAATCCTTTCTGCCTAGCAACCGCTCGAGGGCTCGAGGCTTGAAGTGATTGTGTCATCGTGTTTACATTCCAACGAAATAAACGCACAGAGAGAGAAATTGCTCTCTTCCTAAAATAGGCGACCAAAAACCCTAATTTTTTTTTTCCAATCGACAAATTCGATGTCTTCAATCTATCAAGTTTAAGCTCAGAGCAAAATTTAACTAACAATTTCCCCAATTTCCCCCCCAAATCGGTTAATTATGGATCTTTTGAAGGATTATCCTTCAGAAATTGAAATTGGAAGCTCCATAGACTCGTTCCAGAAGGGAATGGATTCACAGAAGCAGCTTTTCCATACACAGATCGATCAACTCCGCAATATCGTCGTCACTCAGTGCAAACTCACCGGCGTTAACCCTCTCTCTCAAGAAATGGCAGCTGGTGCTCTATCTATAAATATTGGGAAGAAGCCTAGAGATTTGTTAAATCCGAAGGCTGCAAAATATATGCAAGCAGTTTTTTCTATGAAAGATGCAATCAGTAAAAAGGAATCTCGTGAGATTGGCGCTCTTTTTGGTCTAACAGTCACACAGGTGAGGGAGTATTTTGCCAGTCAGCGGTCCAGGGTGAGGAAATTGACACGGTTATCTAGAGAAAGAGCGATTAGGGCAAGTGGTAATAAGGAATTACAAGATGGCGTACCTGCTGAGTCTGACCTTATGCTCCCCATTGAAACTGCTCCATTAAGCTCTGTTGGTCCTTCTAATGTCGAGGAAGCACCTTCTTGTTCAAACCAGGATGAAGTCTTGCCAGGACTAGGCGAGTCTGAAAGACATTTTGTTGATAATATTTTCAGTTCTATGTGTAAAGAAGAAACCTTTTCTGGTCAGGTGAAATTGATGGAGTGGATCTTGCAGATAGAAAATCCTACAATCTTGAGTTGGTTTTTGATCAAAGGTGGTGTCATGATTCTAGCTACTTGGTTGAGTCAAGCTGCTAGTGAAGAACAAACAAGTGTTATTTCTGCTGCTCTCAAGGTTTTTTGTCATTTACCCTTGAATAAAGCACTTCCTGCTCACATGTCGGCGTTACTCCAAGGTGTCAACAAATTGCGATTCTACCGAGTACGAGACATCTCAAACAGGGCAAGAATTCTCTTGTCAAAGTGGAGCAAAATGTTTGTCAAAATCCAAGCTTCAAAGAAACCTAATGGCATTAGAGCTGCCGGTGAGCAAGAGATAGACCTAACCCGAAGGATTGGCGAAATTGTTGGAGATTTTTCCTGGCAGTCTTCTGTAGATAATTCTGATGACATGCTAGTTACCTCATACGTAGGCCCAGACGATATCAGGAAAGTGGAAACTCTAGAAGCAATGAAGCTGCTCACAGCATCTTCGGATGATTCTAATAAGAGGCTCTTAGGGACAACGGCGTCTCATAATAAAGAACGCAGAAAGGTTCAGCTTGTTGAACAACCAGGCCAAAAAGCAGTTAGTAACAGCCAGTCTATAAAAGCAGTTCCTGCCAATCAAAGGCGTCCAATTACTGCTGATGAAATCCAAAAGGCAAAAATGCGTGCGCAATTTATGCAGAGCAACAAAAGGAAAACAGAAGGTCCCAGGAAACCTTC
TTTACTCACTAATGATTTGCTTTCAGCATCAGAAGCTTACCTTCGACCTAAGCTTGAAGCGCAGAAAAAGGCAAGGTTGCTTCCTCCAAAGAGTATTATTAAACAGATGGATTGTCTTTCTGATCAGAAGCCAGCTTTTGATCAGAAGGAATCTTTGATGGACAAGTGCCGGAGGGTTCAAATCCCTTGGTGGGCACCCCCAGAAATTGATCTGGCAGTTGAAATTGGTAGTGGAGAAAGCAGCAAGGAAACAGAAGTTCAGAGAAACCGTATCCATCGGGAAAAGGAAACTACTTATCCAACACTTGAAGATGTACCACCAAACCCGAAGGAGCCATGGGATGTCGAGATGGACTATGATGACACCTTAACCCCAGAAATACCCGTTGAGCAGTTACCTGATGCTGATGGCCCGTGTTCGCCTCTTGCTTCAGCTGAAACTCATAATCCCTCAGCTGACCCTTCCACTTTATCACAAGCTGAGCCCGATCTTGAACTCCTGGCAGTTTTGCTGAAAAATCCAGAGTTAGTTTTTGCCCTGACTTCGGGTCAAGCAAGCGGGCTTTCAAGTGAGGATACTGTAAAACTTCTAGACTTGCTCAAGTCAAATGGAGGGGCAGCATTGTTAAACGGGGCTTCTGAGAAGCAAGAGGAAGTGAAAGTTGAAGTTTCTCTTCCTTCGCCAACTCCTACTCCGACCAGAAATGCTGAAGTCTCTCTTCCATCTCCAACCCCACCAAGCAGATTTGAAGTTTCTTTACCCTCCCCGACACCTTCCACCAATCCTGGGCCGAATGGATGGACATCAACAGAGGCTGCAAAAAATCCATTTTCCCGGCGCGCGATGGGAGTCCAGGATACTATGACAGTAACAACTGCTGTTATTAGTTCAGATAGGTCTCAGAATAGCTGCAGTTTGCAGCCACAGTATCCATCAACAGGACCTGGTGTAATCACATCTTTGCCTCAGCACCTTCCTTGTCAAACTCACATACCCGAGTATCAACCCGCATTTTATGACCCTGTTGTGAATCAAAATCCACCAACCATGAACTCTTCATATTCCCGGCCAACACCTGCACATTCACAGTCACAAATACATGCACATTCAAAGCAGTATCTGATGCCTGAAGATAATAACGCTAGGCAAGGCTTGTCTCAGTCTCACAATTCACGTTTCTCAAATCATGTTAGTCAAAATGTATATAATCAGACAATGACCGGCGGCCCCACATGGGAGAGGAACGATTATCCGACGGGTAGTCACAAGGGATATGAATCATGGAGTCCAGAGAATAGCCCTGGCCCATCAAGGTATCCACCTGAGCATCATCATGGGCATGGGCATGGGCATGGATGGAGTTACCCAGAGCAGAGGGAAAGGGGATATGCGCCCCCTGATAGAATAAGTAGGCACCAAAATTGGCGTGGTAGTGGTGGTGGTAATGGTGGTGGTGGTGGCCGTGGCGGTGTTAACCGACGATGGAATGGTGGTGACAGGAGACGATAATGACTCTGTATTGGCTTGTATGACTGTTGTTCTGTGCATAGGCAATTTTTTTATACCAAACGTACCCTGGAATGTTGGGTTGACCGACCGGATTTTGTTGGATAGTTGAGTCACGGACCTTCAACCGTGGATCCGGGAACACATTTGATATGGTAATTCATTTGATTAGGAAGTCTTCACACAAAACTTTTATTAGTCTTGTAGAGGAAATGTTATAACATTAATTAATTAATTGACCCGAAGAAACTGTTTTGAAGATTGGTAATTTTGTGTTTTGATAAAATATCCATCAAAAAACAACACTACCTAAATTTTAAAAAGCGGATCAAAGCCCAATAATAT
Coding sequence (CDS)
ATGGATCTTTTGAAGGATTATCCTTCAGAAATTGAAATTGGAAGCTCCATAGACTCGTTCCAGAAGGGAATGGATTCACAGAAGCAGCTTTTCCATACACAGATCGATCAACTCCGCAATATCGTCGTCACTCAGTGCAAACTCACCGGCGTTAACCCTCTCTCTCAAGAAATGGCAGCTGGTGCTCTATCTATAAATATTGGGAAGAAGCCTAGAGATTTGTTAAATCCGAAGGCTGCAAAATATATGCAAGCAGTTTTTTCTATGAAAGATGCAATCAGTAAAAAGGAATCTCGTGAGATTGGCGCTCTTTTTGGTCTAACAGTCACACAGGTGAGGGAGTATTTTGCCAGTCAGCGGTCCAGGGTGAGGAAATTGACACGGTTATCTAGAGAAAGAGCGATTAGGGCAAGTGGTAATAAGGAATTACAAGATGGCGTACCTGCTGAGTCTGACCTTATGCTCCCCATTGAAACTGCTCCATTAAGCTCTGTTGGTCCTTCTAATGTCGAGGAAGCACCTTCTTGTTCAAACCAGGATGAAGTCTTGCCAGGACTAGGCGAGTCTGAAAGACATTTTGTTGATAATATTTTCAGTTCTATGTGTAAAGAAGAAACCTTTTCTGGTCAGGTGAAATTGATGGAGTGGATCTTGCAGATAGAAAATCCTACAATCTTGAGTTGGTTTTTGATCAAAGGTGGTGTCATGATTCTAGCTACTTGGTTGAGTCAAGCTGCTAGTGAAGAACAAACAAGTGTTATTTCTGCTGCTCTCAAGGTTTTTTGTCATTTACCCTTGAATAAAGCACTTCCTGCTCACATGTCGGCGTTACTCCAAGGTGTCAACAAATTGCGATTCTACCGAGTACGAGACATCTCAAACAGGGCAAGAATTCTCTTGTCAAAGTGGAGCAAAATGTTTGTCAAAATCCAAGCTTCAAAGAAACCTAATGGCATTAGAGCTGCCGGTGAGCAAGAGATAGACCTAACCCGAAGGATTGGCGAAATTGTTGGAGATTTTTCCTGGCAGTCTTCTGTAGATAATTCTGATGACATGCTAGTTACCTCATACGTAGGCCCAGACGATATCAGGAAAGTGGAAACTCTAGAAGCAATGAAGCTGCTCACAGCATCTTCGGATGATTCTAATAAGAGGCTCTTAGGGACAACGGCGTCTCATAATAAAGAACGCAGAAAGGTTCAGCTTGTTGAACAACCAGGCCAAAAAGCAGTTAGTAACAGCCAGTCTATAAAAGCAGTTCCTGCCAATCAAAGGCGTCCAATTACTGCTGATGAAATCCAAAAGGCAAAAATGCGTGCGCAATTTATGCAGAGCAACAAAAGGAAAACAGAAGGTCCCAGGAAACCTTCTTTACTCACTAATGATTTGCTTTCAGCATCAGAAGCTTACCTTCGACCTAAGCTTGAAGCGCAGAAAAAGGCAAGGTTGCTTCCTCCAAAGAGTATTATTAAACAGATGGATTGTCTTTCTGATCAGAAGCCAGCTTTTGATCAGAAGGAATCTTTGATGGACAAGTGCCGGAGGGTTCAAATCCCTTGGTGGGCACCCCCAGAAATTGATCTGGCAGTTGAAATTGGTAGTGGAGAAAGCAGCAAGGAAACAGAAGTTCAGAGAAACCGTATCCATCGGGAAAAGGAAACTACTTATCCAACACTTGAAGATGTACCACCAAACCCGAAGGAGCCATGGGATGTCGAGATGGACTATGATGACACCTTAACCCCAGAAATACCCGTTGAGCAGTTACCTGATGCTGATGGCCCGTGTTCGCCTCTTGCTTCAGCTGAAACTCATAATCCCTCAGCTGACCCTTCCACTTTATCACAAGCTGAGCCCGATCTTGAACTCCTGGCAGTTTTGCTGAAAAATCCAGAGTTAGTTTTTGCCCTGACTTCGGGTCAAGCAAGCGGGCTTTCAAGTGAGGATACTGTAAAACTTCTAGACTTGCTCAAGTCAAATGGAGGGGCAGCATTGTTAAA
CGGGGCTTCTGAGAAGCAAGAGGAAGTGAAAGTTGAAGTTTCTCTTCCTTCGCCAACTCCTACTCCGACCAGAAATGCTGAAGTCTCTCTTCCATCTCCAACCCCACCAAGCAGATTTGAAGTTTCTTTACCCTCCCCGACACCTTCCACCAATCCTGGGCCGAATGGATGGACATCAACAGAGGCTGCAAAAAATCCATTTTCCCGGCGCGCGATGGGAGTCCAGGATACTATGACAGTAACAACTGCTGTTATTAGTTCAGATAGGTCTCAGAATAGCTGCAGTTTGCAGCCACAGTATCCATCAACAGGACCTGGTGTAATCACATCTTTGCCTCAGCACCTTCCTTGTCAAACTCACATACCCGAGTATCAACCCGCATTTTATGACCCTGTTGTGAATCAAAATCCACCAACCATGAACTCTTCATATTCCCGGCCAACACCTGCACATTCACAGTCACAAATACATGCACATTCAAAGCAGTATCTGATGCCTGAAGATAATAACGCTAGGCAAGGCTTGTCTCAGTCTCACAATTCACGTTTCTCAAATCATGTTAGTCAAAATGTATATAATCAGACAATGACCGGCGGCCCCACATGGGAGAGGAACGATTATCCGACGGGTAGTCACAAGGGATATGAATCATGGAGTCCAGAGAATAGCCCTGGCCCATCAAGGTATCCACCTGAGCATCATCATGGGCATGGGCATGGGCATGGATGGAGTTACCCAGAGCAGAGGGAAAGGGGATATGCGCCCCCTGATAGAATAAGTAGGCACCAAAATTGGCGTGGTAGTGGTGGTGGTAATGGTGGTGGTGGTGGCCGTGGCGGTGTTAACCGACGATGGAATGGTGGTGACAGGAGACGATAA
Protein sequence
MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAAGALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQRSRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQDEVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILATWLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILLSKWSKMFVKIQASKKPNGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYVGPDDIRKVETLEAMKLLTASSDDSNKRLLGTTASHNKERRKVQLVEQPGQKAVSNSQSIKAVPANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEAQKKARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGESSKETEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCSPLASAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLLKSNGGAALLNGASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPGPNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVITSLPQHLPCQTHIPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHAHSKQYLMPEDNNARQGLSQSHNSRFSNHVSQNVYNQTMTGGPTWERNDYPTGSHKGYESWSPENSPGPSRYPPEHHHGHGHGHGWSYPEQRERGYAPPDRISRHQNWRGSGGGNGGGGGRGGVNRRWNGGDRRR
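The protein sequence above is the conceptual translation of the CDS. The following is a minimal Python sketch of that relationship, assuming the standard nuclear genetic code and assuming that the wrapped CDS lines are joined into a single string before use. The function name `translate` and the two short fragments (the first and last codons of the CDS listed above) are illustrative only and are not part of the database.

```python
from itertools import product

# Standard genetic code, built compactly: codons are enumerated in
# T, C, A, G order at each position and paired with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (e.g. line wraps) is removed first. Codons containing
    ambiguous bases such as N are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 14 codons of the CDS above
print(translate("ATGGATCTTTTGAAGGATTATCCTTCAGAAATTGAAATTGGA"))  # MDLLKDYPSEIEIG
# Last 7 codons of the CDS above, including the TAA stop
print(translate("GGTGGTGACAGGAGACGATAA"))  # GGDRRR
```

Both fragments agree with the start (MDLLKDYPSEIEIG...) and end (...GGDRRR) of the protein sequence listed above.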
Homology
BLAST of Spo21974.1 vs. NCBI nr
Match:
gi|902231620|gb|KNA22330.1| (hypothetical protein SOVF_035080 [Spinacia oleracea])
HSP 1 Score: 1862.4 bits (4823), Expect = 0.000e+0
Identity = 959/959 (100.00%), Positives = 959/959 (100.00%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA
Sbjct: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR
Sbjct: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD
Sbjct: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
Query: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT
Sbjct: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
Query: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL
Sbjct: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
Query: 301 SKWSKMFVKIQASKKPNGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYVGP 360
SKWSKMFVKIQASKKPNGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYVGP
Sbjct: 301 SKWSKMFVKIQASKKPNGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYVGP 360
Query: 361 DDIRKVETLEAMKLLTASSDDSNKRLLGTTASHNKERRKVQLVEQPGQKAVSNSQSIKAV 420
DDIRKVETLEAMKLLTASSDDSNKRLLGTTASHNKERRKVQLVEQPGQKAVSNSQSIKAV
Sbjct: 361 DDIRKVETLEAMKLLTASSDDSNKRLLGTTASHNKERRKVQLVEQPGQKAVSNSQSIKAV 420
Query: 421 PANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEAQKK 480
PANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEAQKK
Sbjct: 421 PANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEAQKK 480
Query: 481 ARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGESSKE 540
ARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGESSKE
Sbjct: 481 ARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGESSKE 540
Query: 541 TEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCSPLA 600
TEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCSPLA
Sbjct: 541 TEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCSPLA 600
Query: 601 SAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLLKSN 660
SAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLLKSN
Sbjct: 601 SAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLLKSN 660
Query: 661 GGAALLNGASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPG 720
GGAALLNGASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPG
Sbjct: 661 GGAALLNGASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPG 720
Query: 721 PNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVITSLPQ 780
PNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVITSLPQ
Sbjct: 721 PNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVITSLPQ 780
Query: 781 HLPCQTHIPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHAHSKQYLMPEDNNARQ 840
HLPCQTHIPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHAHSKQYLMPEDNNARQ
Sbjct: 781 HLPCQTHIPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHAHSKQYLMPEDNNARQ 840
Query: 841 GLSQSHNSRFSNHVSQNVYNQTMTGGPTWERNDYPTGSHKGYESWSPENSPGPSRYPPEH 900
GLSQSHNSRFSNHVSQNVYNQTMTGGPTWERNDYPTGSHKGYESWSPENSPGPSRYPPEH
Sbjct: 841 GLSQSHNSRFSNHVSQNVYNQTMTGGPTWERNDYPTGSHKGYESWSPENSPGPSRYPPEH 900
Query: 901 HHGHGHGHGWSYPEQRERGYAPPDRISRHQNWRGSGGGNGGGGGRGGVNRRWNGGDRRR 960
HHGHGHGHGWSYPEQRERGYAPPDRISRHQNWRGSGGGNGGGGGRGGVNRRWNGGDRRR
Sbjct: 901 HHGHGHGHGWSYPEQRERGYAPPDRISRHQNWRGSGGGNGGGGGRGGVNRRWNGGDRRR 959
BLAST of Spo21974.1 vs. NCBI nr
Match:
gi|731346283|ref|XP_010684373.1| (PREDICTED: homeobox protein LUMINIDEPENDENS isoform X2 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 1375.1 bits (3558), Expect = 0.000e+0
Identity = 744/986 (75.46%), Positives = 824/986 (83.57%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
M++LKDYPSEI+IG+S++SFQKGMD QK LFH QIDQLRNIVVTQCKLTGVNPLSQEMAA
Sbjct: 1 MEVLKDYPSEIDIGNSMESFQKGMDLQKHLFHNQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSINIGK+PRDLLNPKA KYMQAVFS+KDAISKKE REIGALFGLTVTQVREYFA QR
Sbjct: 61 GALSINIGKRPRDLLNPKALKYMQAVFSIKDAISKKELREIGALFGLTVTQVREYFAGQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
SRVR++ RLSRE+AIR S KE+QDGV A++DLM PI+ PLSS+GPSN EE PSCSNQD
Sbjct: 121 SRVRRIVRLSREKAIRTSATKEIQDGVSADADLMHPIDPTPLSSIGPSNAEEVPSCSNQD 180
Query: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
EVLPGLGESERHFV+NIFSSMCKEETFSGQVKLM+WILQIENP IL WFL+KGGVMILAT
Sbjct: 181 EVLPGLGESERHFVENIFSSMCKEETFSGQVKLMDWILQIENPLILGWFLMKGGVMILAT 240
Query: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
WLSQAASEEQTSV+SA LKV CHLPLNKA+PAHMSA+LQGVNKLRFYR+ DISNRAR+LL
Sbjct: 241 WLSQAASEEQTSVVSAVLKVLCHLPLNKAVPAHMSAVLQGVNKLRFYRLPDISNRARVLL 300
Query: 301 SKWSKMFVKIQASKKPNGIRAAG--EQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYV 360
SKWS+MF K QA KKPNG+R AG + +IDL+ R+GE+VGD SWQSSVD+ D +L SYV
Sbjct: 301 SKWSRMFAKSQALKKPNGVRTAGGSQPDIDLSGRLGELVGDGSWQSSVDDHD-ILALSYV 360
Query: 361 GPDDIRKVETLEAMKLLTASSDDSNKRL-LGTTASHNKERRKVQLVEQPGQKAVSNSQSI 420
GPDD RKVETLEA+KLLTASSDD+NK+L LGT+ +HNKERRKVQLVEQPGQKA S SQ +
Sbjct: 361 GPDDTRKVETLEAVKLLTASSDDTNKKLILGTSTAHNKERRKVQLVEQPGQKAGSKSQPV 420
Query: 421 KAVPANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEA 480
KAV ANQRRPITADEIQKAKMRAQFMQSNKRKTEGP KPSLLTNDLLSASEAYLRPKLEA
Sbjct: 421 KAVLANQRRPITADEIQKAKMRAQFMQSNKRKTEGPNKPSLLTNDLLSASEAYLRPKLEA 480
Query: 481 QKKARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGES 540
QKKARLLPPKS KQ++C SD KP FDQKE+L++KCRRVQIPW APPEI L E+G+GES
Sbjct: 481 QKKARLLPPKSSTKQVECPSDHKPTFDQKETLLEKCRRVQIPWRAPPEIQLIAEVGAGES 540
Query: 541 SKETEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCS 600
SKE E QRNRIHREKET Y L +VPPNPKEPWD+E+DYDDTLTPEIP++QLPD+DG +
Sbjct: 541 SKELEGQRNRIHREKETIYRMLHEVPPNPKEPWDIEIDYDDTLTPEIPIDQLPDSDG--A 600
Query: 601 PLASAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLL 660
++ SAD ST SQAEPDLELLAVLLKNP+LVFALTSGQASGLSSEDTVKLLDLL
Sbjct: 601 DHVPENLNDSSADHSTSSQAEPDLELLAVLLKNPDLVFALTSGQASGLSSEDTVKLLDLL 660
Query: 661 KSNGGAALLNG-ASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPS 720
KS+GG ALLNG AS K EEVKVEVSLPSPTPT RN EVSLPSPTPPSRFEVSLPSPTPS
Sbjct: 661 KSSGGEALLNGQASGKSEEVKVEVSLPSPTPT--RNVEVSLPSPTPPSRFEVSLPSPTPS 720
Query: 721 TNPGPNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVIT 780
TNPGP+GWT+T+AAKNPF+R M VQD+ VTTAVI+SDR QN C+LQPQ+ ST P V+
Sbjct: 721 TNPGPSGWTTTQAAKNPFTRHTMAVQDSAVVTTAVITSDRPQNGCTLQPQFASTAPAVVP 780
Query: 781 SLPQHLPCQTH-----------IPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHA 840
SLPQ+ QT IPE FY P QNPPTMNSS ++ +IH+
Sbjct: 781 SLPQYRSSQTQILGPVTHSPSLIPENHLTFYAPNPAQNPPTMNSSIIPSYTRQTRPRIHS 840
Query: 841 H--SKQYLMPEDNN------ARQGLSQSHNSRFSNHVSQNVYN-QTMTGG---PTWERND 900
H +QY + ED++ AR GL SHN F+NHV+QNVYN MTGG PTWE N
Sbjct: 841 HPQPEQYPLHEDSSNTEMVKARHGL--SHNLPFNNHVTQNVYNTPIMTGGQPDPTWENNS 900
Query: 901 YPTGSHKGYESWSPENSPGPSRYPPEHHHGHGHGHGWSYPEQRERGYAPPDRISRHQNWR 960
Y TG KGYE +PENSPGP RYPPEHH HGHGWSYPEQRERGY PPDR+SRHQ+WR
Sbjct: 901 YCTGGQKGYELRNPENSPGPMRYPPEHH----HGHGWSYPEQRERGYGPPDRLSRHQSWR 960
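The percentages in each HSP summary line are the stated count divided by the alignment length, rounded to two decimals. The following is a minimal Python sketch that re-derives them, assuming the summary line keeps the `label = n/m (p%)` layout used in this report. The helper name `hsp_ratios` is hypothetical, and the example values are those of the Beta vulgaris isoform X2 hit above.

```python
import re

# Matches pairs such as "Identity = 744/986 (75.46%)"; the label is left
# generic so the pattern does not depend on how a label is spelled.
PAIR = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def hsp_ratios(line):
    """Return {label: (count, length, computed_pct, reported_pct)}."""
    result = {}
    for label, count, length, reported in PAIR.findall(line):
        computed = round(100 * int(count) / int(length), 2)
        result[label] = (int(count), int(length), computed, float(reported))
    return result


line = "Identity = 744/986 (75.46%), Positives = 824/986 (83.57%), Query Frame = 1"
for label, (count, length, computed, reported) in hsp_ratios(line).items():
    print(f"{label}: {count}/{length} -> {computed:.2f}% (reported {reported:.2f}%)")
```

Note that the denominator (986) is the alignment length including gaps, which is why it can exceed the 959-residue query length.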
BLAST of Spo21974.1 vs. NCBI nr
Match:
gi|731346281|ref|XP_010684371.1| (PREDICTED: homeobox protein LUMINIDEPENDENS isoform X1 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 1355.9 bits (3508), Expect = 0.000e+0
Identity = 740/990 (74.75%), Positives = 821/990 (82.93%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
M++LKDYPSEI+IG+S++SFQKGMD QK LFH QIDQLRNIVVTQCKLTGVNPLSQEMAA
Sbjct: 1 MEVLKDYPSEIDIGNSMESFQKGMDLQKHLFHNQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSINIGK+PRDLLNPKA KYMQAVFS+KDAISKKE REIGALFGLTVTQVREYFA QR
Sbjct: 61 GALSINIGKRPRDLLNPKALKYMQAVFSIKDAISKKELREIGALFGLTVTQVREYFAGQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
SRVR++ RLSRE+AIR S KE+QDGV A++DLM PI+ PLSS+GPSN EE PSCSNQD
Sbjct: 121 SRVRRIVRLSREKAIRTSATKEIQDGVSADADLMHPIDPTPLSSIGPSNAEEVPSCSNQD 180
Query: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
EVLPGLGESERHFV+NIFSSMCKEETFSGQVKLM+WILQIENP IL WFL+KGGVMILAT
Sbjct: 181 EVLPGLGESERHFVENIFSSMCKEETFSGQVKLMDWILQIENPLILGWFLMKGGVMILAT 240
Query: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
WLSQAASEEQTSV+SA LKV CHLPLNKA+PAHMSA+LQGVNKLRFYR+ DISNRAR+LL
Sbjct: 241 WLSQAASEEQTSVVSAVLKVLCHLPLNKAVPAHMSAVLQGVNKLRFYRLPDISNRARVLL 300
Query: 301 SKWSKMFVKIQASKKPNGIRAAG--EQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYV 360
SKWS+MF K QA KKPNG+R AG + +IDL+ R+GE+VGD SWQSSVD+ D +L SYV
Sbjct: 301 SKWSRMFAKSQALKKPNGVRTAGGSQPDIDLSGRLGELVGDGSWQSSVDDHD-ILALSYV 360
Query: 361 GPDDIRKVETLEAMKLLTASSDDSNKRL-LGTTASHNKERRKVQLVEQPGQKAVSNSQSI 420
GPDD RKVETLEA+KLLTASSDD+NK+L LGT+ + +RRKVQLVEQPGQKA S SQ +
Sbjct: 361 GPDDTRKVETLEAVKLLTASSDDTNKKLILGTSTA---QRRKVQLVEQPGQKAGSKSQPV 420
Query: 421 KAVPANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEA 480
KAV ANQRRPITADEIQKAKMRAQFMQSNKRKTEGP KPSLLTNDLLSASEAYLRPKLEA
Sbjct: 421 KAVLANQRRPITADEIQKAKMRAQFMQSNKRKTEGPNKPSLLTNDLLSASEAYLRPKLEA 480
Query: 481 QKKARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGES 540
QKKARLLPPKS KQ++C SD KP FDQKE+L++KCRRVQIPW APPEI L E+G+GES
Sbjct: 481 QKKARLLPPKSSTKQVECPSDHKPTFDQKETLLEKCRRVQIPWRAPPEIQLIAEVGAGES 540
Query: 541 SKETEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCS 600
SKE E QRNRIHREKET Y L +VPPNPKEPWD+E+DYDDTLTPEIP++QLPD+DG +
Sbjct: 541 SKELEGQRNRIHREKETIYRMLHEVPPNPKEPWDIEIDYDDTLTPEIPIDQLPDSDG--A 600
Query: 601 PLASAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLL 660
++ SAD ST SQAEPDLELLAVLLKNP+LVFALTSGQASGLSSEDTVKLLDLL
Sbjct: 601 DHVPENLNDSSADHSTSSQAEPDLELLAVLLKNPDLVFALTSGQASGLSSEDTVKLLDLL 660
Query: 661 KSNGGAALLNG-ASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPS 720
KS+GG ALLNG AS K EEVKVEVSLPS PTPTRN EVSLPSPTPPSRFEVSLPSPTPS
Sbjct: 661 KSSGGEALLNGQASGKSEEVKVEVSLPS--PTPTRNVEVSLPSPTPPSRFEVSLPSPTPS 720
Query: 721 TNPGP----NGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGP 780
TNPGP +GWT+T+AAKNPF+R M VQD+ VTTAVI+SDR QN C+LQPQ+ ST P
Sbjct: 721 TNPGPVRENSGWTTTQAAKNPFTRHTMAVQDSAVVTTAVITSDRPQNGCTLQPQFASTAP 780
Query: 781 GVITSLPQHLPCQTH-----------IPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQS 840
V+ SLPQ+ QT IPE FY P QNPPTMNSS ++
Sbjct: 781 AVVPSLPQYRSSQTQILGPVTHSPSLIPENHLTFYAPNPAQNPPTMNSSIIPSYTRQTRP 840
Query: 841 QIHAH--SKQYLMPEDNN------ARQGLSQSHNSRFSNHVSQNVYN-QTMTGG---PTW 900
+IH+H +QY + ED++ AR GL SHN F+NHV+QNVYN MTGG PTW
Sbjct: 841 RIHSHPQPEQYPLHEDSSNTEMVKARHGL--SHNLPFNNHVTQNVYNTPIMTGGQPDPTW 900
Query: 901 ERNDYPTGSHKGYESWSPENSPGPSRYPPEHHHGHGHGHGWSYPEQRERGYAPPDRISRH 960
E N Y TG KGYE +PENSPGP RYPPEHH HGHGWSYPEQRERGY PPDR+SRH
Sbjct: 901 ENNSYCTGGQKGYELRNPENSPGPMRYPPEHH----HGHGWSYPEQRERGYGPPDRLSRH 960
BLAST of Spo21974.1 vs. NCBI nr
Match:
gi|870854253|gb|KMT06050.1| (hypothetical protein BVRB_7g163790 isoform A [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 843.6 bits (2178), Expect = 3.500e-241
Identity = 435/527 (82.54%), Positives = 479/527 (90.89%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
M++LKDYPSEI+IG+S++SFQKGMD QK LFH QIDQLRNIVVTQCKLTGVNPLSQEMAA
Sbjct: 1 MEVLKDYPSEIDIGNSMESFQKGMDLQKHLFHNQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSINIGK+PRDLLNPKA KYMQAVFS+KDAISKKE REIGALFGLTVTQVREYFA QR
Sbjct: 61 GALSINIGKRPRDLLNPKALKYMQAVFSIKDAISKKELREIGALFGLTVTQVREYFAGQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
SRVR++ RLSRE+AIR S KE+QDGV A++DLM PI+ PLSS+GPSN EE PSCSNQD
Sbjct: 121 SRVRRIVRLSREKAIRTSATKEIQDGVSADADLMHPIDPTPLSSIGPSNAEEVPSCSNQD 180
Query: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
EVLPGLGESERHFV+NIFSSMCKEETFSGQVKLM+WILQIENP IL WFL+KGGVMILAT
Sbjct: 181 EVLPGLGESERHFVENIFSSMCKEETFSGQVKLMDWILQIENPLILGWFLMKGGVMILAT 240
Query: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
WLSQAASEEQTSV+SA LKV CHLPLNKA+PAHMSA+LQGVNKLRFYR+ DISNRAR+LL
Sbjct: 241 WLSQAASEEQTSVVSAVLKVLCHLPLNKAVPAHMSAVLQGVNKLRFYRLPDISNRARVLL 300
Query: 301 SKWSKMFVKIQASKKPNGIRAAG--EQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYV 360
SKWS+MF K QA KKPNG+R AG + +IDL+ R+GE+VGD SWQSSVD+ D+L SYV
Sbjct: 301 SKWSRMFAKSQALKKPNGVRTAGGSQPDIDLSGRLGELVGDGSWQSSVDD-HDILALSYV 360
Query: 361 GPDDIRKVETLEAMKLLTASSDDSNKRL-LGTTASHNKERRKVQLVEQPGQKAVSNSQSI 420
GPDD RKVETLEA+KLLTASSDD+NK+L LGT+ +HNKERRKVQLVEQPGQKA S SQ +
Sbjct: 361 GPDDTRKVETLEAVKLLTASSDDTNKKLILGTSTAHNKERRKVQLVEQPGQKAGSKSQPV 420
Query: 421 KAVPANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEA 480
KAV ANQRRPITADEIQKAKMRAQFMQSNKRKTEGP KPSLLTNDLLSASEAYLRPKLEA
Sbjct: 421 KAVLANQRRPITADEIQKAKMRAQFMQSNKRKTEGPNKPSLLTNDLLSASEAYLRPKLEA 480
Query: 481 QKKARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPP 525
QKKARLLPPKS KQ++C SD KP FDQKE+L++KCRRVQIPW APP
Sbjct: 481 QKKARLLPPKSSTKQVECPSDHKPTFDQKETLLEKCRRVQIPWRAPP 526
BLAST of Spo21974.1 vs. NCBI nr
Match:
gi|645258718|ref|XP_008235016.1| (PREDICTED: homeobox protein LUMINIDEPENDENS isoform X2 [Prunus mume])
HSP 1 Score: 778.9 bits (2010), Expect = 1.000e-221
Identity = 503/1011 (49.75%), Positives = 644/1011 (63.70%), Query Frame = 1
Query: 9 SEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAAGALSINIG 68
SE+EIGSS++S QK +DSQ+QLFH+QIDQL+ +VVTQC LTGVNPLSQEMAAGALS+ IG
Sbjct: 5 SEMEIGSSVESVQKFLDSQRQLFHSQIDQLQKVVVTQCNLTGVNPLSQEMAAGALSVKIG 64
Query: 69 KKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQRSRVRKLTR 128
K+PRDLLNPKA KYMQ+VFS+KDAISKKESRE+ ALFG+T TQVR++F SQRSRVRKL +
Sbjct: 65 KRPRDLLNPKAIKYMQSVFSIKDAISKKESRELSALFGVTGTQVRDFFNSQRSRVRKLVQ 124
Query: 129 LSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQDEVLPGLGE 188
LSRE+A R+S +KELQDGV SD + PI+ PL+SVGPS+VE+APSCS QD+ L GL +
Sbjct: 125 LSREKATRSSEHKELQDGVSTSSDPLTPIDPVPLNSVGPSSVEDAPSCSTQDDALSGLDD 184
Query: 189 SERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILATWLSQAASE 248
++HFVDNIF+ M KEETFSGQVKLMEWILQI+N ++L WFL GGVMILATWLSQAA E
Sbjct: 185 LDKHFVDNIFNLMRKEETFSGQVKLMEWILQIQNSSVLCWFLNTGGVMILATWLSQAAIE 244
Query: 249 EQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILLSKWSKMFV 308
EQTSV+ LKV CHLPL+KALP HMSA+LQ VN+LRFYR D+SNRAR+LLS+WSK+
Sbjct: 245 EQTSVLLVILKVLCHLPLHKALPVHMSAILQSVNRLRFYRTADVSNRARVLLSRWSKLLA 304
Query: 309 KIQASKKPNGIRAAGEQE---IDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYVGPDDIRK 368
+IQ KKPNG++ + + + + L + I E++GD SW+S++D +D+ T + ++ R+
Sbjct: 305 RIQNMKKPNGMKTSSDSQHELVMLKQSIDEVMGDESWKSNIDIPEDIFATPFENAENSRR 364
Query: 369 VETLEAMKLLTASSDDSNKR-LLGTTASHNKERRKVQLVEQPGQKAVSNS-QSIKAVPAN 428
E E +KLLTASSD+SNK+ +LG ++S + RRKVQLVEQPGQK+ S Q +A P +
Sbjct: 365 SEASEPLKLLTASSDESNKKQILGVSSSQFRARRKVQLVEQPGQKSAGRSVQVTRATPVS 424
Query: 429 QRRPITADEIQKAKMRAQFMQS----------NKR-KTEGPRKPSLLTNDLLS-ASEAYL 488
+ RP++AD+IQKAKMRAQFMQS NK KTEG K S +L + +
Sbjct: 425 KGRPMSADDIQKAKMRAQFMQSKYGKSGSSNENKELKTEGGNKLSTSQASILPVVPKVPV 484
Query: 489 RPKLEAQKK--ARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLA 548
R +E KK LL + +++ K D KES+++KC+R+++PW PPEI L
Sbjct: 485 RLDIEEPKKPVTLLLKERETPNRLETSLAPKLRMDLKESILEKCQRIRVPWKTPPEIKLD 544
Query: 549 VE--IGSGESSKETEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVE 608
E +G GE+ KE EVQRNR REKET Y ++++P NPKEPWD+EMDYDD+LTP+IP+E
Sbjct: 545 PEWRVGGGENGKEIEVQRNRNRREKETIYQRVQEIPSNPKEPWDIEMDYDDSLTPDIPIE 604
Query: 609 QLPDADGPCSP-------------LASAETHNPSAD-PSTLSQ-------AEPDLELLAV 668
Q PDADG + +AS++ N +A LSQ AEPDLELLAV
Sbjct: 605 QPPDADGTETQASLSREGNNAQAWVASSQGVNSAASLAPALSQMNGASAAAEPDLELLAV 664
Query: 669 LLKNPELVFALTSGQASGLSSEDTVKLLDLLKSNGGAALLNGASEKQEEVKVEVSLPSPT 728
LLKNPELVFALTSGQA+ LSSEDTVKLLD++KS GGA LNG K E+
Sbjct: 665 LLKNPELVFALTSGQAANLSSEDTVKLLDMIKS-GGAGNLNGLGRKMEQ----------- 724
Query: 729 PTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPGPNGWTSTEAAKNPFSRRAMGVQDTMT 788
R EVSLPSPTPS+NPG +GW + +A N F ++ M +
Sbjct: 725 ------------------RVEVSLPSPTPSSNPGTSGWRA-DAGWNAFPQQ-MATTNKSL 784
Query: 789 VTTAV--ISSDRSQNSCSLQPQY-----------PSTGPGVITSLPQHLPCQT---HIPE 848
V++AV I S R S P Y P+ V+T HL + ++ E
Sbjct: 785 VSSAVRMIPSQRLSTSQPAVPSYSPDYFPPSMQTPAASEMVLTMKNTHLNNLSNSYNVAE 844
Query: 849 YQPAFYDPVVNQNP--------PTMNSSYSRP-TPAH---SQSQIHAHSKQYLMPEDN-N 908
QP + P + P P S +S P P H S+ Q+ P D+
Sbjct: 845 RQPNSFPPPLVTTPARQQRQPQPLQQSRFSEPRLPTHMYPSKPQMGKPGPPPPSPSDSWR 904
Query: 909 ARQGLSQSHNSRFSNHVSQNVYNQTMTG---------GPTWERNDYPTGSHKGYESWSPE 940
ARQ + S + +QN YN + G GP+WE N+ G ++ +ESWSP+
Sbjct: 905 ARQDVP----SNYRYLENQNQYNASYGGPSQQPQLLPGPSWEGNE-RVGGNQDFESWSPD 964
BLAST of Spo21974.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9RS10_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_035080 PE=4 SV=1)
HSP 1 Score: 1862.4 bits (4823), Expect = 0.000e+0
Identity = 959/959 (100.00%), Positives = 959/959 (100.00%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA
Sbjct: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR
Sbjct: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD
Sbjct: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
Query: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT
Sbjct: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
Query: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL
Sbjct: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
Query: 301 SKWSKMFVKIQASKKPNGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYVGP 360
SKWSKMFVKIQASKKPNGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYVGP
Sbjct: 301 SKWSKMFVKIQASKKPNGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYVGP 360
Query: 361 DDIRKVETLEAMKLLTASSDDSNKRLLGTTASHNKERRKVQLVEQPGQKAVSNSQSIKAV 420
DDIRKVETLEAMKLLTASSDDSNKRLLGTTASHNKERRKVQLVEQPGQKAVSNSQSIKAV
Sbjct: 361 DDIRKVETLEAMKLLTASSDDSNKRLLGTTASHNKERRKVQLVEQPGQKAVSNSQSIKAV 420
Query: 421 PANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEAQKK 480
PANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEAQKK
Sbjct: 421 PANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEAQKK 480
Query: 481 ARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGESSKE 540
ARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGESSKE
Sbjct: 481 ARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGESSKE 540
Query: 541 TEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCSPLA 600
TEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCSPLA
Sbjct: 541 TEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCSPLA 600
Query: 601 SAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLLKSN 660
SAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLLKSN
Sbjct: 601 SAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLLKSN 660
Query: 661 GGAALLNGASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPG 720
GGAALLNGASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPG
Sbjct: 661 GGAALLNGASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPG 720
Query: 721 PNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVITSLPQ 780
PNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVITSLPQ
Sbjct: 721 PNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVITSLPQ 780
Query: 781 HLPCQTHIPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHAHSKQYLMPEDNNARQ 840
HLPCQTHIPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHAHSKQYLMPEDNNARQ
Sbjct: 781 HLPCQTHIPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHAHSKQYLMPEDNNARQ 840
Query: 841 GLSQSHNSRFSNHVSQNVYNQTMTGGPTWERNDYPTGSHKGYESWSPENSPGPSRYPPEH 900
GLSQSHNSRFSNHVSQNVYNQTMTGGPTWERNDYPTGSHKGYESWSPENSPGPSRYPPEH
Sbjct: 841 GLSQSHNSRFSNHVSQNVYNQTMTGGPTWERNDYPTGSHKGYESWSPENSPGPSRYPPEH 900
Query: 901 HHGHGHGHGWSYPEQRERGYAPPDRISRHQNWRGSGGGNGGGGGRGGVNRRWNGGDRRR 960
HHGHGHGHGWSYPEQRERGYAPPDRISRHQNWRGSGGGNGGGGGRGGVNRRWNGGDRRR
Sbjct: 901 HHGHGHGHGWSYPEQRERGYAPPDRISRHQNWRGSGGGNGGGGGRGGVNRRWNGGDRRR 959
BLAST of Spo21974.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8ESE0_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_7g163790 PE=4 SV=1)
HSP 1 Score: 1375.1 bits (3558), Expect = 0.000e+0
Identity = 744/986 (75.46%), Positives = 824/986 (83.57%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
M++LKDYPSEI+IG+S++SFQKGMD QK LFH QIDQLRNIVVTQCKLTGVNPLSQEMAA
Sbjct: 1 MEVLKDYPSEIDIGNSMESFQKGMDLQKHLFHNQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSINIGK+PRDLLNPKA KYMQAVFS+KDAISKKE REIGALFGLTVTQVREYFA QR
Sbjct: 61 GALSINIGKRPRDLLNPKALKYMQAVFSIKDAISKKELREIGALFGLTVTQVREYFAGQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
SRVR++ RLSRE+AIR S KE+QDGV A++DLM PI+ PLSS+GPSN EE PSCSNQD
Sbjct: 121 SRVRRIVRLSREKAIRTSATKEIQDGVSADADLMHPIDPTPLSSIGPSNAEEVPSCSNQD 180
Query: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
EVLPGLGESERHFV+NIFSSMCKEETFSGQVKLM+WILQIENP IL WFL+KGGVMILAT
Sbjct: 181 EVLPGLGESERHFVENIFSSMCKEETFSGQVKLMDWILQIENPLILGWFLMKGGVMILAT 240
Query: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
WLSQAASEEQTSV+SA LKV CHLPLNKA+PAHMSA+LQGVNKLRFYR+ DISNRAR+LL
Sbjct: 241 WLSQAASEEQTSVVSAVLKVLCHLPLNKAVPAHMSAVLQGVNKLRFYRLPDISNRARVLL 300
Query: 301 SKWSKMFVKIQASKKPNGIRAAG--EQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYV 360
SKWS+MF K QA KKPNG+R AG + +IDL+ R+GE+VGD SWQSSVD+ D +L SYV
Sbjct: 301 SKWSRMFAKSQALKKPNGVRTAGGSQPDIDLSGRLGELVGDGSWQSSVDDHD-ILALSYV 360
Query: 361 GPDDIRKVETLEAMKLLTASSDDSNKRL-LGTTASHNKERRKVQLVEQPGQKAVSNSQSI 420
GPDD RKVETLEA+KLLTASSDD+NK+L LGT+ +HNKERRKVQLVEQPGQKA S SQ +
Sbjct: 361 GPDDTRKVETLEAVKLLTASSDDTNKKLILGTSTAHNKERRKVQLVEQPGQKAGSKSQPV 420
Query: 421 KAVPANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEA 480
KAV ANQRRPITADEIQKAKMRAQFMQSNKRKTEGP KPSLLTNDLLSASEAYLRPKLEA
Sbjct: 421 KAVLANQRRPITADEIQKAKMRAQFMQSNKRKTEGPNKPSLLTNDLLSASEAYLRPKLEA 480
Query: 481 QKKARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPPEIDLAVEIGSGES 540
QKKARLLPPKS KQ++C SD KP FDQKE+L++KCRRVQIPW APPEI L E+G+GES
Sbjct: 481 QKKARLLPPKSSTKQVECPSDHKPTFDQKETLLEKCRRVQIPWRAPPEIQLIAEVGAGES 540
Query: 541 SKETEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTLTPEIPVEQLPDADGPCS 600
SKE E QRNRIHREKET Y L +VPPNPKEPWD+E+DYDDTLTPEIP++QLPD+DG +
Sbjct: 541 SKELEGQRNRIHREKETIYRMLHEVPPNPKEPWDIEIDYDDTLTPEIPIDQLPDSDG--A 600
Query: 601 PLASAETHNPSADPSTLSQAEPDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLL 660
++ SAD ST SQAEPDLELLAVLLKNP+LVFALTSGQASGLSSEDTVKLLDLL
Sbjct: 601 DHVPENLNDSSADHSTSSQAEPDLELLAVLLKNPDLVFALTSGQASGLSSEDTVKLLDLL 660
Query: 661 KSNGGAALLNG-ASEKQEEVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPS 720
KS+GG ALLNG AS K EEVKVEVSLPSPTPT RN EVSLPSPTPPSRFEVSLPSPTPS
Sbjct: 661 KSSGGEALLNGQASGKSEEVKVEVSLPSPTPT--RNVEVSLPSPTPPSRFEVSLPSPTPS 720
Query: 721 TNPGPNGWTSTEAAKNPFSRRAMGVQDTMTVTTAVISSDRSQNSCSLQPQYPSTGPGVIT 780
TNPGP+GWT+T+AAKNPF+R M VQD+ VTTAVI+SDR QN C+LQPQ+ ST P V+
Sbjct: 721 TNPGPSGWTTTQAAKNPFTRHTMAVQDSAVVTTAVITSDRPQNGCTLQPQFASTAPAVVP 780
Query: 781 SLPQHLPCQTH-----------IPEYQPAFYDPVVNQNPPTMNSSYSRPTPAHSQSQIHA 840
SLPQ+ QT IPE FY P QNPPTMNSS ++ +IH+
Sbjct: 781 SLPQYRSSQTQILGPVTHSPSLIPENHLTFYAPNPAQNPPTMNSSIIPSYTRQTRPRIHS 840
Query: 841 H--SKQYLMPEDNN------ARQGLSQSHNSRFSNHVSQNVYN-QTMTGG---PTWERND 900
H +QY + ED++ AR GL SHN F+NHV+QNVYN MTGG PTWE N
Sbjct: 841 HPQPEQYPLHEDSSNTEMVKARHGL--SHNLPFNNHVTQNVYNTPIMTGGQPDPTWENNS 900
Query: 901 YPTGSHKGYESWSPENSPGPSRYPPEHHHGHGHGHGWSYPEQRERGYAPPDRISRHQNWR 960
Y TG KGYE +PENSPGP RYPPEHH HGHGWSYPEQRERGY PPDR+SRHQ+WR
Sbjct: 901 YCTGGQKGYELRNPENSPGPMRYPPEHH----HGHGWSYPEQRERGYGPPDRLSRHQSWR 960
BLAST of Spo21974.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8BWY5_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_7g163790 PE=4 SV=1)
HSP 1 Score: 843.6 bits (2178), Expect = 2.400e-241
Identity = 435/527 (82.54%), Positives = 479/527 (90.89%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
M++LKDYPSEI+IG+S++SFQKGMD QK LFH QIDQLRNIVVTQCKLTGVNPLSQEMAA
Sbjct: 1 MEVLKDYPSEIDIGNSMESFQKGMDLQKHLFHNQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSINIGK+PRDLLNPKA KYMQAVFS+KDAISKKE REIGALFGLTVTQVREYFA QR
Sbjct: 61 GALSINIGKRPRDLLNPKALKYMQAVFSIKDAISKKELREIGALFGLTVTQVREYFAGQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
SRVR++ RLSRE+AIR S KE+QDGV A++DLM PI+ PLSS+GPSN EE PSCSNQD
Sbjct: 121 SRVRRIVRLSREKAIRTSATKEIQDGVSADADLMHPIDPTPLSSIGPSNAEEVPSCSNQD 180
Query: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
EVLPGLGESERHFV+NIFSSMCKEETFSGQVKLM+WILQIENP IL WFL+KGGVMILAT
Sbjct: 181 EVLPGLGESERHFVENIFSSMCKEETFSGQVKLMDWILQIENPLILGWFLMKGGVMILAT 240
Query: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
WLSQAASEEQTSV+SA LKV CHLPLNKA+PAHMSA+LQGVNKLRFYR+ DISNRAR+LL
Sbjct: 241 WLSQAASEEQTSVVSAVLKVLCHLPLNKAVPAHMSAVLQGVNKLRFYRLPDISNRARVLL 300
Query: 301 SKWSKMFVKIQASKKPNGIRAAG--EQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYV 360
SKWS+MF K QA KKPNG+R AG + +IDL+ R+GE+VGD SWQSSVD+ D+L SYV
Sbjct: 301 SKWSRMFAKSQALKKPNGVRTAGGSQPDIDLSGRLGELVGDGSWQSSVDD-HDILALSYV 360
Query: 361 GPDDIRKVETLEAMKLLTASSDDSNKRL-LGTTASHNKERRKVQLVEQPGQKAVSNSQSI 420
GPDD RKVETLEA+KLLTASSDD+NK+L LGT+ +HNKERRKVQLVEQPGQKA S SQ +
Sbjct: 361 GPDDTRKVETLEAVKLLTASSDDTNKKLILGTSTAHNKERRKVQLVEQPGQKAGSKSQPV 420
Query: 421 KAVPANQRRPITADEIQKAKMRAQFMQSNKRKTEGPRKPSLLTNDLLSASEAYLRPKLEA 480
KAV ANQRRPITADEIQKAKMRAQFMQSNKRKTEGP KPSLLTNDLLSASEAYLRPKLEA
Sbjct: 421 KAVLANQRRPITADEIQKAKMRAQFMQSNKRKTEGPNKPSLLTNDLLSASEAYLRPKLEA 480
Query: 481 QKKARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWAPP 525
QKKARLLPPKS KQ++C SD KP FDQKE+L++KCRRVQIPW APP
Sbjct: 481 QKKARLLPPKSSTKQVECPSDHKPTFDQKETLLEKCRRVQIPWRAPP 526
BLAST of Spo21974.1 vs. UniProtKB/TrEMBL
Match:
F6HQ00_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_07s0104g01580 PE=4 SV=1)
HSP 1 Score: 768.1 bits (1982), Expect = 1.300e-218
Identity = 465/868 (53.57%), Positives = 589/868 (67.86%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
M++LK+ SEI+IG+S SF+K +DSQ +LF++Q+DQL +IV+ QC+LTGVNPLSQEMAA
Sbjct: 1 MEVLKENISEIDIGTSTASFKKFVDSQNELFNSQVDQLGSIVLKQCELTGVNPLSQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSI IGK+PRDLLNPKA KYMQAVFS+KDAISKKESREI ALFG+TVTQVRE+FA QR
Sbjct: 61 GALSIKIGKRPRDLLNPKAVKYMQAVFSIKDAISKKESREISALFGVTVTQVREFFAGQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVEEAPSCSNQD 180
SRVRK+ RLSRE+++R+ KELQDGV SD M+PI+ APL+S+GPS+ EE PSCS Q
Sbjct: 121 SRVRKVVRLSREKSVRSDVCKELQDGVLIPSDPMIPIDQAPLNSIGPSSAEEVPSCSTQA 180
Query: 181 EVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILAT 240
E L GL +SER+F++NIF+ M KEETFSGQV+LMEWILQ++N ++L+WFL KGG+MILAT
Sbjct: 181 EALHGLDDSERYFLENIFTLMRKEETFSGQVELMEWILQMQNSSVLNWFLSKGGMMILAT 240
Query: 241 WLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARILL 300
WLSQAA+EEQTSV+ LKV CHLPL+KALP HMSA+L VN+LRFYR DISNRAR+LL
Sbjct: 241 WLSQAANEEQTSVLLVILKVLCHLPLHKALPVHMSAILHSVNRLRFYRTSDISNRARVLL 300
Query: 301 SKWSKMFVKIQASKKPNGIRAAGE--QEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSYV 360
S+WSKM +IQ K N + + + +EI + + IGEI+GD SW+S ++ L
Sbjct: 301 SRWSKMLARIQPIKTSNSAKLSSDAQREIIMKQSIGEIMGDESWKSEINIPGQALAPFCE 360
Query: 361 GPDDIRKVETLEAMKLLTASSDDSNKRLL-GTTASHNKERRKVQLVEQPGQKAVSNS-QS 420
+ +RK+E L+A+KLL +S++D+N++ + G ++S +ERRKVQLVEQPGQK Q
Sbjct: 361 NSETVRKLEPLQALKLLPSSAEDTNRKSIRGVSSSQTRERRKVQLVEQPGQKTAGRILQP 420
Query: 421 IKAVPANQRRPITADEIQKAKMRAQFMQSNKRK------------TEGP--RKPSLLTND 480
+AVP + RP++AD+IQKAKMRAQFMQS K +EGP + S T+
Sbjct: 421 GRAVPVSHGRPMSADDIQKAKMRAQFMQSKYGKIGSSSKDKHEANSEGPSSKSSSSQTST 480
Query: 481 LLSASEAYLRPKLEAQKKARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWA 540
LLS S+A+ RPK+E KK LPP++ K + +P + E+L +KC++VQIPW A
Sbjct: 481 LLSVSKAHGRPKIEENKKPVTLPPRASNKVE---ASPQPKLELMETLFEKCKKVQIPWQA 540
Query: 541 PPEIDL--AVEIGSGESSKETEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTL 600
PPEI A +G+GESSKE EVQ+NRI REKET Y L+D+PPNPKEPWD+EMDYDD+L
Sbjct: 541 PPEIRFNPAWRVGTGESSKEVEVQKNRIRREKETVYEALQDIPPNPKEPWDLEMDYDDSL 600
Query: 601 TPEIPVEQLPDAD-------------GP----------CSPLASAETHNPSADPSTLSQA 660
TP IP+EQ PDAD GP +P S+ +H +A S +S A
Sbjct: 601 TPVIPIEQPPDADSAAESPIPPEPVVGPGETEKIAVAVVAPEPSSSSHAGNASSSNISSA 660
Query: 661 E-PDLELLAVLLKNPELVFALTSGQASGLSSEDTVKLLDLLKSNGGAAL--LNGASEKQE 720
PD ELL+VLLKNPELVFAL +GQA LSSEDTV+LLD++K+NG +L LNG K E
Sbjct: 661 ALPDFELLSVLLKNPELVFALMNGQAGSLSSEDTVRLLDMIKANGVGSLGTLNGLGRKAE 720
Query: 721 EVKVEVSLPSPTPTPTRNAEVSLPSPTPPSRFEVSLPSPTPSTNPGPNGWTSTEAAKNPF 780
E KVEVSLPSPTP+ S P P PS GW E AKNPF
Sbjct: 721 E-KVEVSLPSPTPS--------------------SNPVPVPS------GW-RPEFAKNPF 780
Query: 781 SRRAMGVQD-TMTVTTAVISSDRSQNSCSLQPQYPSTGPGVITSLPQH---LPCQTHI-- 815
SR+ + V M ++ + S+ TGP LP LP QT
Sbjct: 781 SRQGLTVNSRDMYASSPGVDFTGPARQVSM-ANIDITGPPPQRQLPATNLVLPPQTPAVI 836
BLAST of Spo21974.1 vs. UniProtKB/TrEMBL
Match:
A0A061DY79_THECC (Homeodomain-like superfamily protein, putative OS=Theobroma cacao GN=TCM_004566 PE=4 SV=1)
HSP 1 Score: 767.7 bits (1981), Expect = 1.700e-218
Identity = 451/850 (53.06%), Positives = 583/850 (68.59%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
MD+LK+ +E+EIG++++S Q +D Q++LFH+QIDQL+NIVVTQCKLTGVNPL+QEMAA
Sbjct: 1 MDVLKENLAEVEIGNTVESLQNFIDLQRELFHSQIDQLQNIVVTQCKLTGVNPLAQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSI IGK+PRDLLNPKA KYMQAVFS+KDAISKKESREI ALFG+T+TQVR++FASQR
Sbjct: 61 GALSIKIGKRPRDLLNPKAVKYMQAVFSIKDAISKKESREISALFGVTLTQVRDFFASQR 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVP-AESDLMLPIETAPLSSVGPSNVEEAPSCSNQ 180
+RVRK RLSRE+A+R++ KE ++GV +ESD M+P+E PL+SVGP N EEAPSCS
Sbjct: 121 TRVRKQVRLSREKAVRSNACKETEEGVVLSESDAMIPVEPVPLNSVGPVNAEEAPSCSTL 180
Query: 181 DEVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGGVMILA 240
D+ L G+ E ++HFV+NIF+ M KEETFSGQVKL+EWILQI+NP++L WFL KGGVMILA
Sbjct: 181 DDALTGIDELDKHFVENIFTKMRKEETFSGQVKLLEWILQIQNPSVLYWFLTKGGVMILA 240
Query: 241 TWLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISNRARIL 300
TWLSQAA EEQT+V+ LKV CHLPL KALP MSA+LQ VNKL YR DIS+RAR+L
Sbjct: 241 TWLSQAAVEEQTTVLFIILKVLCHLPLQKALPEQMSAILQSVNKLCLYRFSDISHRARLL 300
Query: 301 LSKWSKMFVKIQASKKPNGIR--AAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDMLVTSY 360
+S+WSKMF + QA+KKPNG++ A + E+ L + I EI+GD WQS+VDNS+++L TS
Sbjct: 301 ISRWSKMFARSQAAKKPNGLKSSADAQNELLLKQSISEIMGDEPWQSNVDNSEEILATS- 360
Query: 361 VGPDDIRKVETLEAMKLLTASSDDSNKR-LLGTTASHNKERRKVQLVEQPGQK-AVSNSQ 420
++RK+E+ + +KLL AS DDS K+ +LG + SH++ERRKVQLVEQPGQK A +SQ
Sbjct: 361 ----NVRKLESPQVLKLLPASMDDSTKKNILGVSGSHSRERRKVQLVEQPGQKMAGKSSQ 420
Query: 421 SIKAVPANQRRPITADEIQKAKMRAQFMQS------------NKRKTEGPRKPSLLTNDL 480
+ + VP +Q RP++AD+IQKAKMRA +MQS N+ K+EG KPS
Sbjct: 421 TTRTVPISQSRPMSADDIQKAKMRALYMQSKYGKTGSSSNGMNEAKSEGLNKPSTSQASF 480
Query: 481 L-SASEAYLRPKLEAQKKARLLPPKSIIKQMDCLSDQKPAFDQKESLMDKCRRVQIPWWA 540
S+ ++RP E QKK +LPPK+ + CL D K D KE +KC++V+IPW
Sbjct: 481 SPPVSKVHVRP-AEEQKKPVILPPKTSNRLGTCL-DPKQNMDSKEPPWEKCQKVKIPWHT 540
Query: 541 PPEIDL--AVEIGSGESSKETEVQRNRIHREKETTYPTLEDVPPNPKEPWDVEMDYDDTL 600
PPE+ L +G+GE+SKE +VQ+NR RE+ET Y T++++P NPKEPWD EMDYDDTL
Sbjct: 541 PPEVKLNELWRVGAGENSKEVDVQKNRNRRERETFYYTIQEIPSNPKEPWDREMDYDDTL 600
Query: 601 TPEIPVEQLPDADGPCSPLASAETHNPSADPSTLSQ-------AEPDLELLAVLLKNPEL 660
TPEIP EQ PD D + + E N +A + S AEPDLELLAVLLKNP L
Sbjct: 601 TPEIPTEQPPDTDSTETQVTHGEHVNSAATLAPSSSHIGGGVAAEPDLELLAVLLKNPAL 660
Query: 661 VFALTSGQASGLSSEDTVKLLDLLKSNGGAALLNGASEKQEEVKVEVSLPSPTPTPTRNA 720
VFALTSGQA L+SE+TVKLLD++K+ GGA N + EE
Sbjct: 661 VFALTSGQAGNLTSEETVKLLDMIKA-GGAGNSNNIGKNVEE------------------ 720
Query: 721 EVSLPSPTPPSRFEVSLPSPTPSTNPGPNGWTSTEAAKNPFSRRAM----GVQDTMTVTT 780
+ EVSLPSPTPS+NPG +GW EA +NPFS+++ Q ++ V T
Sbjct: 721 -----------KVEVSLPSPTPSSNPGTSGW-KPEAVRNPFSQQSQIGNTVAQASLGVGT 780
Query: 781 AVISSDRSQNSCSLQPQYPSTG----PGVITSLPQHLPCQTHIPEYQPAFYDPVVNQNPP 816
++R + PQ + G + ++ Q LP Q + P Q+P
Sbjct: 781 TTPVAERLPATSMAAPQQDANGQLLAQQLAAAIAQLLP--------QSSAMTPEKRQSPN 804
BLAST of Spo21974.1 vs. ExPASy Swiss-Prot
Match:
LUMI_ARATH (Homeobox protein LUMINIDEPENDENS OS=Arabidopsis thaliana GN=LD PE=1 SV=2)
HSP 1 Score: 465.3 bits (1196), Expect = 1.600e-129
Identity = 254/461 (55.10%), Positives = 341/461 (73.97%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
MD K+ EIEIGSS++S + +DSQK LFH+QIDQL+++VV QCKLTGVNPL+QEMAA
Sbjct: 1 MDAFKE---EIEIGSSVESLMELLDSQKVLFHSQIDQLQDVVVAQCKLTGVNPLAQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSI IGK+PRDLLNPKA KY+QAVF++KDAISK+ESREI ALFG+TV QVRE+F +Q+
Sbjct: 61 GALSIKIGKRPRDLLNPKAVKYLQAVFAIKDAISKRESREISALFGITVAQVREFFVTQK 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVE------EAP 180
+RVRK RLSRE+ + ++ + DGVP ++ +E PL+S+ P E
Sbjct: 121 TRVRKQVRLSREKVVMSNTHALQDDGVPENNNATNHVEPVPLNSIHPEACSISWGEGETV 180
Query: 181 SCSNQDEVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGG 240
+ +++ P + +S+++FV+NIFS + KEETFSGQVKLMEWI+QI++ ++L WFL KGG
Sbjct: 181 ALIPPEDIPPDISDSDKYFVENIFSLLRKEETFSGQVKLMEWIMQIQDASVLIWFLSKGG 240
Query: 241 VMILATWLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISN 300
V+IL TWLSQAASEEQTSV+ LKV CHLPL+KA P +MSA+LQ VN LRFYR+ DISN
Sbjct: 241 VLILTTWLSQAASEEQTSVLLLILKVLCHLPLHKASPENMSAILQSVNGLRFYRISDISN 300
Query: 301 RARILLSKWSKMFVKIQASKKP--NGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDM 360
RA+ LLS+W+K+F KIQA KK N + + ++ L + I EI+GD S N +D+
Sbjct: 301 RAKGLLSRWTKLFAKIQAMKKQNRNSSQIDSQSQLLLKQSIAEIMGDSS------NPEDI 360
Query: 361 LVTSYVGPDDIRKVETLEAMKLLTASSDDSNKR-LLGTTASHNKERRKVQLVEQPGQKAV 420
L S +++R++E+ + KLL S+DDS K+ +LG+ S+NKERRKVQ+VEQPGQKA
Sbjct: 361 LSLSNGKSENVRRIESSQGPKLLLTSADDSTKKHMLGSNPSYNKERRKVQMVEQPGQKAA 420
Query: 421 SNS-QSIKAVPANQRRPITADEIQKAKMRAQFMQSNKRKTE 452
S Q+++ + + RP++AD+IQKAKMRA +MQS K +
Sbjct: 421 GKSPQTVRIGTSGRSRPMSADDIQKAKMRALYMQSKNSKKD 452
BLAST of Spo21974.1 vs. TAIR (Arabidopsis)
Match:
AT4G02560.1 (Homeodomain-like superfamily protein)
HSP 1 Score: 465.3 bits (1196), Expect = 9.000e-131
Identity = 254/461 (55.10%), Positives = 341/461 (73.97%), Query Frame = 1
Query: 1 MDLLKDYPSEIEIGSSIDSFQKGMDSQKQLFHTQIDQLRNIVVTQCKLTGVNPLSQEMAA 60
MD K+ EIEIGSS++S + +DSQK LFH+QIDQL+++VV QCKLTGVNPL+QEMAA
Sbjct: 1 MDAFKE---EIEIGSSVESLMELLDSQKVLFHSQIDQLQDVVVAQCKLTGVNPLAQEMAA 60
Query: 61 GALSINIGKKPRDLLNPKAAKYMQAVFSMKDAISKKESREIGALFGLTVTQVREYFASQR 120
GALSI IGK+PRDLLNPKA KY+QAVF++KDAISK+ESREI ALFG+TV QVRE+F +Q+
Sbjct: 61 GALSIKIGKRPRDLLNPKAVKYLQAVFAIKDAISKRESREISALFGITVAQVREFFVTQK 120
Query: 121 SRVRKLTRLSRERAIRASGNKELQDGVPAESDLMLPIETAPLSSVGPSNVE------EAP 180
+RVRK RLSRE+ + ++ + DGVP ++ +E PL+S+ P E
Sbjct: 121 TRVRKQVRLSREKVVMSNTHALQDDGVPENNNATNHVEPVPLNSIHPEACSISWGEGETV 180
Query: 181 SCSNQDEVLPGLGESERHFVDNIFSSMCKEETFSGQVKLMEWILQIENPTILSWFLIKGG 240
+ +++ P + +S+++FV+NIFS + KEETFSGQVKLMEWI+QI++ ++L WFL KGG
Sbjct: 181 ALIPPEDIPPDISDSDKYFVENIFSLLRKEETFSGQVKLMEWIMQIQDASVLIWFLSKGG 240
Query: 241 VMILATWLSQAASEEQTSVISAALKVFCHLPLNKALPAHMSALLQGVNKLRFYRVRDISN 300
V+IL TWLSQAASEEQTSV+ LKV CHLPL+KA P +MSA+LQ VN LRFYR+ DISN
Sbjct: 241 VLILTTWLSQAASEEQTSVLLLILKVLCHLPLHKASPENMSAILQSVNGLRFYRISDISN 300
Query: 301 RARILLSKWSKMFVKIQASKKP--NGIRAAGEQEIDLTRRIGEIVGDFSWQSSVDNSDDM 360
RA+ LLS+W+K+F KIQA KK N + + ++ L + I EI+GD S N +D+
Sbjct: 301 RAKGLLSRWTKLFAKIQAMKKQNRNSSQIDSQSQLLLKQSIAEIMGDSS------NPEDI 360
Query: 361 LVTSYVGPDDIRKVETLEAMKLLTASSDDSNKR-LLGTTASHNKERRKVQLVEQPGQKAV 420
L S +++R++E+ + KLL S+DDS K+ +LG+ S+NKERRKVQ+VEQPGQKA
Sbjct: 361 LSLSNGKSENVRRIESSQGPKLLLTSADDSTKKHMLGSNPSYNKERRKVQMVEQPGQKAA 420
Query: 421 SNS-QSIKAVPANQRRPITADEIQKAKMRAQFMQSNKRKTE 452
S Q+++ + + RP++AD+IQKAKMRA +MQS K +
Sbjct: 421 GKSPQTVRIGTSGRSRPMSADDIQKAKMRALYMQSKNSKKD 452
The following BLAST results are available for this feature: