Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
AAAAAAATCCTATCCATGTGATTCTTAATGTATTATCAAAACCCAATCAGCAAACTTAGTGTAGTTTGATGTTTGCATAGAGTCGGTCTATGGCTGATCCAGTATTACGTAGAATGACAGAACACCATAAACAAAAAATATCTTTGACTTGAGAATACCAGAACACCATAAACAAAAAATATCCATGTGAACCTAAATCAAAATCCTAAAGTAAAAAACCTAAATCAAAATCCTAAATCAAAATCCTAAAATAAAAAACCTAAATCAAAATCCTAAATCAAAATTAAAATTAAAAAACTCTACCAATTTCAGCAACATGAATCAATTTCAATTAGAAACATTTCGCAAGATAAAATGAGCACGGAAAATCAAGAGAAGAGAGAACAACAAACCTCTACTCTTCGCTGTCCTACCCCGTCGGCCACCATTATTCGGTCTACAGAAGGTGAAGGAACGCGCATCAACGTCGGCAAGGAGGTGACGGAGACGGTGGTTGAGGTTGAGACGGCTTGGCTGGTGAGTTTTTCTGGGTGCAATTACTGAGGGAGAGGTTGGGCCTCAATAGCTGGCAAAGGTAAGAGGTTGGCAGAGAAAGGAGTTCCAACGTGCACCAAAATAGGGGTAAAATTGGTAGTTCATGTAAATGTGTTTCAACGCACGTATCGGATAGCCGCTGCGCCATCCTCTACGGGACCTAGCTAGTATCAACAACTTCCCATCCTGTTCTCAAACTCTTTTCCCAAAAGTTTTCATTCCCACGAACCAAGTCATCAACTGTGTTTCTCACTCCCAAATTCCAAGTTTCTAACCCATTGGCACAACGAAATTCAATTAATTGGAACAAATTCGAAACCCTAACCGCCATTTCCGAACCTTGACATGCTACCAATCCTTCCCCATTTACCTTTTTAATGGAGAATAGCTACTCGGTATCACCCAATCTATTTGATCAAGTTCAGATCTTGTTGAAGAAAGAAGCTGGCTTTGATTCTATCAACCTTAATGATCACTCACTTGATTCTCCACCATCGTCATTTTTCGAAGAAGCTGTCGCTGGATCAAAGCCCTCTTCTGATGCTTTGCGATGTAAGAACTGCAGCGGAGAGCTATTGCGAGGGTCGGATTCTATAATTTGTGTATATTGTGGGCATGCGCGGCAGCAAGATTTTTTTCGTCAGCCGATTTCTTTCAAAAATTCCTTCGCGTTTCAGTGGTTTTTGAAGTCTCTTGATTTGGATGGATTCGTAAGTAAACATTCTGTTTTCTTTCCATTAATTTTTGCTAGGCGTTGACTTGTTTTTGTACTACATTCATTGTTTGTAATTTTGTTTTGTGGAAACCGTATTTTGAATATGTAGTTATGATTGAATATGGGGTTGCAGGAAAGTGTTGGGATACCGGTTAGAGATAATCAATTTGGGGATAGAGGGCGGAGTTCGATCAAGGAGGACGAAATCACGTTATCGGATTTCCTTAATCTCCAGCTAAAGTGGTCTGATGATTCAAGAGAATTTGGTGTGGAATCTGAAGAACGGCTTCCTGGGAAAAGTTATTTGAGTTTAATTGGTGTAGATATAGAGGATATTTCAGGTCAGGAAAGTATTGATAATGGTTTTATTGTTCGCAATGAACATGTGGTGGTGAGTCGAAAATTTGGAAATGAAGAAAGTGGAATTTCAGGAAAGAGGAGTCTTAGTTTGTTTGAAAATACGCAGTTTACTGAGGCTGCTTCCAAGTCTGGCGAGGGTGAGAATACCGAGTCCTTTTCTGATTGGGGAGCGGAGTTTCAATCTGCCGTCCTAGAGTCTGGATCTGAGAAATCAGCCAATAGCTTACTTTCTGATGCTCTTTCAGGAAGTATGACTATTTCTAGTGACAATCTTTGGAGTAACTTGGAGTCTGTACAACCTTTCAACATTAGGGTATCTGACGGGAATATCTTGGTTGAAGGTGGTGGAAAGGCCACTGAACCAGTTGATCCCTCTTTTGATTGGTCACCA
GCAAATGAACAACCTACCATGTTGGACAATTCCCCTAATATTGAGAAAAGAGATGGTGATGATTCTTTTTGTGACTGGGGGGACTTTAGGATGTCTATAAATGATAGTATTACTCAAAGCGAACCAAGCAGTGCAGCTGTGACTGGTACTGATAGTAGCAGTAGACTCTTTACTTCTGACCAATTTGATAATTTCACTGGTTTTGTGGTTTCAAAAAGCATCCAATTCCCGACCGATTCTTTTAAAGGGCCTGATAATGCCTGGAACAGTTTTACAACATCAAGTACTAGAGGAGATATTCAATCGCAGATCAACAGCGATAAGACAGAGGAGCATGCTGGTATAGGTGATCCAGCAATGGATGGAGAAAATGATCTTGCTCCGTTGGATGTTACTTACTTTGAACACGAAGGTATCAATGTTGAAAGGACCCAGAATAAGATATCTGATCTAGGTGATTCAATCGGGTCATGGAATAATGATGTACACTTAGGTATTGTAGGTGGACTTGATCAGTTGCAAGGCAGAGAAAGCAAGTCACCTGCAAGTAGGACTACCAATAACCTTCATAATTCATTTGACTTGTGGAATGATTTTAAGGGTTCTGACAATCAACACCAAGCAATTACTAAAGAAGAAGCTGATAACAAAATATCAACTGAAGATGTAAACAGATGTGATGAATGGAATGATTTTGCTGGACCTAGTGTTGGGAAAAAGGATCAGCAAGCCAGCAACCTAGAAGTGGTAGATCGGAAGGGAAATGGTAATCAAAGTGATTCATTGGCTTTGTGGAACGATAACTCGCATGATGCATGGAGTGATTTTGCAGGACGTGGTGTTTTTGACAGTAAGCCGATTAGTAATGATATTTCAGCTGTGGTATGGAATGACTTTTCAAGTATAATAGCAAGTGATGCAAATCAAACTCAAAGTAGCGACACAAGAAACTCTGAAAATCAAGCAGCACCCGAGAATGATAACCCATTTAGTTCATGGAGTGATTTCCGGAGCTCGGGAACTCATACAGGAAATGGGTTTTTGGGACCGGATGATAACAACGCTAATTCAGATCAAATCAGCTTCCAGAAAGATCCTAACAGCCTAATAAAGGTTGAGGGCAATAGCACATTCAGTTCATCGGGTGATATTGGGGGCTCAACATCCATTTTAGAACCGCAACTGCAAGCTGCAGTAGCAGGATCTCCTGAAAATATATTGAGCAGTGGAGATGGTTATTTGTTTTCAGGATTTAGTAACAGTGGCAAACAATTTATTAGTCAACCAGACAATCAACTCCGGTTTGATGGAGTTGCAATACATAATGGAGAATCTGCTCCTGAAAGTGATGATCCATTTGGTGCATGGTCTGATTTTACGAGTTCAAGTAATATGCAAGCAAATCTTAGTAGCTCAGCGAATGCACCTGATGACAAAGAAGCTGTTAAAGGTGATCTGTTTGATGCATGGAATGTAATCGTTAGCTCCCCCAATATGGTAACTGGTAAAGACACCAATGTCACAATGGTAAACACACCAGAATTAAATCATTTCAGCCCCAATATGAAGTCACAATATGCGGGGCCTAACAGATCGATTCCCTTGGATCTAAATGTTGGAATGTTCACTCACCAAAATGGTTCTGCACCAGATGGATTTTCACCACCTGAAGGTCCTGATTTGGTTAGGTAAGTGTGCTGCGATTTATCATCTTTATAGCGTCATTCTGTGTTGGTTTCTCTAATGTCTTGTCCCTTTCTTTTAACTTTGAGATTATGATTTATGACCATGCCTTAATTTTTTGAGTTACACTAGCCCATATGTTATATCGAAAGATGTTTGTACTAGTTGTTTCTCTAGTTTCATTTTGTGCATTAGTGTGAATCCCTCTCCCCTCCCCCGCNCCCCCCCCCCCCCCCCCCTCCATGTGGGACCCGCTCCTTGTATTGCGAATATGTCCTCTTTACATAGATCACTGATAGACTATTCTGTTGGTCGTA
CTTCCATTTATAATTATTACCAAGTATTAGGTGTAGCAATCTTTTGAGAGGCTTTTCTGTTGCTGTCACAATCTGTTTGATCACTGTAACAATGCATCAAAATGTATGAATTGTATAGTGCAAATTTTATGATATACTTCGTATATACTTTTGATTTACCACGTTTAATAGTTTGGGGAAATTAGCTTACAACTAGTATTAAAGCCAATCAAACATGATATCTATATGTCCTGTTCAAGATCTGAAAATTTGACCTGATTTCTATTTTATTATCGGAACTAGAATTACTGGACATCCCAAGATCCATTGGGTTTCCCGTACAGAAATTTTGTCTTGACGATTGACATTGAGGAGCTCTGTTTCCGGACTCTGCATTTTTGCTGTCATTCAGATCCGACAACTTGCTCTCGGACACGGCTAAGAAGTTGACCCATCAATCTTTAACAAAAAGGAATGATTCTTTAGGGCTTTTAGTCGCCATAGCCAGACAACTCCACTGTTGTGTAAATGGCCGGGTAGTGGCCAAATAGACGAGTAAGGCGGAGGAGAGAGAAAACCCTTAACTCCTTTTCTCTCTTTGATTTTTTTTCTGTTTGTGTTTCTCTCTATTCACTGTCTCTCTCTTCACTGTTCCCTCTTCAACTCTTTTCTTTCTTTCTTTTTTTTAATATTTTTAACTTTTGACTTCATCCTCTACTTCTACAATCTGGAAAGTGTTGGTAGTCACGTGGGCCCCACTACCCTTGTTTTTCTTTTCTTTTCTTTTCTTTTTACTTAACTTTTTCTTTTAAAAATTACTCCCTTCGTATTTTTTGAGAAGTTACATGTTCACTGGCACAAAGATTAAAAAAGGTGAGTTGATTTTCTTATAATAATAATGATGCAACTAGAATAGTGGGCAATAAAAAAAATACTTCATCTGTTTGTTTTTACTCGCAACATTAGTCATTTTCACGCATGCCAATGTACAACTTTGATCATTAATATCTTTAAGAAAAAATTATAAAAAGTTTATATTTTGAAAATACACATTGAGACGAATCTAACAAAATCCCATATGACTATGTTTTATCTTACATATAAATCACCAACAATAGGCAAAATAGAATATATGAATAGTGTAAAAAATATAAACGTTTCGAGTAATTAAAAAATGGAGGAAGTATAAGAAAAGAAAGTGGTGTGAATAATTTAATCAAAGAAAAAGGAAGTAAACTCATGGACAATTGTTTATAAGTGAGACCCATGAGAAAGTGAGAAATAACAAACAATGGAGGGGAGTGGGGACTTTGTGTGTAATATTTTAATATTTTAATGTGGCGGAGATGCGCATGTTACATTGGATGTGTGGGCATATAAGAAATGATCGTTTGAGTAATGAGATTATTAGGAAGAAAGTAGGGTTGCACTGATTGATTTTAAGATGATGGAAAATCGTTTAAGATGGTTTGGGCATGTAAGCAGAAGATCAAGTGATGCCTTGGTTGGCAAAGTGATAGAATTGCAAGGGGTAGGGGAAGACCTAAGAAAACCTGGAGGAAGGTGATTGAGCATGATATGAGCCTAATTGGGATTGAGGAAAATATGGCGTTGGATAGGACAGAGTGGTTGAAAATTTATTAGAAACTGTTATATTATTTTAGAAAAATTGCTTGATACTTCTTCCGTATTTTAAAAAGAGATACACTTTGACCGGCATATAATTTTAGAAGAGTGAGTTGAATTTATTAAAATAAGATGAAAGTGGGGTAGTGGGTGAATTATTTATTATAGTAAAATGATGATGTGAGTAAATGTAGGGACCACAAGAAAGAGAGAAATATTAATATAATTTGAAGTGGGGACCAAAACATGTCAAAAAAGGAAAGTGTATTTTTAAAATACGACCGTTTAAGGAAGGTGTATCTCTTTTTAAAATACAGAGGGAGTACCATGAGGGTTGTTTCAGCTAACTCACTTTTCAATTTTACCCATTTCTTAATCGGGGCTGTTTGGTTCTT
ATAGGAGAAAGAGAAATGAAATAAGAAAGGGAAGAAGGGGATAAAAACAAGAAAGTGAACTTAATTATTTTGTTTGGTTCTAAGGTGGCTAAAATCATTAATGAAATATATTTGTTTGGTTATATTTACAGACAATATAAAGCCCATCCTTCTCCATTTGCTTCTAACAACTCCGTATTTCCACCTCAACTTCCATTTTACTACCTTCATCCTCCTCCCCCATGTATGATTACATGAGGTAAGCATGTGTACATATTTGTGACTAACTTACTAAATGCTTTGCGAAAAAGTTCACTTTTAGAGAGCCATGATATGTCCTTCATGTTGCTTGAGTTTCTTGCTGCCAGTGAATTTGAACAGCGAAAAACTTTAAATTGGAGAAAAACAAAATTCTAAAGAAAGTAGGATAAATGATTGGAACTAACATGCTTTCAAATGTGCTTTTATGTAAGTTAAATCTAGGAGTGTGACTGGGAATGAAATTACTGGCATCATGCTTGTCATTATTGTAATGTTTTGATTCAGTTGCATCTCTTTGCTTTGTGCCTAATGAAAGATATACTTTTTATTTAGAAGTTACTAGTATGATAGCTGCCAAGGGTTCTTAGAAAATGCTCTTCTATTACTAGTATGATAGCGCCAAGGCTGGACAGAAGATAAACCCAGAATTGAGAGAGTATATTTAAACAGCTTGTAGAATATATGCTTCTGCCATTATCCTATTATATGTGTTGTTTTGCAAGTTCATTACTCTATGCCTAAACCTTTTCCAGGGAAAAAAAAAAAAAAACCCTCTGTTTGGATATATATTTGATGTCCGAAAAAAAGGTTACATTTCCTAAAATTAAAATATATTTATGGTAATATTTCCTATTGTTAAGGAGGTTTTCAGTAAACCATGGTTAATCTCTAGTTATTAGAGGTGAGGATAATCCTTGTTTTTATTAGATAAGTTGGAGTGAAGATTTTGACGTTTGTGAGCTGGCTGACTAGTACGGTCTGTATAGGTCTATACTATGAAGTTTGGAGTTTAGAGTTGGATTTGAGTTATGTCTAGTGTCTACAACTCTACATCTATTGGCCTCTGTCTTTACACATGAACCAAAGTGAATGTTTACATTGATATACTGTATAAGTTTATTTGATGGAATTGAATCTGGAAAGAGCTCAGCTAATGAATATCAGTTTTACTGTTTTGTAGGAGCCAACGTGGTGATGTTAATAGTGATTGTTACGTTGATGGAATGAACCTAATTGAAAGTAGCTCGAAGGAAGCATTAAATGTTATGTCTGTTGCTGAGACTTTGATCTCAGAGATGCACGATTTATCCTTCATGCTTGAAAGCAGCCTCTCCATTCCAAACAGTGGTAGATGATAACATCACATATGTGCCTACTTGTACCAGATGCATGATTCGAGCTTTTTATCGTCTCTCTGCATTGTATGACCTTTTGTATGTATTGGCTTGATCTAATTTTGAACGTCGTGGGATATTTCTGAAATTAAACCGCACCTTAATTGGCCTAGGAATGGGTTTCTACTCCATTATGATGCCATCAACGGTACTAGGGCCATAAAATATGTACTGTGATTTATTGGTGGTTACTACATCTCGTTGTGGGAACCAACTTTTCGCCCCACAGAGGCATCGACCCACCCAAAGCAACTGAAACAGAAAGAGGGGAACTTAGCCTCACCCAAAAGGTGGTTGTTCAAGGTCAGTCGAAATTAGCTCCGATGACGACGCGTGGACAGTTGAGTCCAACTAAAGGAGGAAAACCAAATCCTCTTGTAAAATAAAGATGGTGAGTGTGGTGTATTTTATTTGAGGGAGTGGTGAAGTTTAACCCGTGTACATTAGAGGCTTTGGTACACAAATTTGAAGTATTTGCAACATTATATTAGGACTTTGACTTTTTTTTCCTCATTAATGTAATTCTTGGGAATGACCAAAAGCAATTTGTGAATTAACAAACTTTCAATCTTTTAGCTTTTTTCA
ATTATAATAATCAAATTAAA
mRNA sequence
AAAAAAATCCTATCCATGTGATTCTTAATGTATTATCAAAACCCAATCAGCAAACTTAGTGTAGTTTGATGTTTGCATAGAGTCGGTCTATGGCTGATCCAGTATTACGTAGAATGACAGAACACCATAAACAAAAAATATCTTTGACTTGAGAATACCAGAACACCATAAACAAAAAATATCCATGTGAACCTAAATCAAAATCCTAAAGTAAAAAACCTAAATCAAAATCCTAAATCAAAATCCTAAAATAAAAAACCTAAATCAAAATCCTAAATCAAAATTAAAATTAAAAAACTCTACCAATTTCAGCAACATGAATCAATTTCAATTAGAAACATTTCGCAAGATAAAATGAGCACGGAAAATCAAGAGAAGAGAGAACAACAAACCTCTACTCTTCGCTGTCCTACCCCGTCGGCCACCATTATTCGGTCTACAGAAGGTGAAGGAACGCGCATCAACGTCGGCAAGGAGGTGACGGAGACGGTGGTTGAGGTTGAGACGGCTTGGCTGGTGAGTTTTTCTGGGTGCAATTACTGAGGGAGAGGTTGGGCCTCAATAGCTGGCAAAGGTAAGAGGTTGGCAGAGAAAGGAGTTCCAACGTGCACCAAAATAGGGGTAAAATTGGTAGTTCATGTAAATGTGTTTCAACGCACGTATCGGATAGCCGCTGCGCCATCCTCTACGGGACCTAGCTAGTATCAACAACTTCCCATCCTGTTCTCAAACTCTTTTCCCAAAAGTTTTCATTCCCACGAACCAAGTCATCAACTGTGTTTCTCACTCCCAAATTCCAAGTTTCTAACCCATTGGCACAACGAAATTCAATTAATTGGAACAAATTCGAAACCCTAACCGCCATTTCCGAACCTTGACATGCTACCAATCCTTCCCCATTTACCTTTTTAATGGAGAATAGCTACTCGGTATCACCCAATCTATTTGATCAAGTTCAGATCTTGTTGAAGAAAGAAGCTGGCTTTGATTCTATCAACCTTAATGATCACTCACTTGATTCTCCACCATCGTCATTTTTCGAAGAAGCTGTCGCTGGATCAAAGCCCTCTTCTGATGCTTTGCGATGTAAGAACTGCAGCGGAGAGCTATTGCGAGGGTCGGATTCTATAATTTGTGTATATTGTGGGCATGCGCGGCAGCAAGATTTTTTTCGTCAGCCGATTTCTTTCAAAAATTCCTTCGCGTTTCAGTGGTTTTTGAAGTCTCTTGATTTGGATGGATTCGAAAGTGTTGGGATACCGGTTAGAGATAATCAATTTGGGGATAGAGGGCGGAGTTCGATCAAGGAGGACGAAATCACGTTATCGGATTTCCTTAATCTCCAGCTAAAGTGGTCTGATGATTCAAGAGAATTTGGTGTGGAATCTGAAGAACGGCTTCCTGGGAAAAGTTATTTGAGTTTAATTGGTGTAGATATAGAGGATATTTCAGGTCAGGAAAGTATTGATAATGGTTTTATTGTTCGCAATGAACATGTGGTGGTGAGTCGAAAATTTGGAAATGAAGAAAGTGGAATTTCAGGAAAGAGGAGTCTTAGTTTGTTTGAAAATACGCAGTTTACTGAGGCTGCTTCCAAGTCTGGCGAGGGTGAGAATACCGAGTCCTTTTCTGATTGGGGAGCGGAGTTTCAATCTGCCGTCCTAGAGTCTGGATCTGAGAAATCAGCCAATAGCTTACTTTCTGATGCTCTTTCAGGAAGTATGACTATTTCTAGTGACAATCTTTGGAGTAACTTGGAGTCTGTACAACCTTTCAACATTAGGGTATCTGACGGGAATATCTTGGTTGAAGGTGGTGGAAAGGCCACTGAACCAGTTGATCCCTCTTTTGATTGGTCACCAGCAAATGAACAACCTACCATGTTGGACAATTCCCCTAATATTGAGAAAAGAGATGGTGATGATTCTTTTTGTGACTGGGGGGACTTTAGGATGTCTATAAATGATAGTATTACTCAAAGCGAACCAAGCAGTGCAGCT
GTGACTGGTACTGATAGTAGCAGTAGACTCTTTACTTCTGACCAATTTGATAATTTCACTGGTTTTGTGGTTTCAAAAAGCATCCAATTCCCGACCGATTCTTTTAAAGGGCCTGATAATGCCTGGAACAGTTTTACAACATCAAGTACTAGAGGAGATATTCAATCGCAGATCAACAGCGATAAGACAGAGGAGCATGCTGGTATAGGTGATCCAGCAATGGATGGAGAAAATGATCTTGCTCCGTTGGATGTTACTTACTTTGAACACGAAGGTATCAATGTTGAAAGGACCCAGAATAAGATATCTGATCTAGGTGATTCAATCGGGTCATGGAATAATGATGTACACTTAGGTATTGTAGGTGGACTTGATCAGTTGCAAGGCAGAGAAAGCAAGTCACCTGCAAGTAGGACTACCAATAACCTTCATAATTCATTTGACTTGTGGAATGATTTTAAGGGTTCTGACAATCAACACCAAGCAATTACTAAAGAAGAAGCTGATAACAAAATATCAACTGAAGATGTAAACAGATGTGATGAATGGAATGATTTTGCTGGACCTAGTGTTGGGAAAAAGGATCAGCAAGCCAGCAACCTAGAAGTGGTAGATCGGAAGGGAAATGGTAATCAAAGTGATTCATTGGCTTTGTGGAACGATAACTCGCATGATGCATGGAGTGATTTTGCAGGACGTGGTGTTTTTGACAGTAAGCCGATTAGTAATGATATTTCAGCTGTGGTATGGAATGACTTTTCAAGTATAATAGCAAGTGATGCAAATCAAACTCAAAGTAGCGACACAAGAAACTCTGAAAATCAAGCAGCACCCGAGAATGATAACCCATTTAGTTCATGGAGTGATTTCCGGAGCTCGGGAACTCATACAGGAAATGGGTTTTTGGGACCGGATGATAACAACGCTAATTCAGATCAAATCAGCTTCCAGAAAGATCCTAACAGCCTAATAAAGGTTGAGGGCAATAGCACATTCAGTTCATCGGGTGATATTGGGGGCTCAACATCCATTTTAGAACCGCAACTGCAAGCTGCAGTAGCAGGATCTCCTGAAAATATATTGAGCAGTGGAGATGGTTATTTGTTTTCAGGATTTAGTAACAGTGGCAAACAATTTATTAGTCAACCAGACAATCAACTCCGGTTTGATGGAGTTGCAATACATAATGGAGAATCTGCTCCTGAAAGTGATGATCCATTTGGTGCATGGTCTGATTTTACGAGTTCAAGTAATATGCAAGCAAATCTTAGTAGCTCAGCGAATGCACCTGATGACAAAGAAGCTGTTAAAGGTGATCTGTTTGATGCATGGAATGTAATCGTTAGCTCCCCCAATATGGTAACTGGTAAAGACACCAATGTCACAATGGTAAACACACCAGAATTAAATCATTTCAGCCCCAATATGAAGTCACAATATGCGGGGCCTAACAGATCGATTCCCTTGGATCTAAATGTTGGAATGTTCACTCACCAAAATGGTTCTGCACCAGATGGATTTTCACCACCTGAAGGTCCTGATTTGGTTAGGAGCCAACGTGGTGATGTTAATAGTGATTGTTACGTTGATGGAATGAACCTAATTGAAAGTAGCTCGAAGGAAGCATTAAATGTTATGTCTGTTGCTGAGACTTTGATCTCAGAGATGCACGATTTATCCTTCATGCTTGAAAGCAGCCTCTCCATTCCAAACAGTGGTAGATGATAACATCACATATGTGCCTACTTGTACCAGATGCATGATTCGAGCTTTTTATCGTCTCTCTGCATTGTATGACCTTTTGTATGTATTGGCTTGATCTAATTTTGAACGTCGTGGGATATTTCTGAAATTAAACCGCACCTTAATTGGCCTAGGAATGGGTTTCTACTCCATTATGATGCCATCAACGGTACTAGGGCCATAAAATATGTACTGTGATTTATTGGTGGTTACTACATCTCGTTGTGGGAACCAACTTTTCGCCCCACAGAGGCATC
GACCCACCCAAAGCAACTGAAACAGAAAGAGGGGAACTTAGCCTCACCCAAAAGGTGGTTGTTCAAGGTCAGTCGAAATTAGCTCCGATGACGACGCGTGGACAGTTGAGTCCAACTAAAGGAGGAAAACCAAATCCTCTTGTAAAATAAAGATGGTGAGTGTGGTGTATTTTATTTGAGGGAGTGGTGAAGTTTAACCCGTGTACATTAGAGGCTTTGGTACACAAATTTGAAGTATTTGCAACATTATATTAGGACTTTGACTTTTTTTTCCTCATTAATGTAATTCTTGGGAATGACCAAAAGCAATTTGTGAATTAACAAACTTTCAATCTTTTAGCTTTTTTCAATTATAATAATCAAATTAAA
Coding sequence (CDS)
ATGGAGAATAGCTACTCGGTATCACCCAATCTATTTGATCAAGTTCAGATCTTGTTGAAGAAAGAAGCTGGCTTTGATTCTATCAACCTTAATGATCACTCACTTGATTCTCCACCATCGTCATTTTTCGAAGAAGCTGTCGCTGGATCAAAGCCCTCTTCTGATGCTTTGCGATGTAAGAACTGCAGCGGAGAGCTATTGCGAGGGTCGGATTCTATAATTTGTGTATATTGTGGGCATGCGCGGCAGCAAGATTTTTTTCGTCAGCCGATTTCTTTCAAAAATTCCTTCGCGTTTCAGTGGTTTTTGAAGTCTCTTGATTTGGATGGATTCGAAAGTGTTGGGATACCGGTTAGAGATAATCAATTTGGGGATAGAGGGCGGAGTTCGATCAAGGAGGACGAAATCACGTTATCGGATTTCCTTAATCTCCAGCTAAAGTGGTCTGATGATTCAAGAGAATTTGGTGTGGAATCTGAAGAACGGCTTCCTGGGAAAAGTTATTTGAGTTTAATTGGTGTAGATATAGAGGATATTTCAGGTCAGGAAAGTATTGATAATGGTTTTATTGTTCGCAATGAACATGTGGTGGTGAGTCGAAAATTTGGAAATGAAGAAAGTGGAATTTCAGGAAAGAGGAGTCTTAGTTTGTTTGAAAATACGCAGTTTACTGAGGCTGCTTCCAAGTCTGGCGAGGGTGAGAATACCGAGTCCTTTTCTGATTGGGGAGCGGAGTTTCAATCTGCCGTCCTAGAGTCTGGATCTGAGAAATCAGCCAATAGCTTACTTTCTGATGCTCTTTCAGGAAGTATGACTATTTCTAGTGACAATCTTTGGAGTAACTTGGAGTCTGTACAACCTTTCAACATTAGGGTATCTGACGGGAATATCTTGGTTGAAGGTGGTGGAAAGGCCACTGAACCAGTTGATCCCTCTTTTGATTGGTCACCAGCAAATGAACAACCTACCATGTTGGACAATTCCCCTAATATTGAGAAAAGAGATGGTGATGATTCTTTTTGTGACTGGGGGGACTTTAGGATGTCTATAAATGATAGTATTACTCAAAGCGAACCAAGCAGTGCAGCTGTGACTGGTACTGATAGTAGCAGTAGACTCTTTACTTCTGACCAATTTGATAATTTCACTGGTTTTGTGGTTTCAAAAAGCATCCAATTCCCGACCGATTCTTTTAAAGGGCCTGATAATGCCTGGAACAGTTTTACAACATCAAGTACTAGAGGAGATATTCAATCGCAGATCAACAGCGATAAGACAGAGGAGCATGCTGGTATAGGTGATCCAGCAATGGATGGAGAAAATGATCTTGCTCCGTTGGATGTTACTTACTTTGAACACGAAGGTATCAATGTTGAAAGGACCCAGAATAAGATATCTGATCTAGGTGATTCAATCGGGTCATGGAATAATGATGTACACTTAGGTATTGTAGGTGGACTTGATCAGTTGCAAGGCAGAGAAAGCAAGTCACCTGCAAGTAGGACTACCAATAACCTTCATAATTCATTTGACTTGTGGAATGATTTTAAGGGTTCTGACAATCAACACCAAGCAATTACTAAAGAAGAAGCTGATAACAAAATATCAACTGAAGATGTAAACAGATGTGATGAATGGAATGATTTTGCTGGACCTAGTGTTGGGAAAAAGGATCAGCAAGCCAGCAACCTAGAAGTGGTAGATCGGAAGGGAAATGGTAATCAAAGTGATTCATTGGCTTTGTGGAACGATAACTCGCATGATGCATGGAGTGATTTTGCAGGACGTGGTGTTTTTGACAGTAAGCCGATTAGTAATGATATTTCAGCTGTGGTATGGAATGACTTTTCAAGTATAATAGCAAGTGATGCAAATCAAACTCAAAGTAGCGACACAAGAAACTCTGAAAATCAAGCAGCACCCGAGAATGATAACCCATTTAGTTCATGGAGTGATTTCCGGAGCTCGGGAACTCATACAGGAAATGGGTTTTTGGGACC
GGATGATAACAACGCTAATTCAGATCAAATCAGCTTCCAGAAAGATCCTAACAGCCTAATAAAGGTTGAGGGCAATAGCACATTCAGTTCATCGGGTGATATTGGGGGCTCAACATCCATTTTAGAACCGCAACTGCAAGCTGCAGTAGCAGGATCTCCTGAAAATATATTGAGCAGTGGAGATGGTTATTTGTTTTCAGGATTTAGTAACAGTGGCAAACAATTTATTAGTCAACCAGACAATCAACTCCGGTTTGATGGAGTTGCAATACATAATGGAGAATCTGCTCCTGAAAGTGATGATCCATTTGGTGCATGGTCTGATTTTACGAGTTCAAGTAATATGCAAGCAAATCTTAGTAGCTCAGCGAATGCACCTGATGACAAAGAAGCTGTTAAAGGTGATCTGTTTGATGCATGGAATGTAATCGTTAGCTCCCCCAATATGGTAACTGGTAAAGACACCAATGTCACAATGGTAAACACACCAGAATTAAATCATTTCAGCCCCAATATGAAGTCACAATATGCGGGGCCTAACAGATCGATTCCCTTGGATCTAAATGTTGGAATGTTCACTCACCAAAATGGTTCTGCACCAGATGGATTTTCACCACCTGAAGGTCCTGATTTGGTTAGGAGCCAACGTGGTGATGTTAATAGTGATTGTTACGTTGATGGAATGAACCTAATTGAAAGTAGCTCGAAGGAAGCATTAAATGTTATGTCTGTTGCTGAGACTTTGATCTCAGAGATGCACGATTTATCCTTCATGCTTGAAAGCAGCCTCTCCATTCCAAACAGTGGTAGATGA
Protein sequence
MENSYSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFGDRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDISGQESIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFSDWGAEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVEGGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPSSAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQINSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVHLGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDVNRCDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAGRGVFDSKPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHTGNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSPENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSSNMQANLSSSANAPDDKEAVKGDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNMKSQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNLIESSSKEALNVMSVAETLISEMHDLSFMLESSLSIPNSGR
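The protein sequence above is the translation of the coding sequence (CDS) listed before it. As a minimal sketch, assuming the standard nuclear genetic code applies, the relationship can be checked with a small translator. The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative and not part of any database tool. Only the first 15 and last 12 nucleotides of the CDS are used here, so the example stays short.

```python
# Standard genetic code, laid out in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T (for example N)
    are reported as 'X'. Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 nt of the CDS above: matches the start of the protein (MENSY...).
print(translate("ATGGAGAATAGCTAC"))   # MENSY
# Last 12 nt of the CDS above: matches the end of the protein (...SGR), then TGA stop.
print(translate("AGTGGTAGATGA"))      # SGR
```

Passing the full CDS, with its line breaks removed, to translate should reproduce the whole protein sequence if the standard-code assumption holds for this gene.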
Homology
BLAST of Spo26617.1 vs. NCBI nr
Match:
gi|902192031|gb|KNA11962.1| (hypothetical protein SOVF_130260 [Spinacia oleracea])
HSP 1 Score: 1844.7 bits (4777), Expect = 0.000e+0
Identity = 935/937 (99.79%), Positives = 936/937 (99.89%), Query Frame = 1
Query: 1 MENSYSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCK 60
MENSYSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCK
Sbjct: 1 MENSYSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCK 60
Query: 61 NCSGELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRD 120
NCSGELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRD
Sbjct: 61 NCSGELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRD 120
Query: 121 NQFGDRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDIS 180
NQFGDRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDIS
Sbjct: 121 NQFGDRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDIS 180
Query: 181 GQESIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFS 240
GQESIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFS
Sbjct: 181 GQESIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFS 240
Query: 241 DWGAEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE 300
DWGAEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE
Sbjct: 241 DWGAEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE 300
Query: 301 GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS 360
GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS
Sbjct: 301 GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS 360
Query: 361 SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQ 420
SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQ
Sbjct: 361 SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQ 420
Query: 421 INSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVH 480
INSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVH
Sbjct: 421 INSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVH 480
Query: 481 LGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDV 540
LGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDV
Sbjct: 481 LGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDV 540
Query: 541 NRCDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAGRGVFD 600
+RCDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAGRGVFD
Sbjct: 541 SRCDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAGRGVFD 600
Query: 601 SKPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHT 660
SKPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHT
Sbjct: 601 SKPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHT 660
Query: 661 GNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSP 720
GNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSP
Sbjct: 661 GNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSP 720
Query: 721 ENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSS 780
ENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSS
Sbjct: 721 ENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSS 780
Query: 781 NMQANLSSSANAPDDKEAVKGDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNMK 840
NMQANLSSSANAPDDKEAVKGDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNMK
Sbjct: 781 NMQANLSSSANAPDDKEAVKGDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNMK 840
Query: 841 SQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNLIES 900
SQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNL ES
Sbjct: 841 SQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNLSES 900
Query: 901 SSKEALNVMSVAETLISEMHDLSFMLESSLSIPNSGR 938
SSKEALNVMSVAETLISEMHDLSFMLESSLSIPNSGR
Sbjct: 901 SSKEALNVMSVAETLISEMHDLSFMLESSLSIPNSGR 937
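The identity and positives percentages in an HSP summary line are simply the matched counts divided by the alignment length. The sketch below parses such a line and recomputes the percentages as a consistency check. The function names parse_hsp_stats and percent are hypothetical helpers, and the regular expression assumes the exact line layout used in this report, not BLAST output in general.

```python
import re

# Matches e.g. "Identity = 935/937 (99.79%), Positives = 936/937 (99.89%)".
# "Posi?tives" also tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, total):
    """Return count/total as a percentage rounded to two decimals."""
    return round(100.0 * count / total, 2)


def parse_hsp_stats(line):
    """Extract identity and positives counts and reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + repr(line))
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 935/937 (99.79%), Positives = 936/937 (99.89%), Query Frame = 1"
)
for name, (count, total, reported) in stats.items():
    # Recomputed value should agree with the reported one.
    print(name, count, total, reported, percent(count, total))
```

The same check applies to the other hits in this section, for example 537/937 for the Beta vulgaris match.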
BLAST of Spo26617.1 vs. NCBI nr
Match:
gi|731341395|ref|XP_010681881.1| (PREDICTED: uncharacterized protein LOC104896786 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 934.1 bits (2413), Expect = 1.900e-268
Identity = 537/937 (57.31%), Positives = 662/937 (70.65%), Query Frame = 1
Query: 5 YSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSG 64
+ +S +L V ILLKKEA F+S N +DHS DS SSF EEA+AGS PS D LRCKNC G
Sbjct: 4 HEISADLVYLVLILLKKEANFESFNSDDHSFDSLASSFLEEAIAGSSPSVDVLRCKNCRG 63
Query: 65 ELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFG 124
+LLRGS+SIICVYCG+A+ QDF +PISFK + AFQWFLKSLDLDG ESVG P+ Q G
Sbjct: 64 QLLRGSESIICVYCGNAQLQDFVPEPISFKPTSAFQWFLKSLDLDGNESVGTPLGGTQSG 123
Query: 125 DRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDISGQES 184
DRG+SS+KEDEI LSDFL+LQLKW D+SR FGV++EE LPGK Y SLIG+DIE+I QES
Sbjct: 124 DRGQSSVKEDEILLSDFLDLQLKWLDNSRGFGVKNEE-LPGKRYSSLIGIDIEEILTQES 183
Query: 185 IDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFSDWGA 244
D+ N + S NEE G+R LSLFENTQ +EAA +SGEGEN ESFSDWGA
Sbjct: 184 RDHTSSDSNLQGIASPDNLNEEFETLGQRKLSLFENTQSSEAAPRSGEGENDESFSDWGA 243
Query: 245 EFQSAVLESGSEKSANSLLSDALSGS-MTISS-DNLWSNLESVQPFNIRVSDGNILVEGG 304
EFQSA+ G EK +N +SDA SGS TISS +N+WSNLE PFN +VSDGN++VEG
Sbjct: 244 EFQSAIPGPGPEKVSNISVSDASSGSTSTISSNNNIWSNLEPGPPFNSKVSDGNLVVEGD 303
Query: 305 GKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPSSA 364
GKA E + SFDW P + T D+ N EKRDGDDSF +WG+F +S+++ ITQ EPS+
Sbjct: 304 GKAFESANASFDWLPDHRDTTREDSVEN-EKRDGDDSFDEWGEFNISVSNKITQCEPSNT 363
Query: 365 AVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQIN 424
AV TDSS L TSD ++FTG +VS +IQ T++ + PD+AWN+FTT ST G+IQSQ
Sbjct: 364 AVPDTDSSGGLVTSDPVNDFTGSIVSSNIQLQTNNLRFPDDAWNNFTTPSTGGNIQSQTI 423
Query: 425 SDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVHLG 484
SD +++H G+PA G+NDL LD YF H+ VE +QNK+S+ GD IG W +D+ G
Sbjct: 424 SDNSDQHVVTGNPAFAGQNDLGLLDGIYFSHQDTLVENSQNKVSEEGDPIGLWIDDIQFG 483
Query: 485 IVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDVNR 544
I G QLQG +SKS AS+ +SF LWNDFKGSDNQ QA E AD ISTE +
Sbjct: 484 IKSG--QLQGGDSKSHASKEL----DSFGLWNDFKGSDNQQQATNSEVADTIISTEGGST 543
Query: 545 CDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAG-RGVFDS 604
D WNDFAG SVG+K+++ L+VV K NGN++D +AL NDNS+D WSDF G G D+
Sbjct: 544 FDAWNDFAGHSVGRKEERDIGLKVVGVKENGNRNDLVALCNDNSNDLWSDFKGYTGGSDN 603
Query: 605 KPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHTG 664
K ISNDIS+ WNDFSS++ + NQ Q+S+T S++QA E+DN F+SW +FRSSG T
Sbjct: 604 KAISNDISSGAWNDFSSLMVTHTNQPQTSNTSTSDDQAVLEDDNIFNSWPEFRSSGVQTK 663
Query: 665 NGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSPE 724
+ L +D A+S+Q+SFQK P+SL VE NSTFSS GD GG+ SILE QL AA AGS E
Sbjct: 664 STSLQSNDTKASSNQVSFQKAPSSLKLVEDNSTFSSWGDFGGTASILEQQLHAAGAGSSE 723
Query: 725 NILSS-GDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSS 784
N LS+ GDG+L+ G SN KQ SQP +QL+ GV I + +S E+DD F +W+DFTSSS
Sbjct: 724 NKLSNGGDGFLY-GLSNIAKQSSSQPADQLQCHGVTIFDDKSTCENDDLFDSWTDFTSSS 783
Query: 785 NMQANLSSSANAPDDKEAVK-GDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNM 844
Q ++SSS AP DKEAV+ G+LFDAW +I +SPN+ KD V++ TPE N S N+
Sbjct: 784 TAQTSISSSVKAPIDKEAVQHGNLFDAWKLIDNSPNVQADKDIAVSVGQTPETNLLSTNI 843
Query: 845 KSQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNLIE 904
QY GPN S+ DL GM + QNG AP PE PDL R + GDV +D YVD +N
Sbjct: 844 NLQYVGPNSSLQPDLCAGMSSSQNGLAPVALLSPEVPDLARMEHGDVKADGYVDRLNPSG 903
Query: 905 SSSKEALNVMSV-AETLISEMHDLSFMLESSLSIPNS 936
+SKE L + S AETLISEMH+LSFMLES LSIPNS
Sbjct: 904 MNSKETLGLKSADAETLISEMHNLSFMLESGLSIPNS 931
BLAST of Spo26617.1 vs. NCBI nr
Match:
gi|1021495937|ref|XP_016190999.1| (PREDICTED: uncharacterized protein LOC107631918 isoform X3 [Arachis ipaensis])
HSP 1 Score: 172.2 bits (435), Expect = 4.400e-39
Identity = 224/803 (27.90%), Positives = 339/803 (42.22%), Query Frame = 1
Query: 14 QVQILLKKEAG---FDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGELLRGS 73
Q+QILL+K+A +D+ + SL PS E VA PS LRCKNC G LLRG
Sbjct: 12 QLQILLRKDANLSWYDTDKNDQLSLPKLPS--VAETVANLDPSPPYLRCKNCDGRLLRGV 71
Query: 74 DSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFGDRGRSS 133
S ICV+CG +D +PI FK++ ++W L SL LDG E V P+ + + R S
Sbjct: 72 QSSICVFCGANPHKDLPPEPIKFKDTLGYKWLLDSLQLDGSEMVA-PMEEEENSSSRRRS 131
Query: 134 IKEDEITLSDFLNLQLKW-SDDSREFGVESE-ERLPGKSYLSLIGVDIEDISGQESIDNG 193
+DEI LS+ L+L+++W S+ R S+ E GKS LSL GVD++ D+
Sbjct: 132 ESKDEIPLSELLDLEIRWPSEAERTLSSNSDSEAFQGKSSLSLAGVDLDGFFDHRESDSN 191
Query: 194 FIVRNEHVVVSRKFGNEES--GISGKRSLSLFENTQFTEAAS--KSGEGENTESFSDWGA 253
+ + + G+ S I +LSLF+N Q +E A+ +S E ++ +SFS W A
Sbjct: 192 --ASGQTMAFGDQVGDTASYSAIQASENLSLFQNVQASELATPTRSMEDQSDDSFSGWAA 251
Query: 254 EFQSA----VLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE 313
F+SA V E +S + D +SGS S FN VS GN +
Sbjct: 252 NFRSASSGPVNEEPESSFGHSKVQDTVSGS-------------SKDDFNHSVSKGNDWFQ 311
Query: 314 GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS 373
G W +N + P + + ND+ T +
Sbjct: 312 VDG-----------WRTSNLEVPNQSGKPELN---------------VDFNDTKTAESAT 371
Query: 374 SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSS--TRGDIQ 433
S++ D + T VV++++++ SF G WN FT S+ + D
Sbjct: 372 SSSTRNLDWMQDDQWQRSDNKTTAAVVTEAVEY---SFDG----WNDFTGSARDSAQDPS 431
Query: 434 SQINSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNND 493
S I+ E G + D N A E + I D SW D
Sbjct: 432 SIISRSNITEQVGKSEITADLNNTKA--------------EGNSSSIEDF-----SWMQD 491
Query: 494 VHLGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTE 553
D G +K+ +++TNN+ +SFD WNDF GS N Q ++ +++KI+ E
Sbjct: 492 ---------DVWPGSNNKTTDTKSTNNVEDSFDDWNDFSGSANA-QYLSSNLSNSKITGE 551
Query: 554 ----DVNRCDEWNDFAGPSVGKK---DQQASNLEVVDRKGNGNQSDSLALWND--NSHDA 613
++ + ++ AG G D + ++V + NQ+ + N+ +S D
Sbjct: 552 SGKSELAKNNDDKIIAGGDSGSSRNFDWMQDDQQLV----SNNQASDVVTTNEDADSFDN 611
Query: 614 WSDFAGRGVFDSKPISNDISAVVWNDFSSIIASDAN-QTQSSDTRNSENQAAPENDNPFS 673
W+DF G S V N SS++ S+ N S+T +Q E N S
Sbjct: 612 WNDFTG-------------SPVTQNPSSSVLYSETNILAGKSETSVDAHQTKTEVGNS-S 671
Query: 674 SWSDFRSSGTHTGNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSIL 733
S DF +G+ P NN +D +D +S + + F+ + +S L
Sbjct: 672 SIEDF---NWMQDDGW--PSGNNKTTDAKGTNEDADSF---DDWNDFAGLANAQHLSSNL 707
Query: 734 EPQLQAAVAGSPENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDD 792
+G E ++ D + G S S + F NQ + + E D
Sbjct: 732 SNSKITGESGKSELAKNNDDKRIAGGDSGSPRNFDWMQANQWQGSNDQASGIATTNEGAD 707
BLAST of Spo26617.1 vs. NCBI nr
Match:
gi|1021495929|ref|XP_016190993.1| (PREDICTED: uncharacterized protein LOC107631918 isoform X1 [Arachis ipaensis])
HSP 1 Score: 172.2 bits (435), Expect = 4.400e-39
Identity = 224/803 (27.90%), Positives = 339/803 (42.22%), Query Frame = 1
Query: 14 QVQILLKKEAG---FDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGELLRGS 73
Q+QILL+K+A +D+ + SL PS E VA PS LRCKNC G LLRG
Sbjct: 12 QLQILLRKDANLSWYDTDKNDQLSLPKLPS--VAETVANLDPSPPYLRCKNCDGRLLRGV 71
Query: 74 DSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFGDRGRSS 133
S ICV+CG +D +PI FK++ ++W L SL LDG E V P+ + + R S
Sbjct: 72 QSSICVFCGANPHKDLPPEPIKFKDTLGYKWLLDSLQLDGSEMVA-PMEEEENSSSRRRS 131
Query: 134 IKEDEITLSDFLNLQLKW-SDDSREFGVESE-ERLPGKSYLSLIGVDIEDISGQESIDNG 193
+DEI LS+ L+L+++W S+ R S+ E GKS LSL GVD++ D+
Sbjct: 132 ESKDEIPLSELLDLEIRWPSEAERTLSSNSDSEAFQGKSSLSLAGVDLDGFFDHRESDSN 191
Query: 194 FIVRNEHVVVSRKFGNEES--GISGKRSLSLFENTQFTEAAS--KSGEGENTESFSDWGA 253
+ + + G+ S I +LSLF+N Q +E A+ +S E ++ +SFS W A
Sbjct: 192 --ASGQTMAFGDQVGDTASYSAIQASENLSLFQNVQASELATPTRSMEDQSDDSFSGWAA 251
Query: 254 EFQSA----VLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE 313
F+SA V E +S + D +SGS S FN VS GN +
Sbjct: 252 NFRSASSGPVNEEPESSFGHSKVQDTVSGS-------------SKDDFNHSVSKGNDWFQ 311
Query: 314 GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS 373
G W +N + P + + ND+ T +
Sbjct: 312 VDG-----------WRTSNLEVPNQSGKPELN---------------VDFNDTKTAESAT 371
Query: 374 SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSS--TRGDIQ 433
S++ D + T VV++++++ SF G WN FT S+ + D
Sbjct: 372 SSSTRNLDWMQDDQWQRSDNKTTAAVVTEAVEY---SFDG----WNDFTGSARDSAQDPS 431
Query: 434 SQINSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNND 493
S I+ E G + D N A E + I D SW D
Sbjct: 432 SIISRSNITEQVGKSEITADLNNTKA--------------EGNSSSIEDF-----SWMQD 491
Query: 494 VHLGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTE 553
D G +K+ +++TNN+ +SFD WNDF GS N Q ++ +++KI+ E
Sbjct: 492 ---------DVWPGSNNKTTDTKSTNNVEDSFDDWNDFSGSANA-QYLSSNLSNSKITGE 551
Query: 554 ----DVNRCDEWNDFAGPSVGKK---DQQASNLEVVDRKGNGNQSDSLALWND--NSHDA 613
++ + ++ AG G D + ++V + NQ+ + N+ +S D
Sbjct: 552 SGKSELAKNNDDKIIAGGDSGSSRNFDWMQDDQQLV----SNNQASDVVTTNEDADSFDN 611
Query: 614 WSDFAGRGVFDSKPISNDISAVVWNDFSSIIASDAN-QTQSSDTRNSENQAAPENDNPFS 673
W+DF G S V N SS++ S+ N S+T +Q E N S
Sbjct: 612 WNDFTG-------------SPVTQNPSSSVLYSETNILAGKSETSVDAHQTKTEVGNS-S 671
Query: 674 SWSDFRSSGTHTGNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSIL 733
S DF +G+ P NN +D +D +S + + F+ + +S L
Sbjct: 672 SIEDF---NWMQDDGW--PSGNNKTTDAKGTNEDADSF---DDWNDFAGLANAQHLSSNL 707
Query: 734 EPQLQAAVAGSPENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDD 792
+G E ++ D + G S S + F NQ + + E D
Sbjct: 732 SNSKITGESGKSELAKNNDDKRIAGGDSGSPRNFDWMQANQWQGSNDQASGIATTNEGAD 707
BLAST of Spo26617.1 vs. NCBI nr
Match:
gi|1012099552|ref|XP_015956991.1| (PREDICTED: uncharacterized protein LOC107481263 isoform X1 [Arachis duranensis])
HSP 1 Score: 162.2 bits (409), Expect = 4.500e-36
Identity = 216/791 (27.31%), Positives = 330/791 (41.72%), Query Frame = 1
Query: 14 QVQILLKKEAG---FDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGELLRGS 73
Q+QILL+K+A +D+ + SL PS E VA PS LRCKNC G LLRG
Sbjct: 12 QLQILLRKDANLSWYDTEKNDQLSLPKLPS--VAETVANLDPSPPYLRCKNCDGRLLRGV 71
Query: 74 DSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFGDRGRSS 133
S ICV+CG +D +PI FK++ ++W L SL LDG E V P+ + + R S
Sbjct: 72 QSSICVFCGANPHKDLPPEPIKFKDTLGYKWLLDSLQLDGSEMVA-PMEEEENSSSRRRS 131
Query: 134 IKEDEITLSDFLNLQLKW-SDDSREFGVESE-ERLPGKSYLSLIGVDIEDISGQESIDNG 193
+DEI LS+ L+L+++W S+ R S+ E GKS LSL GVD++ D+
Sbjct: 132 ESKDEIPLSELLDLEIRWPSEAERTLSSNSDSEAFQGKSSLSLAGVDLDGFFDHRESDSN 191
Query: 194 FIVRNEHVVVSRKFGNEES--GISGKRSLSLFENTQFTEAAS--KSGEGENTESFSDWGA 253
+ + + G+ S I +LSLF+N Q +E A+ +S E ++ +SFS W A
Sbjct: 192 --ASGQTMAFGDQVGDTASYSAIQAGENLSLFQNVQASELATPTRSMEDQSDDSFSGWAA 251
Query: 254 EFQSA----VLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE 313
F+SA V E +S + D +SGS S FN VS GN +
Sbjct: 252 NFRSASSGPVNEEPESSFGHSKVQDTVSGS-------------SKDDFNHSVSKGNDWFQ 311
Query: 314 GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS 373
G W +N + P + + ND+ T +
Sbjct: 312 VDG-----------WRTSNLEVPNQSGKPELN---------------VDFNDTKTAESAT 371
Query: 374 SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRG--DIQ 433
S++ D + T VV++++++ SF G WN FT S+ D
Sbjct: 372 SSSTRNLDWMQDDQWQRNDNKTTAAVVTEAVEY---SFDG----WNDFTGSARASAQDPS 431
Query: 434 SQINSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNND 493
S I+ E G + D N A E + I D SW D
Sbjct: 432 SIISRSNITEQVGKSEITADNNNTKA--------------EGNSSSIEDF-----SWMQD 491
Query: 494 VHLGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTE 553
GI G +K+ +++TNN+ +SFD WNDF GS + Q + +++KI+ E
Sbjct: 492 ---GI------WPGSNNKTTDTKSTNNVEDSFDDWNDFSGSASA-QYLPSNLSNSKITGE 551
Query: 554 ----DVNRCDEWNDFAGPSVGKK---DQQASNLEVVDRKGNGNQSDSLALWND--NSHDA 613
++ + ++ AG G D + ++V + NQ+ + N+ +S D
Sbjct: 552 SGKSELAKNNDDKRIAGGDSGSSRNFDWMQDDQQLV----SNNQASDVVTTNEDADSFDN 611
Query: 614 WSDFAGRGVFDSKPISNDISAVVWNDFSSIIASDAN-QTQSSDTRNSENQAAPENDNPFS 673
W+D G S V N +S++ S+ N S+T +Q E N S
Sbjct: 612 WNDLTG-------------SPVTQNPSNSVLYSETNILAGKSETSVDAHQTKTEVGNS-S 671
Query: 674 SWSDFRSSGTHTGNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSIL 733
S DF +G+ P NN +D +D +S + + F+ + +S L
Sbjct: 672 SIEDF---NWMQDDGW--PSGNNKTTDAKGTNEDADSF---DDWNDFAGLANAQHLSSNL 696
Query: 734 EPQLQAAVAGSPENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDD 780
+G E ++ D + S S + F NQ + + E D
Sbjct: 732 SNSRITGESGKSELAKNNDDKRIAGDDSGSPRNFDWMQANQWQGSNDQASGIATTNEGAD 696
BLAST of Spo26617.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9QXG9_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_130260 PE=4 SV=1)
HSP 1 Score: 1844.7 bits (4777), Expect = 0.000e+0
Identity = 935/937 (99.79%), Positives = 936/937 (99.89%), Query Frame = 1
Query: 1 MENSYSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCK 60
MENSYSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCK
Sbjct: 1 MENSYSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCK 60
Query: 61 NCSGELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRD 120
NCSGELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRD
Sbjct: 61 NCSGELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRD 120
Query: 121 NQFGDRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDIS 180
NQFGDRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDIS
Sbjct: 121 NQFGDRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDIS 180
Query: 181 GQESIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFS 240
GQESIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFS
Sbjct: 181 GQESIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFS 240
Query: 241 DWGAEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE 300
DWGAEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE
Sbjct: 241 DWGAEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVE 300
Query: 301 GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS 360
GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS
Sbjct: 301 GGGKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPS 360
Query: 361 SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQ 420
SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQ
Sbjct: 361 SAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQ 420
Query: 421 INSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVH 480
INSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVH
Sbjct: 421 INSDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVH 480
Query: 481 LGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDV 540
LGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDV
Sbjct: 481 LGIVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDV 540
Query: 541 NRCDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAGRGVFD 600
+RCDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAGRGVFD
Sbjct: 541 SRCDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAGRGVFD 600
Query: 601 SKPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHT 660
SKPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHT
Sbjct: 601 SKPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHT 660
Query: 661 GNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSP 720
GNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSP
Sbjct: 661 GNGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSP 720
Query: 721 ENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSS 780
ENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSS
Sbjct: 721 ENILSSGDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSS 780
Query: 781 NMQANLSSSANAPDDKEAVKGDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNMK 840
NMQANLSSSANAPDDKEAVKGDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNMK
Sbjct: 781 NMQANLSSSANAPDDKEAVKGDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNMK 840
Query: 841 SQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNLIES 900
SQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNL ES
Sbjct: 841 SQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNLSES 900
Query: 901 SSKEALNVMSVAETLISEMHDLSFMLESSLSIPNSGR 938
SSKEALNVMSVAETLISEMHDLSFMLESSLSIPNSGR
Sbjct: 901 SSKEALNVMSVAETLISEMHDLSFMLESSLSIPNSGR 937
BLAST of Spo26617.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8C2R3_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_6g144000 PE=4 SV=1)
HSP 1 Score: 934.1 bits (2413), Expect = 1.300e-268
Identity = 537/937 (57.31%), Positives = 662/937 (70.65%), Query Frame = 1
Query: 5 YSVSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSG 64
+ +S +L V ILLKKEA F+S N +DHS DS SSF EEA+AGS PS D LRCKNC G
Sbjct: 4 HEISADLVYLVLILLKKEANFESFNSDDHSFDSLASSFLEEAIAGSSPSVDVLRCKNCRG 63
Query: 65 ELLRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFG 124
+LLRGS+SIICVYCG+A+ QDF +PISFK + AFQWFLKSLDLDG ESVG P+ Q G
Sbjct: 64 QLLRGSESIICVYCGNAQLQDFVPEPISFKPTSAFQWFLKSLDLDGNESVGTPLGGTQSG 123
Query: 125 DRGRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDISGQES 184
DRG+SS+KEDEI LSDFL+LQLKW D+SR FGV++EE LPGK Y SLIG+DIE+I QES
Sbjct: 124 DRGQSSVKEDEILLSDFLDLQLKWLDNSRGFGVKNEE-LPGKRYSSLIGIDIEEILTQES 183
Query: 185 IDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFSDWGA 244
D+ N + S NEE G+R LSLFENTQ +EAA +SGEGEN ESFSDWGA
Sbjct: 184 RDHTSSDSNLQGIASPDNLNEEFETLGQRKLSLFENTQSSEAAPRSGEGENDESFSDWGA 243
Query: 245 EFQSAVLESGSEKSANSLLSDALSGS-MTISS-DNLWSNLESVQPFNIRVSDGNILVEGG 304
EFQSA+ G EK +N +SDA SGS TISS +N+WSNLE PFN +VSDGN++VEG
Sbjct: 244 EFQSAIPGPGPEKVSNISVSDASSGSTSTISSNNNIWSNLEPGPPFNSKVSDGNLVVEGD 303
Query: 305 GKATEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPSSA 364
GKA E + SFDW P + T D+ N EKRDGDDSF +WG+F +S+++ ITQ EPS+
Sbjct: 304 GKAFESANASFDWLPDHRDTTREDSVEN-EKRDGDDSFDEWGEFNISVSNKITQCEPSNT 363
Query: 365 AVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGDIQSQIN 424
AV TDSS L TSD ++FTG +VS +IQ T++ + PD+AWN+FTT ST G+IQSQ
Sbjct: 364 AVPDTDSSGGLVTSDPVNDFTGSIVSSNIQLQTNNLRFPDDAWNNFTTPSTGGNIQSQTI 423
Query: 425 SDKTEEHAGIGDPAMDGENDLAPLDVTYFEHEGINVERTQNKISDLGDSIGSWNNDVHLG 484
SD +++H G+PA G+NDL LD YF H+ VE +QNK+S+ GD IG W +D+ G
Sbjct: 424 SDNSDQHVVTGNPAFAGQNDLGLLDGIYFSHQDTLVENSQNKVSEEGDPIGLWIDDIQFG 483
Query: 485 IVGGLDQLQGRESKSPASRTTNNLHNSFDLWNDFKGSDNQHQAITKEEADNKISTEDVNR 544
I G QLQG +SKS AS+ +SF LWNDFKGSDNQ QA E AD ISTE +
Sbjct: 484 IKSG--QLQGGDSKSHASKEL----DSFGLWNDFKGSDNQQQATNSEVADTIISTEGGST 543
Query: 545 CDEWNDFAGPSVGKKDQQASNLEVVDRKGNGNQSDSLALWNDNSHDAWSDFAG-RGVFDS 604
D WNDFAG SVG+K+++ L+VV K NGN++D +AL NDNS+D WSDF G G D+
Sbjct: 544 FDAWNDFAGHSVGRKEERDIGLKVVGVKENGNRNDLVALCNDNSNDLWSDFKGYTGGSDN 603
Query: 605 KPISNDISAVVWNDFSSIIASDANQTQSSDTRNSENQAAPENDNPFSSWSDFRSSGTHTG 664
K ISNDIS+ WNDFSS++ + NQ Q+S+T S++QA E+DN F+SW +FRSSG T
Sbjct: 604 KAISNDISSGAWNDFSSLMVTHTNQPQTSNTSTSDDQAVLEDDNIFNSWPEFRSSGVQTK 663
Query: 665 NGFLGPDDNNANSDQISFQKDPNSLIKVEGNSTFSSSGDIGGSTSILEPQLQAAVAGSPE 724
+ L +D A+S+Q+SFQK P+SL VE NSTFSS GD GG+ SILE QL AA AGS E
Sbjct: 664 STSLQSNDTKASSNQVSFQKAPSSLKLVEDNSTFSSWGDFGGTASILEQQLHAAGAGSSE 723
Query: 725 NILSS-GDGYLFSGFSNSGKQFISQPDNQLRFDGVAIHNGESAPESDDPFGAWSDFTSSS 784
N LS+ GDG+L+ G SN KQ SQP +QL+ GV I + +S E+DD F +W+DFTSSS
Sbjct: 724 NKLSNGGDGFLY-GLSNIAKQSSSQPADQLQCHGVTIFDDKSTCENDDLFDSWTDFTSSS 783
Query: 785 NMQANLSSSANAPDDKEAVK-GDLFDAWNVIVSSPNMVTGKDTNVTMVNTPELNHFSPNM 844
Q ++SSS AP DKEAV+ G+LFDAW +I +SPN+ KD V++ TPE N S N+
Sbjct: 784 TAQTSISSSVKAPIDKEAVQHGNLFDAWKLIDNSPNVQADKDIAVSVGQTPETNLLSTNI 843
Query: 845 KSQYAGPNRSIPLDLNVGMFTHQNGSAPDGFSPPEGPDLVRSQRGDVNSDCYVDGMNLIE 904
QY GPN S+ DL GM + QNG AP PE PDL R + GDV +D YVD +N
Sbjct: 844 NLQYVGPNSSLQPDLCAGMSSSQNGLAPVALLSPEVPDLARMEHGDVKADGYVDRLNPSG 903
Query: 905 SSSKEALNVMSV-AETLISEMHDLSFMLESSLSIPNS 936
+SKE L + S AETLISEMH+LSFMLES LSIPNS
Sbjct: 904 MNSKETLGLKSADAETLISEMHNLSFMLESGLSIPNS 931
BLAST of Spo26617.1 vs. UniProtKB/TrEMBL
Match:
D7TEB2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0059g01030 PE=4 SV=1)
HSP 1 Score: 161.0 bits (406), Expect = 7.000e-36
Identity = 134/410 (32.68%), Positives = 205/410 (50.00%), Query Frame = 1
Query: 10 NLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGELLRG 69
+L QVQILL+K+A S + +D SL + PS+ ++ +A PS LRC NC G L RG
Sbjct: 10 DLVRQVQILLRKDANLSSYDPHDPSLPNLPST--DQVIAEFDPSPPHLRCANCRGRLPRG 69
Query: 70 SDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFGDRGRS 129
SIICVYCG ++ + +PI FKN+ F+W L++LDLDG E++G P + RGRS
Sbjct: 70 LQSIICVYCGLEQKGEVAPEPILFKNTAGFRWLLETLDLDGSEALG-PSTVVKEASRGRS 129
Query: 130 SIKEDEITLSDFLNLQLKWSDDSREF--GVESEERLPGKSYLSLIGVDIEDISGQESIDN 189
++K DE +LSD L+L++KW+ +S + GV +E + KS L+L GVD+++ + D
Sbjct: 130 ALK-DEGSLSDLLDLEIKWTSESEKLGAGVSNEASVRSKSPLNLAGVDLDNFLSEARRDT 189
Query: 190 GFIVRNEHVVVSRKFGNEES-GISGKRSLSLFENTQFTEAASKSGEGENTESFSDWGAEF 249
E +++ ++ES + G +LSLFEN +E + E +N+ +FS W AEF
Sbjct: 190 VIKASEEQFAATKEIRSKESNALQGHENLSLFENVHPSETVVRPAEDKNSAAFSGWEAEF 249
Query: 250 QSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVEGGGKAT 309
Q+A ES E S D GS +L S++++ V G GK
Sbjct: 250 QNANSESVHEGSKE---FDPFVGSTV----DLSSHMDA--------------VFGSGKDI 309
Query: 310 EPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSEPSSAAVTG 369
S D +PA+ + + ++ K G DS Q+E +
Sbjct: 310 NSAHVSDDTTPASRTNDWIQD--DLYKNLNSKVPAHVGQV-----DSTIQAEDAQNLAGP 369
Query: 370 TDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSSTRGD 417
+ + + F DQ+ N + I + +AWN F +SST D
Sbjct: 370 SSTRNDWFQDDQWKNSSAKSTDNKIALGKND--NLFDAWNDFPSSSTSQD 385
BLAST of Spo26617.1 vs. UniProtKB/TrEMBL
Match:
M5X5N7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003889mg PE=4 SV=1)
HSP 1 Score: 150.6 bits (379), Expect = 9.500e-33
Identity = 125/419 (29.83%), Positives = 205/419 (48.93%), Query Frame = 1
Query: 7 VSPNLFDQVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGEL 66
+ ++ +VQI L++EA S + +D L + PS EE +A PS LRCK+C G L
Sbjct: 5 IPTDMIKEVQISLRREAKMSSYDPDDTPLPNLPS--VEETIADLDPSPPYLRCKHCKGRL 64
Query: 67 LRGSDSIICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFGDR 126
LRG S+IC +CG +D PI+F+++F ++W LKSL LDG E V +P ++F +R
Sbjct: 65 LRGVQSLICAFCGREHCKDLPPDPINFRDTFGYRWLLKSLSLDGSEIVELPTGADEF-NR 124
Query: 127 GRSSIKEDEITLSDFLNLQLKWSDDSREFGVESEERLP--GKSYLSLIGVDIEDISGQES 186
G+++ ++D+++LSD LNL++KW+ + + P KS L GV++++ +
Sbjct: 125 GQTA-RKDDLSLSDLLNLEIKWTSKPEKVETDFSNETPTQPKSLPDLAGVNLDNFFSEGK 184
Query: 187 IDNGFIVRNEHVVVS--RKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFSDW 246
D + E + S + G E + + +LSLFEN Q E +S EGE+ +SFS W
Sbjct: 185 KDAAVNISEEQLFESSTQTTGEEINAFEVRETLSLFENVQPFETVVESTEGESGDSFSGW 244
Query: 247 GAEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNILVEG- 306
A FQSA E + A+ L A S ++ +S+N+ + + PF D + ++
Sbjct: 245 AANFQSAASE--TLPHASETLPHA-SENLHQASENIPQESKVIDPFVGSTVDLSAHIDTV 304
Query: 307 GGKATEPVDPSFDWSPANEQPTMLD-------NSPNIEKRDGDDSFCDWGDFRMSINDSI 366
G A D + S P D N G + F + + I +++
Sbjct: 305 FGSAVHSTDEKSNHSMTGSAPLTTDWFRGDLLGVSNSGFAGGPEQFETLAEVK-GITENV 364
Query: 367 TQSEPSSAAVTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPTDSFKGPDNAWNSFTTSST 414
S P+ D+ + +++ DN T TD + +AWN F TS++
Sbjct: 365 NNSFPADVDRV-QDNQLQTTSNNAPDNKT-----------TDEDEDSFDAWNDFATSNS 403
BLAST of Spo26617.1 vs. UniProtKB/TrEMBL
Match:
B9STL3_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1016710 PE=4 SV=1)
HSP 1 Score: 150.6 bits (379), Expect = 9.500e-33
Identity = 121/398 (30.40%), Positives = 198/398 (49.75%), Query Frame = 1
Query: 14 QVQILLKKEAGFDSINLNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGELLRGSDSI 73
++QI L+KEAG S + D SL + PS ++A++ +PS LRCK+C+G LLRG +S+
Sbjct: 12 ELQISLRKEAGLASYDPEDPSLPNLPS--LQDAISELEPSPSYLRCKSCNGRLLRGVNSV 71
Query: 74 ICVYCGHARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFGDRGRSSIKE 133
ICV+CG + +D PI F ++F +WF SLDLDG E V P + +RG+++ E
Sbjct: 72 ICVFCGRQQNKDVPPDPIKFTSTFGCRWFFHSLDLDGSELV-TPSAEANESNRGQNT-PE 131
Query: 134 DEITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDISGQESIDN-GFIVR 193
I LSD LNL+++W + EF + E+ P L+ G+DI++ + +D+
Sbjct: 132 IHIPLSDLLNLEIRWPSEPEEFETSALEKKP-IQMLNFSGIDIDNYFTESKLDSVSTSAE 191
Query: 194 NEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFSDWGAEFQSAVLE 253
+ + + E + G +LSLFE+ + +E A++S + E+ +SFS W A+FQS+ +
Sbjct: 192 GQFTLKQHEDAAENNAFQGHENLSLFESVEPSETAARSKKDESGDSFSGWEADFQSSGAK 251
Query: 254 SGSEKSANSLLSDALSGSMTIS----SDNLW---SNLESVQPFNIRVSDGNILVEGGGKA 313
+ +KS D GS ++ D L+ SNL + + S N+
Sbjct: 252 TQHQKSN---FPDPFVGSSSVDLSSHMDALFGPGSNLSNEKTKENVTSASNMNDWFERDT 311
Query: 314 TEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSF--CDWGDFRMSINDSITQSEPSSAA 373
+ + + + + DN G+ S DW + D+ Q+ SS
Sbjct: 312 SSNANAGVAFQNDQFEVPVSDNRDGTVGNTGNSSSMNVDW------VQDNQWQTSSSSRK 371
Query: 374 VTGTDSSSRLFTSDQFDNFTGFVVSKSIQFPT-DSFKG 401
T D + D FD + F S ++Q P+ +S KG
Sbjct: 372 ATDNDEN-----DDSFDTWNDFTSSSNVQVPSNNSLKG 390
BLAST of Spo26617.1 vs. TAIR (Arabidopsis)
Match:
AT4G20720.1 (dentin sialophosphoprotein-related)
HSP 1 Score: 123.6 bits (309), Expect = 6.300e-28
Identity = 147/569 (25.83%), Positives = 260/569 (45.69%), Query Frame = 1
Query: 7 VSPNLFDQVQILLKKEAGFDSIN-LNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGE 66
+S +L +Q+++ L+KEA S++ +D S S P+S EEA+A S+ LRC+NC G+
Sbjct: 5 ISVDLINQLKVSLRKEAKLTSVDDCSDSSFPSLPTS--EEAIAELDASAPYLRCRNCKGK 64
Query: 67 LLRGSDSIICVYCGH-ARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFG 126
LLRG +S+ICV+CG+ R D PI F ++ A++WFL SL+LDG E V P+++
Sbjct: 65 LLRGIESLICVFCGNQQRTSDNPPDPIKFTSTSAYKWFLTSLNLDGSEMVE-PLKETDGS 124
Query: 127 DRGRSSIKEDE-ITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDISGQE 186
RG + + I LS FL+L+++WS + E + + + K+ L+L G++++D +
Sbjct: 125 SRGATKAPPSKGIALSKFLDLEIQWS--ALEEKSDDGQSVQKKNPLNLGGINLDDYFVER 184
Query: 187 SIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFSDWG 246
D + + E V E+ RSLSLF++ + ++ S + +N F
Sbjct: 185 RGDLSKVEQAESKPV------EDDDFKDPRSLSLFDSVK-SQGVVGSQQHDNVGLFDKKD 244
Query: 247 AEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNI-LVEGG 306
A +V+ SG ++ + S++ ++ + E N D N+ L EG
Sbjct: 245 A--PKSVVSSGEHENLSLFAGRDAQESVSFAAQGNFGFFEEKDARNSFKEDENLSLFEGK 304
Query: 307 GKA----TEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSE 366
A + VD SF + + + D+SF + + + +S ++ +
Sbjct: 305 EDAQRTSSSKVDESFGFFEGKD-------AQRTSSSKDDESFGMFEGKKDAQRNSSSKED 364
Query: 367 PSSAAVTGTDSSSRLFTSDQFDNFTGF----VVSKSIQFPTDSFKGPDNAWNSFTTSSTR 426
S G + + R +S + +NF F + + ++ D + W+S S+ +
Sbjct: 365 ESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQ 424
Query: 427 GDIQSQINSDKTEEHAGIGDPAMDGENDLAP-LDVTYFEHEGINVERTQNKISDLGDSIG 486
Q +I+ GDP + DLA +D + + + + + + G
Sbjct: 425 NLSQKKID----------GDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAG 484
Query: 487 SWNNDVHLGIVGGLDQLQGR--ESKSPASRTTNNLHNSFDLWNDFKGSD----NQHQAIT 546
W D G V G Q K+ N ++S D+ D+ G D N+ ++I
Sbjct: 485 DWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDI--DWIGDDLWQTNEKKSIE 535
Query: 547 KEEADNKISTEDVNRCDEWNDFAGPSVGK 557
K D ++ +D D+WNDFA + K
Sbjct: 545 KTPTD--VNDDD---DDDWNDFASSANSK 535
BLAST of Spo26617.1 vs. TAIR (Arabidopsis)
Match:
AT1G05090.1 (dentin sialophosphoprotein-related)
HSP 1 Score: 119.4 bits (298), Expect = 1.200e-26
Identity = 147/569 (25.83%), Positives = 252/569 (44.29%), Query Frame = 1
Query: 7 VSPNLFDQVQILLKKEAGFDSIN-LNDHSLDSPPSSFFEEAVAGSKPSSDALRCKNCSGE 66
+S +L +Q+++ L+KEA S++ +D S S P+S EEA+A S+ LRC+NC G+
Sbjct: 5 ISVDLINQLKVSLRKEAKLTSVDDCSDSSFPSLPTS--EEAIAELDASAPYLRCRNCKGK 64
Query: 67 LLRGSDSIICVYCGH-ARQQDFFRQPISFKNSFAFQWFLKSLDLDGFESVGIPVRDNQFG 126
LLRG +S+ICV+CG+ R D PI F ++ A++WFL SL+LDG E V P+++
Sbjct: 65 LLRGIESLICVFCGNQQRTSDNPPDPIKFTSTSAYKWFLTSLNLDGSEMVE-PLKETDGS 124
Query: 127 DRGRSSIKEDE-ITLSDFLNLQLKWSDDSREFGVESEERLPGKSYLSLIGVDIEDISGQE 186
RG + + I LS FL+L+++WS + E + + + K+ L+L G++++D +
Sbjct: 125 SRGATKAPPSKGIALSKFLDLEIQWS--ALEEKSDDGQSVQKKNPLNLGGINLDDYFVER 184
Query: 187 SIDNGFIVRNEHVVVSRKFGNEESGISGKRSLSLFENTQFTEAASKSGEGENTESFSDWG 246
D + + E V E+ RSLSLF++
Sbjct: 185 RGDLSKVEQAESKPV------EDDDFKDPRSLSLFDS----------------------- 244
Query: 247 AEFQSAVLESGSEKSANSLLSDALSGSMTISSDNLWSNLESVQPFNIRVSDGNI-LVEGG 306
+ Q V GS++ N L D ++ S NL + + D N+ L EG
Sbjct: 245 VKSQGVV---GSQQHDNVGLFDKKDAPKSVVSSGEHENLSLFAGRDAQEKDENLSLFEGK 304
Query: 307 GKA----TEPVDPSFDWSPANEQPTMLDNSPNIEKRDGDDSFCDWGDFRMSINDSITQSE 366
A + VD SF + + + D+SF + + + +S ++ +
Sbjct: 305 EDAQRTSSSKVDESFGFFEGKD-------AQRTSSSKDDESFGMFEGKKDAQRNSSSKED 364
Query: 367 PSSAAVTGTDSSSRLFTSDQFDNFTGF----VVSKSIQFPTDSFKGPDNAWNSFTTSSTR 426
S G + + R +S + +NF F + + ++ D + W+S S+ +
Sbjct: 365 ESFGMFEGKEDAQRNSSSKENENFGFFEGAPLSNADLKSFDDKIVAASSDWDSDFQSADQ 424
Query: 427 GDIQSQINSDKTEEHAGIGDPAMDGENDLAP-LDVTYFEHEGINVERTQNKISDLGDSIG 486
Q +I+ GDP + DLA +D + + + + + + G
Sbjct: 425 NLSQKKID----------GDPFVSSPVDLAAHMDSVFGSGKDLLYAQPADSSTAYVSKAG 484
Query: 487 SWNNDVHLGIVGGLDQLQGR--ESKSPASRTTNNLHNSFDLWNDFKGSD----NQHQAIT 546
W D G V G Q K+ N ++S D+ D+ G D N+ ++I
Sbjct: 485 DWLQDDLFGNVTGEAQTNDSAVHDKNEGQIVGGNGNSSMDI--DWIGDDLWQTNEKKSIE 512
Query: 547 KEEADNKISTEDVNRCDEWNDFAGPSVGK 557
K D ++ +D D+WNDFA + K
Sbjct: 545 KTPTD--VNDDD---DDDWNDFASSANSK 512
The following BLAST results are available for this feature: