Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, polypeptide, CDS, three_prime_UTR
TCCAATCGGGCTGAAAGTTCTACGTAATAGTGTAATACGGAGATACAAAAAGAGGCCCGCCCGAAATGTCGTAGTGAATTTCAAAGCGTTTTCAAGTTCACTCCAAAATCACAACTACAGTAGAGCTCCATTTGCTTTGAAGCTTTACCGACTGTCTGAGCTTCTCATCCAACCTTAAAATTGATCTTCCATCTTCAAAGTTTGCATAAACCCTAAAACCCTCCGATAAACCCAAATTCTCCCCTTTTCACATTTTTAATCTCTCTGAAAATTCAATCTTCGCCCAAAAATGTTCACTCCTTCAACGACGAAGAAGAAATCGAACTACAGTACCTTCAAAAATCCAAATGCCACCCAGCAGAAGACGATATCCTTCGATTCTTCGCCGATAACTCCGCTTGCGGCGAATCGTAATTCCCTTACCGACGGTTCGATCCCTAATCGCCCCCAAACTGGCACTCCTGCTCCTTGGGCCTCTCGCCTATCTGTTCTTGCCAGGTCCCATCTAATTCCTTAACTATACATTTATAATTTCTGAGTTTTTTGTAGTTTTTGATGTCCCATTCTATCCTAGTATTGTATTTTAGTTTGGTAATTGGAAAATTGGCGGATAGTTTAACTTGTGACCAAATATTTCAAATGAAAATGAGATATTGCCTTCAGCATCGGCTGCTTTTGGGTAATAAGGATTTTTCTACTTTTAGCAACTTTTAAAGCTTGTAGTTCTGTCATAGACTCATTTCAAAAGCTTTATTTAGTTTCCATACTTCTACCCAAGGTAAGCATATAGGAATTACCTTACTCGATCGAGTAAGTATGCTTTTGTTTTGTTAAGTAGTTTATAATGACGGGTAACTGGTGCTTATGGACGATTATAGTTGGAACGGGGGTGGTTAGGTCATTTGTGCTTCCACGTATAAGGGCAGTGCGTTTCGTATACATAGGCGATACACCAGAGCAAAATTGTATTGTTGGAGCTTCTGAAATGAGCATGGTTGTGTTACAGTTAGAAAAGTTTGTATGTATGTTTGTATCGGAAGGGGGGTGGGGGGTATTTGGCCTTAGATATACTGAAGTAGTTGGCAACTCGGCATGTCATAAGTTGTTCAACACTGTTACTTAAGAAGCTGCATGTTAAAGCGAAGTGATAAAGTTTGTACTGATTACTTGGTGACTGGATTCCTGAACAAAGAGCAGTGCTGGAAATATTGCCGGATGATATATTTTCTCAAATGATGATGGTACACATTACAAATGTAGAATTTGATATGCTGATTTTTTGATGTTCTATTTGCATCGTATTTTAAATGTGTATACGCTTTTCTATCGTCCGTGTTTATTAAATAGCATTTTTTGATGGCCGTTGCTTCCCTCTTCTGTTTCTTTTCTAATTTAAGATAAGTTCTAACTTGGTGAGCATGATTCATGTTCTTGTCTTTCAAGTTGAATTTGTTTCCTGTTATAAAGGTGTAGCATTGGGGGTGACTACATGGTGCTCGGACTTAAAAATGAGTATTAGACTTGAGAGTTTTACGTTGATTTTTTTTTTACTTGAAGATATAGGGACAACACTTATAACAGTTACGAGATAGACACAAGGATAAGAATACTTTCTTTTGATGCATTGGTAAACGTAAGTAGGAAATCATAAAGTATGTGTGCCGTAGTTGATATAGGGCCTATAGGCACTAATATTTGAAAAGTATGCCCTGTTCCCATAGCAAAAAACCAGCAGTATTTCTAGCTTTATGTCTATTCTAACCTTTGTATACCATCTGAATCTTGAAATAGGGATCATTTTGGGTAGATTAATGTTTGTGGCTTGCTAACAGCATGAATTTGCATCTGTTTACTCAATGGTAATATCAATATTTGATGATAAGGATATAGGCCTGTGGAAGGGGCTTGTATTCAGAAGCAACTATCATTGTTGGACGCACTCTAGACGATATGCTTTAGTAAACGTAGGGCATTGTTTATAAGGGGAATTCATTTGTTTG
ATGGCTCTTATGTTGGGCCTTTTTAAGTTCATTGTTTTTTTCATCATCATCAGCCCAAGCTTTTGCTTTGATGCTCCTAGAGATTGTTTTTGTGTTGTTGAATTAACTGCCATTATCATTGGCAGTTTGAAACAGGACAGAGGTAGTGTTGGAAGTTCTTATTTTCTGTTTCAATGGCTTATTGTTGTGTGTTAATATGGCCAGAATCCCTCCAGCAAAGAAAAGTGAAGACAATAGAGAATCAGCAGAACCTGTGTATGTTGGAGACTTTCCACAAGCTGTTCATGATGAACAGGCAAATGTCATGCGGAATTCTGTCCCAGGTTAAAGATATAAGTTTGGTTTTTTAGTACAACAAACATTGCCTTTTTGGGCTGCCTTTTGGGAGGTTGAATGTTTTCTGAACAGGACACTGTTGGTGTTGGCAGGTGATGCATGCATTGCTGGTGGGATGGACAAGGAATATGGTTTGTCTTGGATGATTTGTGGAAGTAAACTTTTTGTATGGAGTTACCTATCACCTGCTGCGTCGAAGAGATGTGTGATTCTTGATCTCCCATCAGATGTTTCCGAAATCAGCAATAGAAATGCATACCTTGGTGATACTTGGTTGCTTTGTCTCATCAATTGGGAAAATGTTCACCAGAGCAGCAGTTTTGTAAAGCAGTTGACCTCAGCTGGAGTCGTTCTATGCAATCAAAGAAGTCGAGCTATTGTGTATTGGCATGATATATATTCAGATGACAGGGTTACTCCTAGAATCAGTCTTGCATCTCCTGAAGAGCTGGGAATTTCTTCAATGGGGAATGGAAAAAGTCCTCCGACCCGATTGCAATCAGACAACAGGCTAGAAAGCCTTTCATATAATCAGAGTGTTTTTAATTCCTTGATTGTCTCTGCAATCCCTTCCAATTACCATGCTTGTGTTGCACTTGCCACTAGTTCCAACGGTGAATTCTGGCAATTTATTTGCAGTCCTGCTGGAGTTCTCCGTCAGAATGTATGTCATATCTTGTCAACTTCTATTAGTGGCTGTAGTCATCCTTCTGTGAGTAAGGGGTATCCAAGATCTCTTTTGTGGCGTTTTCCATCCCTACAGATGGAAAATTCACATAGGCAGTTTTTCTTATTAACTGATCATCAAATACAGTGCTTCAGCATTGAACTTTCACCTAATCCAACTTTGTCGAAACTCTGGTCCCATGAGATCATCGGCACTGACAATGATCTAGGAATTAAGAAAGACTTGGCGGGTCAGAAGAAAATATGGCCTTTGGATATTCAGGTAGATGAACGTGGTAAAGAATTAACCATTCTTGTTGCTATATTTTGCAAGGATCGTCTAAGCAGTTCAAGCTACACACAGTATTCTCTTTTGACTATGCAATACAAATCTGGTTTAGATGAAACCTTAAAGGGCTTTACGTATGGAAGCGTGTTAGAGAAGAAGTCCCCTGTACAAGTTATTATTCCGAAGGCAAGAGTGGAAGAGGAGGAGTTCTTACTTTCAATGAGACTGAGAGCTGGGGGAAAACCTTCAGGATCGGCTGTCATACTCTCCAGAGATGGAACTGCAACTGTCTCCTATTTTTGGAGAAACGCAACCAGGCTTTACCAATTTGATCTCCCGTATGATGCTGGAAAAGTTATTGATGCTTCCGTATTTCCTTCTACTGATGATACCGAGGAGGGTGCCTGGGTTGTACTAACTGAGAAAGCCGGAGTCTGGGCTATACCCGAGAAGGCTGTTTTGTTTGGTGGAGTTGAACCTCCGGAGCGAAGCTTGTCACGCAAAGGAAGCTCGAATGAAAAGACTAGTGGGGAAGAAAAAAACCTTCTGTTTTCAGCTGGTGTTTCTCCTGGGAAAGCTGGTGTAGATGCTCGGGATATTGATGCTATGCAGAGGGCTGGGTTTACCGGGGTTGTTGGCAGGAATGCTCAAGATGAAGAAGCAGAGGCCTTACTGAATCGCCTGTTTCATGATTTTCTCTTGTCAGGCC
AGGTTGATGGGTTTCTTGAAAAGCTGAATAATGCTAGGGCATTTGAGAGGGATGCAGAAACTAATGCGTTTGCTCGGGCAAGCAAATCAATTGTTGATTCGCTGGCTAAGCATTGGACAACTACCAGGGGTGCTGAAATTGTTTCCTTGGCTGTTGTATCGAACCAACTTGCAGAGAAGCAACAGAAACATGAAAAATTTCTTCAGTTTCTGGCTTTATCCAAGTGCCATGAAGAATTATGTATGAAACAAAGTAAGTATGTTTCTTCTGATATACATTTCTGGAAAAAAAAAATAACACAAGAAGATTCATCTGATTTATCTAAGCAGGTAAACAATAATTAACAATGAATTTCTGGAAATAAATTTTATGCTATTACCTGCCATTTTCCAAGGAGTACTTGGATAAAAAGTGAAGACTTCAAATTCTACGGAATTCGGTAATATGTATTCAGATCTCTTAGGTGATTGAAGATTACTGCTTTTGTGTGTGTGTGTGTGTGTGTGTGTTTGTCTTTTCTTCGTTGTATTGTCTAATGTATTTGTGAGTGAAAATGTGGATTACTTGTTTTTTCTTTCTTCCTTTTGCTTTCTCTGAGCACCAAGACACCAACTCTGCACAGTCTAATTTTTCTCGACTGACAAGTGCACCATCAATAATCTGTTTTCTAGGGCATGCTCTCCAAATTATTTTGGAAGATGGTGAAAAGCTGACTGGCATTATTCAGCTTAGGGAACTGCAAAATATGATCAATCAGAGCCGTTCAATTGGAGTCAGCCCCCAGCTTTCAAGTGCCAATGGGGAAGTTTCAGGTTCTTTATGGGATTTAATTCAGATAGTTGGTGAGAGAGCCCGCAGAAATACCGTTCTTTTGATGGATAGGGATAATGCTGAAGTGTTCTACAGTAAAATCTCTAATCTCGAAGAGGTATTTTATTGCTTGGACAGGCACTTGGATCACATTGTAAATGTGGGTCAGCCATTTAAGGTTCAGGTCCAGCGAGTTTGTGAACTGTCAAAAGCATGTGTGACTTTACTCCGCTGTGCTATGCACTATAAAAATGAGCATCATATGTGGTATCCACCTCCTGATGGCTTGGTGCCTTGGTATAGCCACCCTGTTGTGCGTGATGGGCTTTGGAGCATTGCGTCTTTTATGATTAAAATTTTGGATGATTCATCTGGAATTGATTTGTCTTCTAAGTCAGATATATGTTCTCATCTGGAGGTTCTAGCTGATGTAGTACTAGAGGCATATGTTGGTGCTATCACTGCGAAAGTTGAGCGCGGGGAAGAGCATAAAGGCCTCTCAGATGAGTATTGGAGAAAGAGGGATACACTTCTTGACTCCCTCTATCAACAGATTAAGGTTTTTGTGAATGCTAGATTCCAGGTTTGGCTTTTCTCTTCTTTACATATAAAGTTAGTATTGCCAACCAATGTAGTGTTGTTAGATGCCTATTTCCAGTTCTGTCTCTTCTTGGAAAGTAAGACCTTTTTTTAAAAGAGAAGCATAAACAAGGAAGGACAAGGAACTTCCCATGTCCTTTACTAAAAAATGTCCATGAAGACTACACATGATCAATGGAAGATGTATACTTCCCGCCTGTAGAAATGGCAAACAGGCGGGGTGGGTGTGGGTACTCCCTCCCCCATACAAGCCAATAGTTTCTACACCTGCCACCTACCTGCTACCCATGACGGGTACAAATATAAAATCCATGGCTGGTATAAGCTAAAATCCACACTCACTCTACCACCTAAGTGGATATCCACAACCAACTCTTACCCATCTATTACCTTCCATTTAAATAAGAAAATTTGACATGTCAATGCAGCGTATGACATGACATAAGACTGATAAGATTACCATACAAAACGATGAGAAGCTTAAGTTACTTCATTTTTAACGGTACAAAAGAAGGACAAGAGCCTAAAGCTACTGCATCAATGGGCAAAGAATATGAACTTCAATTAAAATTCAATTATGAAATATAAAT
CTAAAGTATTAAACTTTTAATCGAGAAGCTTCAACCAAATCTTAACAAAATCAATCTAAAAAACGGCAACCATTACCCAATGGGATGAAGAACGTCAATCGAATTACAAAATAATTTAAAGAACTCAAATATAATAAAGTACAAAAGATGAATTTAATCTACTTGTTTCCTTCAGTTTATCTCTTACTCTATACTATACTTGTTTACTGCGAAGTCCTTCATCCACCACCGCCACAAATACATAATCCAATTAAGCATGGATTGAAGCAAATCTTTACGATCTCGCAAATCAAAAACAAAGATTCAGAGTTCCCCCATTAAAACCATTAGGACACAGTGGGAAAAGCTTGCAAAGAAACTGAGAAAGTTCTTGGGGTTTAAAAGGAAAAGGTTACTGAGTCTTTGAGATAGAAAGGGAGGCGTGAATGATTGTTGTTTTAGTTAGGGTTTTGAGTTTTGAGTTTTGAGTTTTGAGTTTTGAGTTTTGAGAGTATTGTTTATATTGGCTTATGTATGGGCTCTCAGTCTCTCACAGCCAGGGTATTAGAACACAAGAGATTTAGTTAAGTCTTAGATTTGGAGTTGAATAAATGAAAAATATTATAAAAAAGTAATAATATAATGCGGGTACGGTTGAGAGGATGGATGCGGGGCCAATCAAGCGTGTATGGGGCGGGTATAACCTAAAATTCTCCACCTACCAACTACCCATGGCAGGTATGCCATTTTATATCCATNGCGGGTATAACCTAAAATTCTCCACCTACCAACTACCCATGGCAGGTATGCCATTTTATATCCATACCCGCCTAGTCCTACTACCCCTTAAAGTCATATCCATACCCGCAACGCTTATGTATGGGCTCTCATAGACTGATGTTTATGAGCTACTTTCCTAAAATGGACCTGAATCTCCAGAAGGGACTACATTGTGTGGATTATTGCGTGGTTAGATTGCCAAAAAAGGCAGGGGAATCGAGATCTGCCCATACGCTTGTTCCACCTTTGGAAAATATCCCCTCATTTTCAGATGTTGTTGGAATAACTCAAGCTTCGAAATTGCCCCAAACTTGCTACTACTCCCTCCGTATTTATTTAAGGGATACACTTGCTTTTTCCGGCCGTATTTATTTAAGAGATACACTTGCCATTTTTAGTAATTTATCAATCCCACTATCTAATTAAATAATCTATCTAATATATCATATGTCTCCCACCCTATTAAACAAATAATTTCAGAATTACACCCCACCCTCCACCCCCTTAAAATGACATGGTCCCCACTTGTTTATCTATTAAAATATCTACCCAACCCCACTTGTTTTATTACTTTATTTCATCCAATTTTTTTTCTTAATACTCGTGTCCGGCCAAGTGTATCCCTTAAATAAATACGGAGGGAGTATTACTTTATATTTACCTTCTTCATATACAAAACAACAAGGCAACGAAAACTACTATTGTATTTGCTGACAAATCTTCATAATTAGCAAGTTCGTATGTGTTAGNGGGGGGGGGGGGGGGGGAGGGGGGTTAGGTGAAGTGTATCTGTATATCTATTAGGTAAATTTGTATTTCCATGTCGAGTCACCTTGAAAACTCGTGGCCCAGTGAATTTCATGAAAGCACAACTCTTATGTTGTGATGATGGTGTATTCTTGTGAAATTTGTTATTATAGGTTATTAGAGTGAACTCATTGTGGTCTCTTTTCCTTTCTTCTTCATTTTTCTCATTGTGAACTCATTGTGGTCTCATTGTGGTCTCAATATCCACTACTTGTTAAAGAATAATAAAGCCATCCATATTCACCTAACATATCCACTACTTGTTAAAGAATAATAAAGCCATCCATATTCACCTAAGGATGTTTCCTGATGGTGGATGATTTTCTTGATAGGTCGTTTCATGGTCTGAGATTCAAGAGTAGCTAATTTCTGAGGGGGATAGGTGGCTGAAAGGAAAAAAAAATACTACAGTACATGTCCCCCTTCTGGAATAG
TGACTAGTGAGTCAATAGAAAAAAATTACTGGAGAAGGTAGTTCTCTTTGTTTTATAATTCTCTTTTCTCAGTTTTCTTGGGGTTAGGGGTTGCAGAATGTGCTCTTTGTTTGTGGACCTTGTGAGCTGTCTTCTACTGAGTTTGTGACTTTGTGATTGTCCTCATTTCAAAGTTGATTTGTCTGTGGTGAGAGTTAGCACGAATTTTGAATTTGATAGCTTTTTTCATTTCCTCCCCGTCATATTTGTTTCAACTGTTGCACCTATCTAACTTCTTCTTTAATGCAATTGAATAGCCTGAAAATAGCAACCTGGGTAAGTTATTTGTTCTTTTTTTATCCGCAAGTTGAGGTCGACATTCACACCCCAGTTTGTGTTTGTGCTGCCATATCTATGTTTATCAAGATATTTGGTCGGTATAGGCATTGGTTTAGTCCCCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCCTAATTTCTGAAAATAACAAAATGAGTGTCAAGACTGCAGACTTGATTTTGATAAACTTTGTATTCTGCAGGATTCAGTTGTTGAGAAAGAACAGAAAGAAGTAATTCTTAGGGAATTATCGTCAAATTTGTTGTCTGTTGCAAGGCGGCATGAAGCCTACTGTACTCTCTGGAACATTTGCTGTGATCTCAATGATTCTACTCTTCTTAAACAGCTTATGGTACAGTCTTTGTTTCCAGTTCAGTGGAAAATTTCTGTTAAATTGCTGAATCTCTGCACTTATTGCAGTAACTCCTTCTGCTTCTTTTTATTGTTTAAGTTTTCTATGATGTAGAAGATATATATGGCATTTCCCCTCATGATTATGCTTTGTAACATTGTTAAGTGGCCAGGTGGGATACTTATCTTTCTTCACTTTGAGAGAACTTTTTGCTCCCTTTTTTTTTCAAAACAATTTCTTTCTAAGTGTTGCTTAAAATATGGAGAAGCAATGTACCTCAAATATCCTAAATTTTTGTACAAAAGGATTTAAATCATAAAGGTATTTATTTCTTCCTGTCTTTATACTAAGTTGGACATTTTTAAGTGGGTCCCTGACTTCCTGTACCTCCAAGTTAAGATTTCTCCATACTGGACACATGGAGTTGGATTCATACCCCAATCAAACACCTGTTTAAGTCTGAGCATCATAAATTAGGATTAAAGGTATGTGTTAACTGATCCCAAAACTGTCTGAGACATCTGTTATCTTATGAGAATCTTGATTTGTGCTGATTTCTTTTGATAAGCAATATAGCATTCATTGGCTGCGTGTTTCCTTTTTATTAGTATTATAGCATAAAAGCAGCTTCACCTGCCAATTTGTTGAACTTCTCCCTTGTGCGTTCTGGATCCTGATTAATATGTCTGCAGCTCTAATAAACTGCTTGTTGTTCTTTATCCAAAAAAAACTGCTTGATATTTGAGCTTGCAACTGCTGTGATATTTCTGTTTGAGGACTTCATTGTTGTTAAATAGAAACCCAGAGATCGAAGATGCTTGTTTGGTCTCGAAAAAGAGCTTTTTATATAACAGTGATCTTCTATCTCTTGCATATAGTGTTTGTCAGGTGACAGAGACCTTACTAATGATTTTCCTTGTGCAGCACAAAAGCATGGGAGCAAGAGGAGGGTTCAGTTATTTTGTTTTCAAGCAAATGTATGAGAATGGACAATTTCCAAAACTTCTGAGGCTAGGGGAAGAGTTTCCTCAGGAGCTTGAAACTTTCCTAAAGGACCACCCGGATCTCCTATGGTTTCATGAAGTTTTTCTTCATCAATACTCTGGTGCTTCAAAAACTCTTCATGGTTTGGCACTTTCTGGAGATGAAAGATCGTTTTCAGCTGCTGGAAAGGAGGCAGAACTAGATTCTGGAACAAAAGACTTAACATTGCCTGAAAGGAAGCGTCTCTTACATCTGTCAAAGATAGCTGCTATTGCAGGTTTGTTGTGTCTTGCTTTGTACAACACTT
TGCTCTTGTTTTAGTAATTTTTAAGTGCAGCACTCCATCTAGTTGTTTATATCTACGGAACTCGTTAAAGTGATGCAATTAGAAGCAGATCTCTGCTGCTGGGTATATAACTGTGGTCCGTGGAGAATAAGTCCACTGATGCTGCTAGCTGGGTAAAATTTTCTCCAATTGAGTCTCCAGACAGGCCATTGGTTATTGAAAGTAGCAAAATCTGTTGTTGCCAGCCACGATAAATTTTTCCGGCGCCACTGTACTATTTCCAACTCCTAGTGGCATGCATCATTTTTTCCTCTCACCTTTGGTTAAACCATCTAAGAAGATTTTTCAGAGTATATATTGGATTTGTAAAGTTGCTGATTGAAAGCTGTTATGTACGAGGCAGCTACCTGGTGGCATTCCGTGTCCTGTAACCGTCCAAACGTAACCGTTATCAGAATTCATCCTGGGTAACACCCCCACTCAAATGGTTAGCGAGAACCTTAGCCAAAATCTTACAAATACACGAGAAAACTTAACAGAGACTAAAATCTGTTACATTCACTGACCTTTCCTTCATAGAAACTATTGTGATAAAAGTAGAGTTCATACTCTTACATCTTACACACTACCCTCAAGCAAGTCATTGTTAACCACATCCCAACACTGTTGTAAATATATCCTCTGTAAAAAACAGGAAACTTGTCAATTTCAAATAGCCTCTTTAGACTAAACTATGTTACTTGAACTCTTTTTTTCATCTCAATTACCCGTGTTGGACACTTCAAAACTTTATTTTGGGTCAAAATCATGAACACGGTTCACAAAATAGCCGAGTCGTGAACTTGGACACGTACTCGTCCAGACACGAGTATCCGAGTTTGAGTAACATAGAGCCTGGACACTATCATCTCCAAAAAGAAGGGGCTCCAACATAGATAAAAATGCGGTATCGGTCACGGTCGCGGAGTCTCGGTAACGGAAATGATGCGTTAGTCGCGGACCGTATCGCGGTTGTCGCGTAGGAATTTGAAAAAGAAGCCCCATGTCAACCTCAGTCACTACTCACTACCTACCCTAGTCTCCCTCCCCTAAACCGTACACGTGACATAATTAAATGAATTCCTTTNCCCATGTCAACCTCAGTCACTACTCACTACCTACCCTAGTCTCCCTCCCCTAAACCGTACACGTGACATAATTAAATGAATTCCTTTGTCTACCTTCTCTCTCCTCTCTTGTTTAGATCTGAAAATGGAATTCTCTACTCTATTTCAGTGGAGTCTCTCCATTTCTTTCCTTTTTCCCTTTCTTTTTGTCGAGAACATGGGGAATTCGGCTGAAAACCGAGTAGATGTGAGGCTAATAACGTTTAATGCGGACAAAATTCGTTCGGATCGGTTGAACCGGCCGAGATCTCGCCCTTTTTAAAAACACCTTTACAACTCGGGATATCTCGTATCGCCGGCCTCCAAAACCTAGTTAAATCGGGTGATACGAGTTAACTCGGGCGAGTTTTTACACCATGGGCTCCAATCTAGATCAACCTCTGCATCCATAAGAAGGAGATCACAGAGGATGAAGTTTCCTTAGAATTACCATCTTGCTCCCTGAAAGTATTCTCTCTCACAATTTTAAGCAACTGAATAAAATGTTTGCTTCTCCTCCAGTGATGAATGCCGCTCTTTTAAGAGGGATTGAAAGTGTGGGAGAGTTAAGGTATTGTTTCTAAGGGACACAAGGGCAAGTACATCTGTTTTAAGGGTCCTTTGTTTGGTTAATCAGCAATNTTTCTAAGGGACACAAGGGCAAGTACATCTGTTTTAAGGGTCCTTTGTTTGGTTAATCAGCAATTTTGTGATGGCACTCACAAATAAGAATAAGGTATGATTTATTAGAAAAGTGAAGCTTAAAAACCAAGTTTGAGGAGGTGTTTTGTTGAATATTTGGAGTTAGCATTACAACACTAAATTGGTAAGATAAGGTGATTCTAGTTCTAAGAATTTTTTTGAACGGTATTGTGG
GAGGACCGGAGGAGCACTTGCGTTAAAAAGTTGTAAGGTTGCTTTAGGGATGGTTTTACTTAAACTAAGGTTATCAGGAGAACTATGCAGTTTTAGTAGACATCTTTAGAAGACATGGCAGTTAGGAGTTGTAGGTCTAGGATGAGGGTTTTGTTAGGAACTTTGAAATTTTCACAGGTGGGGATAGGCTAGAGGGGGTGGAGGGACATGAAGAGTAGGGAAGGTGAGCAGTATTAGGTTTGGACGTTGGACAGAGCTAGGGCGCGGGCAATTGATGGCTTTAGTTGAAACTTTGCACTGCTTGTCGAATCCTATGTTTTTGAGGTTGTTGGGAAAAATATGAACTCCAACTACATCTCTCCTTTTCTCAAGGATCAATCTAAGTTGGGATAATTGGCCTGTATTTTTGTGCTAGAAGTTATGAGTTTTTCATGAAATTGCTAGCATAGTAGGTTAAGGGGTTGCTCTTCTGCATGATTATGTATTCCTACCGTGCTTTTCAGGGGAAGTCAGTTCTTGTCTTCTTGAATTAGGATTACTAGAATTGGGTGAGCGTATTGGTTTTTAAGCTATATCTTTATAAGGCTTACGAGAGCCCGAGTTGGGCCATTATTGACGTGGTGCGTGAGGAGGGGGCCCATCATACGGTGAGCATTAGATGCACTTGATAAAAGGTTCTCTTCTAATGTTTTGTTTTTAGTGACGGATACAGTCATACAGATTCCTAAATAAGTGACTTTTTTCATCAACAGGGGCTGTAGGAGGGGTCATATGTCGTATTTTCTCTTCAATGTAGTGTTTGATGTGTTCAGTAGGTTTCTAAAGAGGGGTGTGCAAGAGAAGATGTCCATGTTTCCCATTAAGAATTCAGATGATATAATTTTGTTCCTAGCTACTGTCTGGATGGTTTCAAGAAAGTCATCACAATTTAAGGAATTTTGAATAAATTCAGGGATGAGGGTCAATCTCTCTCTTTGTAATATTGATGGTTTAGATCAATATTTTCTTAAAAGATTTCACTAATTAACAGGGAAAAAAATCCACGGGGGAGGCATCCATGTTCGAAGTATTTTTGGGGGCACTGGGTTTGAAGTGGAATGAATTGATAAAAAGCTGTTTCTCCAAGGTGCAGGATGGGCTCACTAGTCAGTAGTCACTACAATCTTGTTTATCTAGTGTTCCTCTATACTGTCTTCCTCCTTTTTAGGACTCAAGTAGCTATTCCAACAAAAATTTGAGCATTTCATGAGTGATTTCGTGGAGTCAGGGATTGGTGGTGAAAGTTTTGTTTTGTAAGGTCATTTTGGTTTTCTGTGATGGGAGGATTGGGGGTCAGGGAGGTTGGCGTCAGAAAAAAACTGTCCAGAGTCCAAATAAAATTGGGTATCAAGAAAAAGTACGTGTTGTAAAGAAATGGGTTCGGCCTCTTTTTATTTATTTATTTATTATATTATATTTCTTATATTTTTTTATCCCTGTTGGTCAAGTTTAAGATAGGAAATGACTGGCATGTTACTTTTGATGGCAAAATAACTCAAGAGAAAGTTGTTTGTGGGAGCCTTAGTTTGCCTTTTTCACTGTTCTGAAGATGTTAGGGATACACTTCAGATTTTTGCAGTTGCCAATAAGAAGTCGTGGTGCAAGTTTTAATGTGTCCTGTGGTGCATTTGGCAAGAAAGAGACACAAGTGTCTTACTAGGACAGTAGGACCCCCTTTACCGTTCAACTTTTTGTGATAGGGATATAATGTTCAGAATTCATATTACCTGTTTATGTGAACTTAGTATCGTATATTACTTCTCTAATGGTCTGTGGAACAATGATATTTACAATCTGGCAGGTAAAGATGCTGTCTATGAGGCAGAAATAGAACGTATTGATGCAGACTTGAAGATTTTGAAGTCACAGGTATCTCCAGATAAACTTGTTTACTTTACATTACTGATAAAGTTTTAGGACAAATTTTGCGAGGCTTGATTAACCTGGTGCTTTTCATTTAT
AATTCTATTGAACATAGTTCTAAATATGGTGCTTTAATAAAGGTTTCCTTGTGAAAGCTGTTCCCACTGTTTCAAAATGCAGGACATCTGGACATGTTATGTTTTTAATCATTTTTTCACAATGTAAATCGATATTTAAAAACTATTGTAGTCCTTCCATTCCATATCAAGGTTAGACAAGTTTTCAAGAGGTGTTAATTACTACATCCAAGTAATAGTCCATTGGTGTTTTAGACAAACAATAGTCCATTGATGTTAGGTGACAGATAATGTGGAACAAAAAATGTTTGTTAAGTGTCAAGTATGATGGAATTGAGGGAATAATTATATTATTCAAAGCTACAACAATTTTATTGTCTTTTTCTTTCCCTATAGCCATTTAATTTAAATATTCTCTAATCTATGTGCTACACTGCTATGAAGTGCAATATAAAATGCTCTTTATTTTAAGTCAGAGATTATCATTTTCATGTAATTTTGATTGATAAGAGTGATAACTGACCGAGTTTAAGAGTGATTGTTTGGGGGAGGGTATAGTCAAATCATACTTTTACCAAAATTTGAGTTATGTCCTAGTGGGTTAAGATTAAACTCATAGCCTAGGGATCTTTCTTGTCTCTGTACTTCGATGTAACGTCACTGTGATCCATATTCTGAAACCTGCTATTCACATCAAATCCCGTCAACCCACAATCAAAGTTCTTGGAATGGTTGGTGTCAGTGATCAACACCATAAGAAGAGGTCATGCTGATTACCTATCAGAAAACAACCTTTTAGTCAGCCTATAAGCAATGCTAACATCATTTAAAACCAGTAGCTGTAAGTTTTATTTCTCCTCAGGAGAACAATTACTTGATAACATAGTGTCATCTGACTCATCTCCTAGCCATCTTAAGCATTTTTAATTGACTGCTTGATAACTTCTTGCCATTTCTCCATCCTCCCAAAAACAAGCAACGAAAAAAAGTTTTATTTTTGGACATAGTTGGATCCAGCAATTGTTGTTGTTTTGCTTTGTGCTTATGCCTCATGAATTTTTTTGCTTAAGCTGCAAGTAGATGGTTAAGTAATTCTGCCTATTGCAGGAGGATATCTTGAATCTGTATCCTGACGACTTAGAGAAGCAGAAAATTGGACGTCGACTCCTCCCTCCTTGGGATCTTATAAAATTGTGCCTTAAAGGTCAGACACCAGAGCTGTTATTAAGGGCTTTTGATGTGTTTGCTTGGACAAGCTCTTCCTTCCGCAAATCCAATAAAACACTTTTAGAAGAGTGCTGGAGAGGTGCAGTTGATCAGGATGACTGGCAAAAGCTCTATCAGTTTTCCATAGCGGAAGGTTGGAGTGATGAGGACACAATACGGGTTCTGCAGAAAACAATGCTTTTCCTGGCAGCAAGAAGGTGTTATGGACCGGAATCTGAAATGTATGATGGAGGTTTTGATGAAGTGTTACCATTGAGGCAAGAGAGCAGTGAGCTGCCAAATATGCAAGATTCTGGTTCTTCTGTTGAAGTGATTTTGATGCAGCATAAAGACTTCTCAGATGCTGGCAAGCTAATGCTGACTGCTCTGATGCGGGGAAGTGAACAAGTAGATATTGGTGGGGCAGACGGGCCAATTCCCATGGAGTAGTAGCAGTACGATGATTGTGATAATGTAGGAGAGCAGGTTTGTATCTGAGGGTTTATGATCCCTTCCAAAAAAATGAAAAAGAACTTTAGCATAGATTTAGATTTTACTGTAAATATGAGCATCCTCTAGTATCATTAAAGGCCTTGCAGACATACAGTTTTGCTGGAATCTAAAGATAACAAAAAAGAAGAAGTGAAGTTATTTGTTGGAGTAAAATATTGGTTTTATCCTTGAAAAACAAGAAAAATCAGAGGTTCCCCTTACGGCTGCACATAGTTCAGCCAAAGTTAGAAGACATCTACTGTGTACTGTGGAATAGCCAACATGTACCTTGTAGAGTTGTAGCACGTTATTATTATTTGAGT
GAAATGTACCTTGTAGCATGTTTACTATTTTTAGCTTTGAATTCTGATAGTCACAAAATTTGATCAAAATTCTATTTTATCGTTGCTATTTTATGTGGCATTTCAGTCACTGATGGCTTATCATCGAGTGTTTCCTGCAGTATGGTTAGAATGTGCAACGGGC
mRNA sequence
TCCAATCGGGCTGAAAGTTCTACGTAATAGTGTAATACGGAGATACAAAAAGAGGCCCGCCCGAAATGTCGTAGTGAATTTCAAAGCGTTTTCAAGTTCACTCCAAAATCACAACTACAGTAGAGCTCCATTTGCTTTGAAGCTTTACCGACTGTCTGAGCTTCTCATCCAACCTTAAAATTGATCTTCCATCTTCAAAGTTTGCATAAACCCTAAAACCCTCCGATAAACCCAAATTCTCCCCTTTTCACATTTTTAATCTCTCTGAAAATTCAATCTTCGCCCAAAAATGTTCACTCCTTCAACGACGAAGAAGAAATCGAACTACAGTACCTTCAAAAATCCAAATGCCACCCAGCAGAAGACGATATCCTTCGATTCTTCGCCGATAACTCCGCTTGCGGCGAATCGTAATTCCCTTACCGACGGTTCGATCCCTAATCGCCCCCAAACTGGCACTCCTGCTCCTTGGGCCTCTCGCCTATCTGTTCTTGCCAGAATCCCTCCAGCAAAGAAAAGTGAAGACAATAGAGAATCAGCAGAACCTGTGTATGTTGGAGACTTTCCACAAGCTGTTCATGATGAACAGGCAAATGTCATGCGGAATTCTGTCCCAGGTGATGCATGCATTGCTGGTGGGATGGACAAGGAATATGGTTTGTCTTGGATGATTTGTGGAAGTAAACTTTTTGTATGGAGTTACCTATCACCTGCTGCGTCGAAGAGATGTGTGATTCTTGATCTCCCATCAGATGTTTCCGAAATCAGCAATAGAAATGCATACCTTGGTGATACTTGGTTGCTTTGTCTCATCAATTGGGAAAATGTTCACCAGAGCAGCAGTTTTGTAAAGCAGTTGACCTCAGCTGGAGTCGTTCTATGCAATCAAAGAAGTCGAGCTATTGTGTATTGGCATGATATATATTCAGATGACAGGGTTACTCCTAGAATCAGTCTTGCATCTCCTGAAGAGCTGGGAATTTCTTCAATGGGGAATGGAAAAAGTCCTCCGACCCGATTGCAATCAGACAACAGGCTAGAAAGCCTTTCATATAATCAGAGTGTTTTTAATTCCTTGATTGTCTCTGCAATCCCTTCCAATTACCATGCTTGTGTTGCACTTGCCACTAGTTCCAACGGTGAATTCTGGCAATTTATTTGCAGTCCTGCTGGAGTTCTCCGTCAGAATGTATGTCATATCTTGTCAACTTCTATTAGTGGCTGTAGTCATCCTTCTGTGAGTAAGGGGTATCCAAGATCTCTTTTGTGGCGTTTTCCATCCCTACAGATGGAAAATTCACATAGGCAGTTTTTCTTATTAACTGATCATCAAATACAGTGCTTCAGCATTGAACTTTCACCTAATCCAACTTTGTCGAAACTCTGGTCCCATGAGATCATCGGCACTGACAATGATCTAGGAATTAAGAAAGACTTGGCGGGTCAGAAGAAAATATGGCCTTTGGATATTCAGGTAGATGAACGTGGTAAAGAATTAACCATTCTTGTTGCTATATTTTGCAAGGATCGTCTAAGCAGTTCAAGCTACACACAGTATTCTCTTTTGACTATGCAATACAAATCTGGTTTAGATGAAACCTTAAAGGGCTTTACGTATGGAAGCGTGTTAGAGAAGAAGTCCCCTGTACAAGTTATTATTCCGAAGGCAAGAGTGGAAGAGGAGGAGTTCTTACTTTCAATGAGACTGAGAGCTGGGGGAAAACCTTCAGGATCGGCTGTCATACTCTCCAGAGATGGAACTGCAACTGTCTCCTATTTTTGGAGAAACGCAACCAGGCTTTACCAATTTGATCTCCCGTATGATGCTGGAAAAGTTATTGATGCTTCCGTATTTCCTTCTACTGATGATACCGAGGAGGGTGCCTGGGTTGTACTAACTGAGAAAGCCGGAGTCTGGGCTATACCCGAGAAGGCTGTTTTGTTTGGTGGAGTTGAACCTCCGGAGCGAAGCTTGTCACGCAAAGGAAGCTCGAATGAAA
AGACTAGTGGGGAAGAAAAAAACCTTCTGTTTTCAGCTGGTGTTTCTCCTGGGAAAGCTGGTGTAGATGCTCGGGATATTGATGCTATGCAGAGGGCTGGGTTTACCGGGGTTGTTGGCAGGAATGCTCAAGATGAAGAAGCAGAGGCCTTACTGAATCGCCTGTTTCATGATTTTCTCTTGTCAGGCCAGGTTGATGGGTTTCTTGAAAAGCTGAATAATGCTAGGGCATTTGAGAGGGATGCAGAAACTAATGCGTTTGCTCGGGCAAGCAAATCAATTGTTGATTCGCTGGCTAAGCATTGGACAACTACCAGGGGTGCTGAAATTGTTTCCTTGGCTGTTGTATCGAACCAACTTGCAGAGAAGCAACAGAAACATGAAAAATTTCTTCAGTTTCTGGCTTTATCCAAGTGCCATGAAGAATTATGTATGAAACAAAGGCATGCTCTCCAAATTATTTTGGAAGATGGTGAAAAGCTGACTGGCATTATTCAGCTTAGGGAACTGCAAAATATGATCAATCAGAGCCGTTCAATTGGAGTCAGCCCCCAGCTTTCAAGTGCCAATGGGGAAGTTTCAGGTTCTTTATGGGATTTAATTCAGATAGTTGGTGAGAGAGCCCGCAGAAATACCGTTCTTTTGATGGATAGGGATAATGCTGAAGTGTTCTACAGTAAAATCTCTAATCTCGAAGAGGTATTTTATTGCTTGGACAGGCACTTGGATCACATTGTAAATGTGGGTCAGCCATTTAAGGTTCAGGTCCAGCGAGTTTGTGAACTGTCAAAAGCATGTGTGACTTTACTCCGCTGTGCTATGCACTATAAAAATGAGCATCATATGTGGTATCCACCTCCTGATGGCTTGGTGCCTTGGTATAGCCACCCTGTTGTGCGTGATGGGCTTTGGAGCATTGCGTCTTTTATGATTAAAATTTTGGATGATTCATCTGGAATTGATTTGTCTTCTAAGTCAGATATATGTTCTCATCTGGAGGTTCTAGCTGATGTAGTACTAGAGGCATATGTTGGTGCTATCACTGCGAAAGTTGAGCGCGGGGAAGAGCATAAAGGCCTCTCAGATGAGTATTGGAGAAAGAGGGATACACTTCTTGACTCCCTCTATCAACAGATTAAGGTTTTTGTGAATGCTAGATTCCAGGATTCAGTTGTTGAGAAAGAACAGAAAGAAGTAATTCTTAGGGAATTATCGTCAAATTTGTTGTCTGTTGCAAGGCGGCATGAAGCCTACTGTACTCTCTGGAACATTTGCTGTGATCTCAATGATTCTACTCTTCTTAAACAGCTTATGCACAAAAGCATGGGAGCAAGAGGAGGGTTCAGTTATTTTGTTTTCAAGCAAATGTATGAGAATGGACAATTTCCAAAACTTCTGAGGCTAGGGGAAGAGTTTCCTCAGGAGCTTGAAACTTTCCTAAAGGACCACCCGGATCTCCTATGGTTTCATGAAGTTTTTCTTCATCAATACTCTGGTGCTTCAAAAACTCTTCATGGTTTGGCACTTTCTGGAGATGAAAGATCGTTTTCAGCTGCTGGAAAGGAGGCAGAACTAGATTCTGGAACAAAAGACTTAACATTGCCTGAAAGGAAGCGTCTCTTACATCTGTCAAAGATAGCTGCTATTGCAGGTAAAGATGCTGTCTATGAGGCAGAAATAGAACGTATTGATGCAGACTTGAAGATTTTGAAGTCACAGGAGGATATCTTGAATCTGTATCCTGACGACTTAGAGAAGCAGAAAATTGGACGTCGACTCCTCCCTCCTTGGGATCTTATAAAATTGTGCCTTAAAGGTCAGACACCAGAGCTGTTATTAAGGGCTTTTGATGTGTTTGCTTGGACAAGCTCTTCCTTCCGCAAATCCAATAAAACACTTTTAGAAGAGTGCTGGAGAGGTGCAGTTGATCAGGATGACTGGCAAAAGCTCTATCAGTTTTCCATAGCGGAAGGTTGGAGTGATGAGGACACAATACGGGTT
CTGCAGAAAACAATGCTTTTCCTGGCAGCAAGAAGGTGTTATGGACCGGAATCTGAAATGTATGATGGAGGTTTTGATGAAGTGTTACCATTGAGGCAAGAGAGCAGTGAGCTGCCAAATATGCAAGATTCTGGTTCTTCTGTTGAAGTGATTTTGATGCAGCATAAAGACTTCTCAGATGCTGGCAAGCTAATGCTGACTGCTCTGATGCGGGGAAGTGAACAAGTAGATATTGGTGGGGCAGACGGGCCAATTCCCATGGAGTAGTAGCAGTACGATGATTGTGATAATGTAGGAGAGCAGGTTTGTATCTGAGGGTTTATGATCCCTTCCAAAAAAATGAAAAAGAACTTTAGCATAGATTTAGATTTTACTGTAAATATGAGCATCCTCTAGTATCATTAAAGGCCTTGCAGACATACAGTTTTGCTGGAATCTAAAGATAACAAAAAAGAAGAAGTGAAGTTATTTGTTGGAGTAAAATATTGGTTTTATCCTTGAAAAACAAGAAAAATCAGAGGTTCCCCTTACGGCTGCACATAGTTCAGCCAAAGTTAGAAGACATCTACTGTGTACTGTGGAATAGCCAACATGTACCTTGTAGAGTTGTAGCACGTTATTATTATTTGAGTGAAATGTACCTTGTAGCATGTTTACTATTTTTAGCTTTGAATTCTGATAGTCACAAAATTTGATCAAAATTCTATTTTATCGTTGCTATTTTATGTGGCATTTCAGTCACTGATGGCTTATCATCGAGTGTTTCCTGCAGTATGGTTAGAATGTGCAACGGGC
Coding sequence (CDS)
ATGTTCACTCCTTCAACGACGAAGAAGAAATCGAACTACAGTACCTTCAAAAATCCAAATGCCACCCAGCAGAAGACGATATCCTTCGATTCTTCGCCGATAACTCCGCTTGCGGCGAATCGTAATTCCCTTACCGACGGTTCGATCCCTAATCGCCCCCAAACTGGCACTCCTGCTCCTTGGGCCTCTCGCCTATCTGTTCTTGCCAGAATCCCTCCAGCAAAGAAAAGTGAAGACAATAGAGAATCAGCAGAACCTGTGTATGTTGGAGACTTTCCACAAGCTGTTCATGATGAACAGGCAAATGTCATGCGGAATTCTGTCCCAGGTGATGCATGCATTGCTGGTGGGATGGACAAGGAATATGGTTTGTCTTGGATGATTTGTGGAAGTAAACTTTTTGTATGGAGTTACCTATCACCTGCTGCGTCGAAGAGATGTGTGATTCTTGATCTCCCATCAGATGTTTCCGAAATCAGCAATAGAAATGCATACCTTGGTGATACTTGGTTGCTTTGTCTCATCAATTGGGAAAATGTTCACCAGAGCAGCAGTTTTGTAAAGCAGTTGACCTCAGCTGGAGTCGTTCTATGCAATCAAAGAAGTCGAGCTATTGTGTATTGGCATGATATATATTCAGATGACAGGGTTACTCCTAGAATCAGTCTTGCATCTCCTGAAGAGCTGGGAATTTCTTCAATGGGGAATGGAAAAAGTCCTCCGACCCGATTGCAATCAGACAACAGGCTAGAAAGCCTTTCATATAATCAGAGTGTTTTTAATTCCTTGATTGTCTCTGCAATCCCTTCCAATTACCATGCTTGTGTTGCACTTGCCACTAGTTCCAACGGTGAATTCTGGCAATTTATTTGCAGTCCTGCTGGAGTTCTCCGTCAGAATGTATGTCATATCTTGTCAACTTCTATTAGTGGCTGTAGTCATCCTTCTGTGAGTAAGGGGTATCCAAGATCTCTTTTGTGGCGTTTTCCATCCCTACAGATGGAAAATTCACATAGGCAGTTTTTCTTATTAACTGATCATCAAATACAGTGCTTCAGCATTGAACTTTCACCTAATCCAACTTTGTCGAAACTCTGGTCCCATGAGATCATCGGCACTGACAATGATCTAGGAATTAAGAAAGACTTGGCGGGTCAGAAGAAAATATGGCCTTTGGATATTCAGGTAGATGAACGTGGTAAAGAATTAACCATTCTTGTTGCTATATTTTGCAAGGATCGTCTAAGCAGTTCAAGCTACACACAGTATTCTCTTTTGACTATGCAATACAAATCTGGTTTAGATGAAACCTTAAAGGGCTTTACGTATGGAAGCGTGTTAGAGAAGAAGTCCCCTGTACAAGTTATTATTCCGAAGGCAAGAGTGGAAGAGGAGGAGTTCTTACTTTCAATGAGACTGAGAGCTGGGGGAAAACCTTCAGGATCGGCTGTCATACTCTCCAGAGATGGAACTGCAACTGTCTCCTATTTTTGGAGAAACGCAACCAGGCTTTACCAATTTGATCTCCCGTATGATGCTGGAAAAGTTATTGATGCTTCCGTATTTCCTTCTACTGATGATACCGAGGAGGGTGCCTGGGTTGTACTAACTGAGAAAGCCGGAGTCTGGGCTATACCCGAGAAGGCTGTTTTGTTTGGTGGAGTTGAACCTCCGGAGCGAAGCTTGTCACGCAAAGGAAGCTCGAATGAAAAGACTAGTGGGGAAGAAAAAAACCTTCTGTTTTCAGCTGGTGTTTCTCCTGGGAAAGCTGGTGTAGATGCTCGGGATATTGATGCTATGCAGAGGGCTGGGTTTACCGGGGTTGTTGGCAGGAATGCTCAAGATGAAGAAGCAGAGGCCTTACTGAATCGCCTGTTTCATGATTTTCTCTTGTCAGGCCAGGTTGATGGGTTTCTTGAAAAGCTGAATAATGCTAGGGCATTTGAGAGGGATGCAGAAACTAATGCGTTTGCTCGGGCAAGCAAATCAATTGTTGATTC
GCTGGCTAAGCATTGGACAACTACCAGGGGTGCTGAAATTGTTTCCTTGGCTGTTGTATCGAACCAACTTGCAGAGAAGCAACAGAAACATGAAAAATTTCTTCAGTTTCTGGCTTTATCCAAGTGCCATGAAGAATTATGTATGAAACAAAGGCATGCTCTCCAAATTATTTTGGAAGATGGTGAAAAGCTGACTGGCATTATTCAGCTTAGGGAACTGCAAAATATGATCAATCAGAGCCGTTCAATTGGAGTCAGCCCCCAGCTTTCAAGTGCCAATGGGGAAGTTTCAGGTTCTTTATGGGATTTAATTCAGATAGTTGGTGAGAGAGCCCGCAGAAATACCGTTCTTTTGATGGATAGGGATAATGCTGAAGTGTTCTACAGTAAAATCTCTAATCTCGAAGAGGTATTTTATTGCTTGGACAGGCACTTGGATCACATTGTAAATGTGGGTCAGCCATTTAAGGTTCAGGTCCAGCGAGTTTGTGAACTGTCAAAAGCATGTGTGACTTTACTCCGCTGTGCTATGCACTATAAAAATGAGCATCATATGTGGTATCCACCTCCTGATGGCTTGGTGCCTTGGTATAGCCACCCTGTTGTGCGTGATGGGCTTTGGAGCATTGCGTCTTTTATGATTAAAATTTTGGATGATTCATCTGGAATTGATTTGTCTTCTAAGTCAGATATATGTTCTCATCTGGAGGTTCTAGCTGATGTAGTACTAGAGGCATATGTTGGTGCTATCACTGCGAAAGTTGAGCGCGGGGAAGAGCATAAAGGCCTCTCAGATGAGTATTGGAGAAAGAGGGATACACTTCTTGACTCCCTCTATCAACAGATTAAGGTTTTTGTGAATGCTAGATTCCAGGATTCAGTTGTTGAGAAAGAACAGAAAGAAGTAATTCTTAGGGAATTATCGTCAAATTTGTTGTCTGTTGCAAGGCGGCATGAAGCCTACTGTACTCTCTGGAACATTTGCTGTGATCTCAATGATTCTACTCTTCTTAAACAGCTTATGCACAAAAGCATGGGAGCAAGAGGAGGGTTCAGTTATTTTGTTTTCAAGCAAATGTATGAGAATGGACAATTTCCAAAACTTCTGAGGCTAGGGGAAGAGTTTCCTCAGGAGCTTGAAACTTTCCTAAAGGACCACCCGGATCTCCTATGGTTTCATGAAGTTTTTCTTCATCAATACTCTGGTGCTTCAAAAACTCTTCATGGTTTGGCACTTTCTGGAGATGAAAGATCGTTTTCAGCTGCTGGAAAGGAGGCAGAACTAGATTCTGGAACAAAAGACTTAACATTGCCTGAAAGGAAGCGTCTCTTACATCTGTCAAAGATAGCTGCTATTGCAGGTAAAGATGCTGTCTATGAGGCAGAAATAGAACGTATTGATGCAGACTTGAAGATTTTGAAGTCACAGGAGGATATCTTGAATCTGTATCCTGACGACTTAGAGAAGCAGAAAATTGGACGTCGACTCCTCCCTCCTTGGGATCTTATAAAATTGTGCCTTAAAGGTCAGACACCAGAGCTGTTATTAAGGGCTTTTGATGTGTTTGCTTGGACAAGCTCTTCCTTCCGCAAATCCAATAAAACACTTTTAGAAGAGTGCTGGAGAGGTGCAGTTGATCAGGATGACTGGCAAAAGCTCTATCAGTTTTCCATAGCGGAAGGTTGGAGTGATGAGGACACAATACGGGTTCTGCAGAAAACAATGCTTTTCCTGGCAGCAAGAAGGTGTTATGGACCGGAATCTGAAATGTATGATGGAGGTTTTGATGAAGTGTTACCATTGAGGCAAGAGAGCAGTGAGCTGCCAAATATGCAAGATTCTGGTTCTTCTGTTGAAGTGATTTTGATGCAGCATAAAGACTTCTCAGATGCTGGCAAGCTAATGCTGACTGCTCTGATGCGGGGAAGTGAACAAGTAGATATTGGTGGGGCAGACGGGCCAATTCCCATGGAGTAG
Protein sequence
MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAPWASRLSVLARIPPAKKSEDNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGMDKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWENVHQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNGKSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVLRQNVCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSSSSYTQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLLFSAGVSPGKAGVDARDIDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNARFQDSVVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDADLKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIGGADGPIPME
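The protein sequence above corresponds to the coding sequence (CDS) read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` and the use of `X` for codons containing ambiguous bases are illustrative choices, not part of this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a sequence that is
    wrapped over several lines can be passed in directly.
    Codons with ambiguous bases (for example N) are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 11 codons of the CDS listed above.
print(translate("ATGTTCACTCCTTCAACGACGAAGAAGAAATCG"))  # MFTPSTTKKKS
```

Applied to the full CDS, the same function should reproduce the protein sequence shown here, which ends in PIPME before the TAG stop codon.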
Homology
BLAST of Spo05021.1 vs. NCBI nr
Match:
gi|902211663|gb|KNA16789.1| (hypothetical protein SOVF_085660 [Spinacia oleracea])
HSP 1 Score: 2651.7 bits (6872), Expect = 0.000e+0
Identity = 1325/1325 (100.00%), Positives = 1325/1325 (100.00%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP
Sbjct: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
Query: 61 WASRLSVLARIPPAKKSEDNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGMDK 120
WASRLSVLARIPPAKKSEDNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGMDK
Sbjct: 61 WASRLSVLARIPPAKKSEDNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGMDK 120
Query: 121 EYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWENV 180
EYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWENV
Sbjct: 121 EYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWENV 180
Query: 181 HQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNGKSP 240
HQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNGKSP
Sbjct: 181 HQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNGKSP 240
Query: 241 PTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVLRQN 300
PTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVLRQN
Sbjct: 241 PTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVLRQN 300
Query: 301 VCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELSPNP 360
VCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELSPNP
Sbjct: 301 VCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELSPNP 360
Query: 361 TLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSSSSY 420
TLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSSSSY
Sbjct: 361 TLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSSSSY 420
Query: 421 TQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGGKPS 480
TQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGGKPS
Sbjct: 421 TQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGGKPS 480
Query: 481 GSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLTEKA 540
GSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLTEKA
Sbjct: 481 GSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLTEKA 540
Query: 541 GVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLLFSAGVSPGKAGVDARDIDAM 600
GVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLLFSAGVSPGKAGVDARDIDAM
Sbjct: 541 GVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLLFSAGVSPGKAGVDARDIDAM 600
Query: 601 QRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNAFARA 660
QRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNAFARA
Sbjct: 601 QRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNAFARA 660
Query: 661 SKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMKQRHA 720
SKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMKQRHA
Sbjct: 661 SKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMKQRHA 720
Query: 721 LQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGERARR 780
LQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGERARR
Sbjct: 721 LQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGERARR 780
Query: 781 NTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKACVTLL 840
NTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKACVTLL
Sbjct: 781 NTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKACVTLL 840
Query: 841 RCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKSDICS 900
RCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKSDICS
Sbjct: 841 RCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKSDICS 900
Query: 901 HLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNARFQDS 960
HLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNARFQDS
Sbjct: 901 HLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNARFQDS 960
Query: 961 VVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARGGFSY 1020
VVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARGGFSY
Sbjct: 961 VVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARGGFSY 1020
Query: 1021 FVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHGLALS 1080
FVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHGLALS
Sbjct: 1021 FVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHGLALS 1080
Query: 1081 GDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDADLKIL 1140
GDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDADLKIL
Sbjct: 1081 GDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDADLKIL 1140
Query: 1141 KSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSFRKSN 1200
KSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSFRKSN
Sbjct: 1141 KSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSFRKSN 1200
Query: 1201 KTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESEMYDG 1260
KTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESEMYDG
Sbjct: 1201 KTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESEMYDG 1260
Query: 1261 GFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIGGADG 1320
GFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIGGADG
Sbjct: 1261 GFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIGGADG 1320
Query: 1321 PIPME 1326
PIPME
Sbjct: 1321 PIPME 1325
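The percentages in an HSP summary line are the matching positions divided by the alignment length. As a rough sketch of that arithmetic, the hypothetical helper `parse_hsp_counts` below extracts every `label = matched/length` pair from such a line and recomputes the percentage; it is an illustration only and not part of any BLAST tool.

```python
import re


def parse_hsp_counts(line: str) -> dict:
    """Return {label: (matched, length, percent)} for each 'label = a/b' pair.

    The percentage is recomputed from the counts and rounded to two
    decimals, the precision used in the summary lines of this report.
    """
    counts = {}
    for label, matched, length in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        matched, length = int(matched), int(length)
        counts[label] = (matched, length, round(100 * matched / length, 2))
    return counts


# Identity counts of the alignment above: every aligned position matches.
stats = parse_hsp_counts("Identity = 1325/1325 (100.00%)")
print(stats["Identity"])
```

The same calculation gives 84.42% for 1122 identical positions out of an alignment length of 1329, where the length includes gap columns and can therefore exceed the 1325 residues of the query.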
BLAST of Spo05021.1 vs. NCBI nr
Match:
gi|731371096|ref|XP_010665934.1| (PREDICTED: nuclear pore complex protein NUP133 [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 2234.5 bits (5789), Expect = 0.000e+0
Identity = 1122/1329 (84.42%), Positives = 1207/1329 (90.82%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+PS+ +KKS YS+ K+ N QQKT FDSSPITP+A NRNS+ DGSIPNRPQ+GTPAP
Sbjct: 1 MFSPSS-RKKSTYSSLKDRNVNQQKTPLFDSSPITPVAGNRNSINDGSIPNRPQSGTPAP 60
Query: 61 WASRLSVLARIPPAKKSE--DNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
WASRLSV+ARIP AKK+E +N + +PVYVGDFPQAVHDEQ + +RNSVPG+ACI+GGM
Sbjct: 61 WASRLSVIARIPSAKKNEKMENVDLPQPVYVGDFPQAVHDEQTSFLRNSVPGEACISGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWE 180
DKE GLSWMICGSKLF+WSYLSPAASKRCV+LDLP+DVS+I +RNAYLGDTWLLCLINW+
Sbjct: 121 DKETGLSWMICGSKLFLWSYLSPAASKRCVVLDLPADVSDICSRNAYLGDTWLLCLINWD 180
Query: 181 NVHQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGI-SSMGNG 240
NVHQSSS V QLTSAGV+LCNQRSRAIVYW DIYSDD TPRISLAS EE I S MG+G
Sbjct: 181 NVHQSSSIVNQLTSAGVILCNQRSRAIVYWPDIYSDDGATPRISLASSEEPEIISPMGHG 240
Query: 241 KSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVL 300
+SPP +LQS+NRLE LS N S FNSLI S IP N+HACVALATSSNGE WQFIC+PAGVL
Sbjct: 241 RSPPRQLQSENRLERLSTNPSSFNSLIASTIPFNHHACVALATSSNGELWQFICTPAGVL 300
Query: 301 RQNVCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELS 360
RQ VC ILSTSISG SHPS SKGYPRSL+WRFPSL S+RQFF+LTDH+IQCFSI+LS
Sbjct: 301 RQKVCDILSTSISGYSHPSGSKGYPRSLIWRFPSLLTGKSNRQFFVLTDHEIQCFSIQLS 360
Query: 361 PNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSS 420
PNPTL K+W HEIIGTDN+LGIKKDLAGQK+IWPLDIQVDERGKELTILVAIFCKDRLSS
Sbjct: 361 PNPTLLKVWCHEIIGTDNELGIKKDLAGQKRIWPLDIQVDERGKELTILVAIFCKDRLSS 420
Query: 421 SSYTQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGG 480
SSYT+YSLLTMQYKSGLDET K FT G VLEKKSP+QVIIPKARVEEEEFL SM+LR GG
Sbjct: 421 SSYTEYSLLTMQYKSGLDETSKDFTQGRVLEKKSPIQVIIPKARVEEEEFLFSMKLRVGG 480
Query: 481 KPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLT 540
KPSGSA+ILS DGTATVSYFWRN+TRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLT
Sbjct: 481 KPSGSAIILSGDGTATVSYFWRNSTRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLT 540
Query: 541 EKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEE-KNLLFSAGVSPGKAGVDARD 600
EKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEK+ EE KNLL +AGVSPGKAG+DARD
Sbjct: 541 EKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKSPAEENKNLLNAAGVSPGKAGLDARD 600
Query: 601 IDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNA 660
+ QR+GFTGV G+NAQDEEAEALLNRLFHDFLLSGQVDG LEKL N RAFERD ETN
Sbjct: 601 VGGKQRSGFTGVAGKNAQDEEAEALLNRLFHDFLLSGQVDGSLEKLKNVRAFERDGETNV 660
Query: 661 FARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMK 720
FARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQL EKQQKHEKFLQFLALSKCHEELC++
Sbjct: 661 FARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLIEKQQKHEKFLQFLALSKCHEELCLE 720
Query: 721 QRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGE 780
QRHALQIILEDGEKL G+IQLRELQNMI+QS S S Q SSAN VSGSLWDLIQIVGE
Sbjct: 721 QRHALQIILEDGEKLIGMIQLRELQNMISQSHS---SSQHSSANAGVSGSLWDLIQIVGE 780
Query: 781 RARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKAC 840
RARRN VLLMDRDNAEVFYSKISNLEEVFYCLD+HLD+I+NVGQPFK QVQRVCELSKAC
Sbjct: 781 RARRNAVLLMDRDNAEVFYSKISNLEEVFYCLDKHLDYILNVGQPFKAQVQRVCELSKAC 840
Query: 841 VTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKS 900
VTL+R AMHY+NEHH+WYPPPDGLVPWYS PVVRDG+WS+ASFMIKILDDSSGIDLS KS
Sbjct: 841 VTLIRSAMHYRNEHHVWYPPPDGLVPWYSQPVVRDGIWSVASFMIKILDDSSGIDLSIKS 900
Query: 901 DICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNAR 960
DICSHLEVLADVVLEAYVGAITAK+ERGEE+KGLSDEY RKRD LLDSLYQQIKVFVNAR
Sbjct: 901 DICSHLEVLADVVLEAYVGAITAKIERGEEYKGLSDEYSRKRDALLDSLYQQIKVFVNAR 960
Query: 961 FQDSVVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARG 1020
FQDS EKEQK++ILRELSSNLLSVARRHEAY TLWNICCDLNDS LLK LM++SMG RG
Sbjct: 961 FQDSGDEKEQKDMILRELSSNLLSVARRHEAYRTLWNICCDLNDSALLKDLMNESMGPRG 1020
Query: 1021 GFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHG 1080
GFSYFVF+Q+ ENGQFPKLLRLGEEFPQEL FLKDH DL+W HEVFLHQYSGASKTLHG
Sbjct: 1021 GFSYFVFEQLNENGQFPKLLRLGEEFPQELVIFLKDHSDLMWLHEVFLHQYSGASKTLHG 1080
Query: 1081 LALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDAD 1140
LALSGDE FSAA KE ELD TLP+RKRLLHLSKIAAIAGKDA YEA+IERIDAD
Sbjct: 1081 LALSGDE-MFSAAEKETELDLRKTVYTLPQRKRLLHLSKIAAIAGKDADYEADIERIDAD 1140
Query: 1141 LKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSF 1200
LKILKSQEDILNL PDD+EKQKIG RLLPPWDL++LCLKGQTPELLLRAFDVFAWTSSSF
Sbjct: 1141 LKILKSQEDILNLCPDDMEKQKIGHRLLPPWDLVRLCLKGQTPELLLRAFDVFAWTSSSF 1200
Query: 1201 RKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESE 1260
R SNK+LLEECW+ AVD DDW+KLY FSIAEGWSDEDTIRVLQKTMLF AARRCYG ESE
Sbjct: 1201 RNSNKSLLEECWKSAVDHDDWEKLYHFSIAEGWSDEDTIRVLQKTMLFQAARRCYGQESE 1260
Query: 1261 MYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIG 1320
MYDGGFDEVLPLRQE++ELP+M+DSGSSVE ILMQHKD+SDAGKLMLT LM GS QVD+G
Sbjct: 1261 MYDGGFDEVLPLRQENTELPSMKDSGSSVEAILMQHKDYSDAGKLMLTTLMWGSAQVDVG 1320
Query: 1321 GADGPIPME 1326
G DGP+ ME
Sbjct: 1321 GKDGPVSME 1324
BLAST of Spo05021.1 vs. NCBI nr
Match:
gi|225447584|ref|XP_002272021.1| (PREDICTED: nuclear pore complex protein NUP133 [Vitis vinifera])
HSP 1 Score: 1713.0 bits (4435), Expect = 0.000e+0
Identity = 865/1338 (64.65%), Positives = 1064/1338 (79.52%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+P+T K+ N+S+ K+ N Q + +SPITPL NR SL + SIPNRP TGTPAP
Sbjct: 1 MFSPAT--KRPNFSSRKDRNLGQ----AVPNSPITPLTENRRSLNENSIPNRPSTGTPAP 60
Query: 61 WASRLSVLARIPPAKKSE--DNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
W SRLSV ARIP KKSE D + +PVYVG+FPQ V DEQA+ ++ VPGDA I GGM
Sbjct: 61 WTSRLSVYARIPQLKKSEKGDEIDPVQPVYVGEFPQVVRDEQASFLQKRVPGDASIFGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWE 180
DK LSW+ICG+KLF+WSYL+ ASK+CV+L+LPSD + NRN Y ++WLLC+++W
Sbjct: 121 DKGTALSWIICGNKLFIWSYLTSVASKKCVVLELPSDENGDVNRNNYHANSWLLCVVDWH 180
Query: 181 NVHQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGIS-SMGNG 240
+S +Q SAGVVLCNQ++R +VYW DIY+ V P +S AS + ++ S GNG
Sbjct: 181 GTFRSVG-KQQGNSAGVVLCNQKTRTVVYWPDIYAQGDVAPVVSFASSDGSELNFSPGNG 240
Query: 241 KSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVL 300
K P +L +RL S S S FNSLI SA+P H C+ALA+SSNGE WQF CSPAG+
Sbjct: 241 KITPNKLWQHSRLGSNSVGSSSFNSLIASAVPDTQHKCIALASSSNGELWQFQCSPAGIH 300
Query: 301 RQNVCH-ILSTSI----SGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCF 360
R+ + IL +S SG +P SKGYP+SL W S +E S+RQFFLLTD++IQCF
Sbjct: 301 RKQIYQEILGSSSQSNDSGNPNPIRSKGYPKSLTWHHSSFSLEKSNRQFFLLTDNEIQCF 360
Query: 361 SIELSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCK 420
+ SP+ ++KLWSHEIIGTD DLGIKKDLAGQK+IWPLD+QVD GK +TILVA FCK
Sbjct: 361 RVNFSPDLNVTKLWSHEIIGTDGDLGIKKDLAGQKRIWPLDVQVDAHGKVITILVATFCK 420
Query: 421 DRLSSSSYTQYSLLTMQYKSGLD--ETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLS 480
DR+SSSSYTQYSLLTMQYKSG++ E+++ + +VLEKKSPVQVIIPKARVE+E+FL S
Sbjct: 421 DRVSSSSYTQYSLLTMQYKSGINISESVEPI-HETVLEKKSPVQVIIPKARVEKEDFLFS 480
Query: 481 MRLRAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEE 540
M+LR GGKPSGSAVILS DGTATVS+++ N+TRLYQFDLPYDAGKV+DASVFPSTDD E+
Sbjct: 481 MKLRVGGKPSGSAVILSEDGTATVSHYYGNSTRLYQFDLPYDAGKVLDASVFPSTDDGED 540
Query: 541 GAWVVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEK-NLLFSAGVSPGK 600
GAWVVLTEKAGVWAIPEKAVL GGVEPPERSLSRKGSSNE ++ EE+ NL F+ ++P +
Sbjct: 541 GAWVVLTEKAGVWAIPEKAVLLGGVEPPERSLSRKGSSNEGSAQEERRNLAFATNIAPRR 600
Query: 601 AGVDARDIDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFE 660
A +A D QRA TGV R A+DEE+EALL+ LFHDFLLSGQVD LEKL N AFE
Sbjct: 601 ASSEAWDAGDRQRAALTGVARRTARDEESEALLSHLFHDFLLSGQVDDSLEKLRNCGAFE 660
Query: 661 RDAETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKC 720
RD ETN F R SKSIVD+LAKHWTTTRGAEIV++AVVS QL++KQQKH+KFLQFLALS+C
Sbjct: 661 RDGETNVFVRTSKSIVDTLAKHWTTTRGAEIVAMAVVSTQLSDKQQKHKKFLQFLALSRC 720
Query: 721 HEELCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWD 780
HEELC KQR +LQII+E GEKL G+IQLRELQNMI+Q+R G SS+ +SGSLWD
Sbjct: 721 HEELCSKQRESLQIIMEHGEKLIGMIQLRELQNMISQNRLAGAGSPYSSSESGISGSLWD 780
Query: 781 LIQIVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRV 840
LIQ+VGERARRNTVLLMDRDNAEVFYSK+S++EEVFYCLDR L+++++ P VQ+QR
Sbjct: 781 LIQLVGERARRNTVLLMDRDNAEVFYSKVSDIEEVFYCLDRQLEYVISAELPLMVQIQRA 840
Query: 841 CELSKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSG 900
CELS ACVTL++ A HYKNE+H+WYP P+GL PWY PVVR+G WS+ASFM+++L+D +G
Sbjct: 841 CELSNACVTLIQAATHYKNENHIWYPSPEGLTPWYCQPVVRNGQWSVASFMLQLLNDRTG 900
Query: 901 IDLSSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQI 960
+D+S KSD+ S+LE LA+V+LEAY GAITAKVERGEEHKGL +EYW +RDTLL+SLYQ +
Sbjct: 901 LDMSLKSDLYSNLEALAEVLLEAYTGAITAKVERGEEHKGLLNEYWNRRDTLLNSLYQVV 960
Query: 961 KVFVNARFQDSVVE-KEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLM 1020
K FV + +QDS +EQKEVIL++LSS+LLS+A+RHE Y TLWNICCDLND+ LL+ +M
Sbjct: 961 KGFVESGYQDSNEGIEEQKEVILKKLSSSLLSIAKRHEGYLTLWNICCDLNDAVLLRNIM 1020
Query: 1021 HKSMGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYS 1080
H+SMG + GFSYFVF+Q+YE+ QF KLLRLGEEF ++L FL++H DL W HE+FLHQ+S
Sbjct: 1021 HESMGPKAGFSYFVFRQLYESRQFSKLLRLGEEFQEDLSIFLQEHQDLRWLHELFLHQFS 1080
Query: 1081 GASKTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEA 1140
AS+TL LALS D S S+A K DSGT L ER+RLL+LSKIA +AGKDA YE
Sbjct: 1081 SASETLQLLALSQDGSSISSAEKGINPDSGTSGKKLVERRRLLNLSKIAVLAGKDADYET 1140
Query: 1141 EIERIDADLKILKSQEDILNLYP-DDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFD 1200
+I+RI+ADLKILK QE+I+ L P D++ ++ + +RLLPP DLI+LCLK + PEL L AF+
Sbjct: 1141 KIKRIEADLKILKLQEEIIRLLPSDEVVEKGMEQRLLPPRDLIELCLKAEIPELPLLAFE 1200
Query: 1201 VFAWTSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAA 1260
V AWTSSSFRK+N++LLEECW+ A +QDDW KLY+ S+AEGWSDEDT+RVL++TMLF A+
Sbjct: 1201 VLAWTSSSFRKANRSLLEECWKCAANQDDWGKLYEASVAEGWSDEDTLRVLRETMLFQAS 1260
Query: 1261 RRCYGPESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALM 1320
RCYGP +E ++GGFDEVL LRQE+ E+PN+++SGSSVE ILMQHKDF DAGKLMLTA+M
Sbjct: 1261 NRCYGPGTETFEGGFDEVLVLRQENMEIPNLKESGSSVETILMQHKDFPDAGKLMLTAVM 1320
Query: 1321 RGSEQVDIGGADGPIPME 1326
GS ++D+ +GP PME
Sbjct: 1321 MGSVEIDVRSYEGPSPME 1330
BLAST of Spo05021.1 vs. NCBI nr
Match:
gi|703161382|ref|XP_010112777.1| (hypothetical protein L484_020008 [Morus notabilis])
HSP 1 Score: 1602.0 bits (4147), Expect = 0.000e+0
Identity = 826/1334 (61.92%), Positives = 1021/1334 (76.54%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+P T K+S+ S+ ++P+ T +SP+TPLA NR S +D +P+RP TGTPAP
Sbjct: 1 MFSPGT--KRSHGSSRRDPSLGHAAT----ASPVTPLAENRRSSSDNLVPHRPATGTPAP 60
Query: 61 WASRLSVLARIPPAKKSE--DNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
WA RLSVLARIP K+E D+ + +PVYVG+FPQ V DEQ +++ VPG+A I GGM
Sbjct: 61 WAPRLSVLARIPIVNKNEKGDDIDPIKPVYVGEFPQVVRDEQTKLLQKRVPGEAFIYGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWE 180
+K ++W+ICGS+LF+WSYLSPAAS +CV+L++PS+V E + GDTW LC +NW+
Sbjct: 121 EKGKCIAWIICGSRLFIWSYLSPAASMKCVVLEIPSNVLENGDIRRSDGDTWSLCAVNWD 180
Query: 181 NVH-QSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNG 240
++ V+ A +VLCNQ++RA++YW DIYS + P IS AS +EL +
Sbjct: 181 MTSSRTKKVVEHNNYAAIVLCNQKTRAVIYWRDIYSKVKTAPVISTASSDELEVIF---- 240
Query: 241 KSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVL 300
+ +R Q +R S FNSLI SA+P++ H CVA+A+SSNGE WQF+CSP+G+
Sbjct: 241 -TTLSRQQHSSRQRSGLTELYSFNSLIASAVPNSQHVCVAIASSSNGELWQFLCSPSGIK 300
Query: 301 RQNVCHILSTSISGCS----HPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFS 360
RQ V H ++S++ H + SKGYPRSL+WRF + S+RQFFLLTDH+I CF+
Sbjct: 301 RQKV-HWNTSSLTSQGGDNGHVTGSKGYPRSLIWRFSHSSVHESNRQFFLLTDHEIHCFN 360
Query: 361 IELSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKD 420
+EL + +SK+WSHEIIGTD DLGIKKDLAGQK++WPLD+QVD GK +TILVA FCKD
Sbjct: 361 VELFLDINVSKVWSHEIIGTDGDLGIKKDLAGQKRVWPLDVQVDIYGKVITILVATFCKD 420
Query: 421 RLSSSSYTQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRL 480
R+SSSSYTQYSLLTMQYKSG+ + + +LEKK+P+QVIIPKARVE+E+FL SMRL
Sbjct: 421 RVSSSSYTQYSLLTMQYKSGVSTEVG---HERILEKKAPIQVIIPKARVEDEDFLFSMRL 480
Query: 481 RAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAW 540
R GGKPSGS +ILS DGTATVS+++RN TRLYQFDLPYDAGKV+DASV PSTDD E GAW
Sbjct: 481 RVGGKPSGSTIILSNDGTATVSHYYRNFTRLYQFDLPYDAGKVLDASVLPSTDDGE-GAW 540
Query: 541 VVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEE-KNLLFSAGVSPGKAGV 600
VVLTEKAG+WAIPEKAV+ GGVEPPERSLSRKGSSNE ++ EE KNL F ++P +A
Sbjct: 541 VVLTEKAGIWAIPEKAVILGGVEPPERSLSRKGSSNEGSAQEERKNLTFGGNMAPRRASS 600
Query: 601 DARDIDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDA 660
+A++ Q+A + RN DEE+E LL +LFHDF LSGQV+G LEKL +RAFER
Sbjct: 601 EAQEPVDRQKAVKGVIARRNTLDEESETLLGQLFHDFQLSGQVEGSLEKLQKSRAFERGE 660
Query: 661 ETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEE 720
ETN FAR SKSIVD+LAKHWTTTRGAEI+++AVVS+QL +KQQKHEKFLQFLALSKCHEE
Sbjct: 661 ETNVFARLSKSIVDTLAKHWTTTRGAEILAMAVVSSQLLDKQQKHEKFLQFLALSKCHEE 720
Query: 721 LCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQ 780
LC +QRH+LQIILE GEKL G+IQLRELQN I+Q+RS G+ SS + SG+LWDLIQ
Sbjct: 721 LCSRQRHSLQIILEHGEKLAGMIQLRELQNAISQNRSAGIGSSHSSQEIQTSGALWDLIQ 780
Query: 781 IVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCEL 840
+VGERARR+TVLLMDRDNAEVFYSKIS+LEEVFYCLDR LD+I++ QPF VQ QR CEL
Sbjct: 781 LVGERARRSTVLLMDRDNAEVFYSKISDLEEVFYCLDRQLDYIISTEQPFGVQNQRACEL 840
Query: 841 SKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDL 900
S ACV +++ AMHYKNEHH+WYPPP+GL PWY VVR G+WSIASFM+++L ++S +D+
Sbjct: 841 SNACVAIVQTAMHYKNEHHLWYPPPEGLTPWYCKHVVRSGIWSIASFMLQLLKEASTLDV 900
Query: 901 SSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVF 960
S+KSD+ +HLE LA+++LEAY GAI AKVE GE+HKGL DEYW +RD LLDSLYQQ+K F
Sbjct: 901 SAKSDLYTHLEALAEILLEAYAGAIKAKVELGEDHKGLLDEYWCRRDLLLDSLYQQVKEF 960
Query: 961 VNARFQD-SVVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKS 1020
V QD S E K+ L++ SS LLS+A RHE Y TLW ICCDLNDS LL+ LM +S
Sbjct: 961 VEDGHQDISEETSEHKKDSLKKFSSQLLSIANRHECYNTLWKICCDLNDSELLRNLMRES 1020
Query: 1021 MGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGAS 1080
MG GGFSYFVFKQ+Y++ QF KLLRLGEEF +EL FLK H DLLW HE+FLHQ+S AS
Sbjct: 1021 MGPNGGFSYFVFKQLYKSRQFSKLLRLGEEFLEELSIFLKRHQDLLWLHELFLHQFSLAS 1080
Query: 1081 KTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIE 1140
+TLH LALS ERS S + + GT L +RKRLL+LSKIAAIAGK EA ++
Sbjct: 1081 ETLHLLALSQHERSMSET-EGTDPHYGTMVPKLQDRKRLLNLSKIAAIAGKGE--EANVK 1140
Query: 1141 RIDADLKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAW 1200
RI+ADLKILK QE+I+ DD KQ +G RLL P +LIKLCL+ ++PEL L AFDVFAW
Sbjct: 1141 RIEADLKILKLQEEIVKFLSDDGTKQSVGERLLNPEELIKLCLEMKSPELALCAFDVFAW 1200
Query: 1201 TSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCY 1260
TSSSFRK++K LLEECW+ A +QDDW KLYQ S EGW+DE+T++ L+ TMLF A+ RCY
Sbjct: 1201 TSSSFRKAHKNLLEECWKNAAEQDDWSKLYQASTIEGWTDEETLQNLKHTMLFKASSRCY 1260
Query: 1261 GPESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSE 1320
GP +E + GFD+VLPLRQE+SE P M+DSGSSV LMQHKD+ +AGKL+LTA+M GS
Sbjct: 1261 GPLAETFGEGFDQVLPLRQETSEPPIMKDSGSSVLANLMQHKDYPEAGKLLLTAIMLGSL 1315
Query: 1321 QVDIGGADGPIPME 1326
+ D G +G PME
Sbjct: 1321 EDDTGEEEGTTPME 1315
BLAST of Spo05021.1 vs. NCBI nr
Match:
gi|590712148|ref|XP_007049309.1| (Nucleoporin, Nup133/Nup155-like, putative isoform 1 [Theobroma cacao])
HSP 1 Score: 1592.0 bits (4121), Expect = 0.000e+0
Identity = 803/1333 (60.24%), Positives = 1017/1333 (76.29%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+P K+S S+ K N Q ++ SP+TP NR S + SIP+RP TGTPAP
Sbjct: 1 MFSPGL--KRSKLSSRKERNLGQN--LATPDSPVTPYTVNRKSAHETSIPDRPNTGTPAP 60
Query: 61 WASRLSVLARIPPAKKSE--DNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
WA RLSVLARIPPA K+E D + +PV+VG+FPQ VHDEQ + +R +P D CI+GGM
Sbjct: 61 WAPRLSVLARIPPANKNEKGDELDPIKPVFVGEFPQVVHDEQTSFLRKCLPADVCISGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISN--RNAYLGDTWLLCLIN 180
+K LSW+ICG+K+F+WSYLS AASK+C+ L+LPSDV E ++ RN+Y + WLL ++N
Sbjct: 121 EKGTCLSWIICGNKIFIWSYLSSAASKKCITLELPSDVLENADVGRNSYHCNNWLLTVVN 180
Query: 181 WENVHQSSSFV-KQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEE-LGISSM 240
W + + ++ V K SAG+VLCNQ++RA+VYW DI++D P S AS +E L SS
Sbjct: 181 WNSTSKGTNKVPKDCYSAGIVLCNQKTRAVVYWSDIFADVGNAPVTSFASSDESLVTSSP 240
Query: 241 GNGKSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPA 300
+G + +R Q +R S FNSLI SAIP H CVALA SS+GE WQF CSP+
Sbjct: 241 IDGNNTTSRQQQRSRHGMSFIGSSSFNSLIASAIPGTQHVCVALACSSSGELWQFYCSPS 300
Query: 301 GVLRQNVC-HILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFS 360
G+ V +I ++ +G SKGYPRS++WR + + +RQF LLTD +IQCF+
Sbjct: 301 GIQCDKVYQNIQNSQGTGIGQLVGSKGYPRSMIWRLRYFSVSDHNRQFLLLTDREIQCFN 360
Query: 361 IELSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKD 420
I+L P+ +SKLWS EI+G D DLGIKKDLAGQK+IWPLD+QVD+ GK +T+LVA FCKD
Sbjct: 361 IKLCPDIEVSKLWSQEIVGNDGDLGIKKDLAGQKRIWPLDLQVDDPGKVITVLVATFCKD 420
Query: 421 RLSSSSYTQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRL 480
R+SSSSYTQYSLLTMQ+KSG+ ++ + VLEKK+P+QVIIPKARVE+E+FL SMRL
Sbjct: 421 RVSSSSYTQYSLLTMQHKSGVRVSISSDVHERVLEKKAPIQVIIPKARVEDEDFLFSMRL 480
Query: 481 RAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAW 540
+ GGKPSGS +ILS DGTATVS+++RN+TRLYQFDLPYDAGKV+DASV PSTDD E+GAW
Sbjct: 481 QVGGKPSGSTIILSGDGTATVSHYYRNSTRLYQFDLPYDAGKVLDASVLPSTDDGEDGAW 540
Query: 541 VVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEK-NLLFSAGVSPGKAGV 600
VVLTEKAG+WAIPEKAV+ GGVEPPERSLSRKGSSNE ++ EE+ NL+F+ V+P +A
Sbjct: 541 VVLTEKAGIWAIPEKAVVLGGVEPPERSLSRKGSSNEGSAQEERRNLMFAGNVAPRRASS 600
Query: 601 DARDIDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDA 660
DA D Q TG++ R AQDEE+EALL + FH+FL+SG+VDG LEKL N+ AFERD
Sbjct: 601 DAWDAGDRQPPVMTGIIRRTAQDEESEALLGQFFHEFLISGKVDGSLEKLKNSGAFERDG 660
Query: 661 ETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEE 720
ET+ F R SKSIVD+LAKHWTTTRGAEIVSL ++S QL +KQQKH+KFLQFLALSKCHEE
Sbjct: 661 ETSIFVRTSKSIVDTLAKHWTTTRGAEIVSLGIISAQLMDKQQKHQKFLQFLALSKCHEE 720
Query: 721 LCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQ 780
LC QRH+LQIILE GEKL+ IIQLRELQN+I+Q+RS GV S+ +SG+LWDLIQ
Sbjct: 721 LCSGQRHSLQIILEHGEKLSAIIQLRELQNVISQNRSTGVGSTHLSSETLISGALWDLIQ 780
Query: 781 IVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCEL 840
+VGERARRNTVLLMDRDNAEVFYSK+S+ ++VFYCL+RHL++I+++ QP ++Q+QR CEL
Sbjct: 781 LVGERARRNTVLLMDRDNAEVFYSKVSDFDQVFYCLERHLEYIISLEQPVEIQIQRSCEL 840
Query: 841 SKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDL 900
S ACVT+ R AM YKNE+H+WYPPP+GL PWY VVR+GLWSIASFM+++L ++S +D+
Sbjct: 841 SNACVTIFRAAMDYKNEYHLWYPPPEGLTPWYCQLVVRNGLWSIASFMLQLLKETSELDV 900
Query: 901 SSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVF 960
S+KS++ SHLE L +V+LE GAITAK+ERGEEHKGL +EYW +RD LLDSLYQQ+K
Sbjct: 901 SAKSELYSHLEALTEVLLEVSSGAITAKIERGEEHKGLLNEYWSRRDALLDSLYQQVKGL 960
Query: 961 VNARFQDSVVE-KEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKS 1020
V A QD +E + ILR+LSS+LLS +++HEAY T+WNICCDLNDS LL+ LMH+S
Sbjct: 961 VEAGNQDITESIEENNQEILRKLSSSLLSTSKQHEAYQTMWNICCDLNDSGLLRNLMHES 1020
Query: 1021 MGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGAS 1080
+G RGGFSYFVFKQ+YE QF KLLRLGEEF ++L FL H DLLW HEVFLHQ+S AS
Sbjct: 1021 VGPRGGFSYFVFKQLYEKKQFSKLLRLGEEFQEDLSNFLNHHRDLLWLHEVFLHQFSAAS 1080
Query: 1081 KTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIE 1140
+TLH LALS +E S S E + D TL +R+R+L+LS IAA AGKD + +++
Sbjct: 1081 ETLHILALSQEEDSISTTEDETDADHANPVPTLADRRRILNLSMIAAFAGKDPDSQPKVK 1140
Query: 1141 RIDADLKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAW 1200
RI+ADLKIL+ QE+I+ + P D Q + + LL P +LI+LCL+ ++ EL L+ FDVFAW
Sbjct: 1141 RIEADLKILRLQEEIMEVLPTDDTMQHVEKHLLRPEELIELCLQSRSRELALQVFDVFAW 1200
Query: 1201 TSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCY 1260
TSSSFRKS++ LLEECW+ A DQD W +LY+ S+ EGWSDE+T++ L +T+LF A+ RCY
Sbjct: 1201 TSSSFRKSHRNLLEECWKNAADQDPWSQLYEASVTEGWSDEETLQQLSQTILFQASNRCY 1260
Query: 1261 GPESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSE 1320
GP++E + GFDEVLPLRQE+ E ++ D SSVE ILMQH+DF AGKLMLTA+M G
Sbjct: 1261 GPKAETIEEGFDEVLPLRQENLEAASLNDKRSSVEAILMQHRDFPYAGKLMLTAIMLGCV 1320
Query: 1321 QVDIGGADGPIPM 1325
Q +G P+
Sbjct: 1321 QDHAKKEEGLSPV 1329
BLAST of Spo05021.1 vs. UniProtKB/TrEMBL
Match:
A0A0K9RCR8_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_085660 PE=4 SV=1)
HSP 1 Score: 2651.7 bits (6872), Expect = 0.000e+0
Identity = 1325/1325 (100.00%), Positives = 1325/1325 (100.00%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP
Sbjct: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
Query: 61 WASRLSVLARIPPAKKSEDNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGMDK 120
WASRLSVLARIPPAKKSEDNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGMDK
Sbjct: 61 WASRLSVLARIPPAKKSEDNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGMDK 120
Query: 121 EYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWENV 180
EYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWENV
Sbjct: 121 EYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWENV 180
Query: 181 HQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNGKSP 240
HQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNGKSP
Sbjct: 181 HQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNGKSP 240
Query: 241 PTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVLRQN 300
PTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVLRQN
Sbjct: 241 PTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVLRQN 300
Query: 301 VCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELSPNP 360
VCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELSPNP
Sbjct: 301 VCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELSPNP 360
Query: 361 TLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSSSSY 420
TLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSSSSY
Sbjct: 361 TLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSSSSY 420
Query: 421 TQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGGKPS 480
TQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGGKPS
Sbjct: 421 TQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGGKPS 480
Query: 481 GSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLTEKA 540
GSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLTEKA
Sbjct: 481 GSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLTEKA 540
Query: 541 GVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLLFSAGVSPGKAGVDARDIDAM 600
GVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLLFSAGVSPGKAGVDARDIDAM
Sbjct: 541 GVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLLFSAGVSPGKAGVDARDIDAM 600
Query: 601 QRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNAFARA 660
QRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNAFARA
Sbjct: 601 QRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNAFARA 660
Query: 661 SKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMKQRHA 720
SKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMKQRHA
Sbjct: 661 SKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMKQRHA 720
Query: 721 LQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGERARR 780
LQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGERARR
Sbjct: 721 LQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGERARR 780
Query: 781 NTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKACVTLL 840
NTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKACVTLL
Sbjct: 781 NTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKACVTLL 840
Query: 841 RCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKSDICS 900
RCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKSDICS
Sbjct: 841 RCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKSDICS 900
Query: 901 HLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNARFQDS 960
HLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNARFQDS
Sbjct: 901 HLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNARFQDS 960
Query: 961 VVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARGGFSY 1020
VVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARGGFSY
Sbjct: 961 VVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARGGFSY 1020
Query: 1021 FVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHGLALS 1080
FVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHGLALS
Sbjct: 1021 FVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHGLALS 1080
Query: 1081 GDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDADLKIL 1140
GDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDADLKIL
Sbjct: 1081 GDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDADLKIL 1140
Query: 1141 KSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSFRKSN 1200
KSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSFRKSN
Sbjct: 1141 KSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSFRKSN 1200
Query: 1201 KTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESEMYDG 1260
KTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESEMYDG
Sbjct: 1201 KTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESEMYDG 1260
Query: 1261 GFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIGGADG 1320
GFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIGGADG
Sbjct: 1261 GFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIGGADG 1320
Query: 1321 PIPME 1326
PIPME
Sbjct: 1321 PIPME 1325
BLAST of Spo05021.1 vs. UniProtKB/TrEMBL
Match:
A0A0J8E006_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_9g224940 PE=4 SV=1)
HSP 1 Score: 2234.5 bits (5789), Expect = 0.000e+0
Identity = 1122/1329 (84.42%), Positives = 1207/1329 (90.82%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+PS+ +KKS YS+ K+ N QQKT FDSSPITP+A NRNS+ DGSIPNRPQ+GTPAP
Sbjct: 1 MFSPSS-RKKSTYSSLKDRNVNQQKTPLFDSSPITPVAGNRNSINDGSIPNRPQSGTPAP 60
Query: 61 WASRLSVLARIPPAKKSE--DNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
WASRLSV+ARIP AKK+E +N + +PVYVGDFPQAVHDEQ + +RNSVPG+ACI+GGM
Sbjct: 61 WASRLSVIARIPSAKKNEKMENVDLPQPVYVGDFPQAVHDEQTSFLRNSVPGEACISGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWE 180
DKE GLSWMICGSKLF+WSYLSPAASKRCV+LDLP+DVS+I +RNAYLGDTWLLCLINW+
Sbjct: 121 DKETGLSWMICGSKLFLWSYLSPAASKRCVVLDLPADVSDICSRNAYLGDTWLLCLINWD 180
Query: 181 NVHQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGI-SSMGNG 240
NVHQSSS V QLTSAGV+LCNQRSRAIVYW DIYSDD TPRISLAS EE I S MG+G
Sbjct: 181 NVHQSSSIVNQLTSAGVILCNQRSRAIVYWPDIYSDDGATPRISLASSEEPEIISPMGHG 240
Query: 241 KSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVL 300
+SPP +LQS+NRLE LS N S FNSLI S IP N+HACVALATSSNGE WQFIC+PAGVL
Sbjct: 241 RSPPRQLQSENRLERLSTNPSSFNSLIASTIPFNHHACVALATSSNGELWQFICTPAGVL 300
Query: 301 RQNVCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIELS 360
RQ VC ILSTSISG SHPS SKGYPRSL+WRFPSL S+RQFF+LTDH+IQCFSI+LS
Sbjct: 301 RQKVCDILSTSISGYSHPSGSKGYPRSLIWRFPSLLTGKSNRQFFVLTDHEIQCFSIQLS 360
Query: 361 PNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRLSS 420
PNPTL K+W HEIIGTDN+LGIKKDLAGQK+IWPLDIQVDERGKELTILVAIFCKDRLSS
Sbjct: 361 PNPTLLKVWCHEIIGTDNELGIKKDLAGQKRIWPLDIQVDERGKELTILVAIFCKDRLSS 420
Query: 421 SSYTQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRLRAGG 480
SSYT+YSLLTMQYKSGLDET K FT G VLEKKSP+QVIIPKARVEEEEFL SM+LR GG
Sbjct: 421 SSYTEYSLLTMQYKSGLDETSKDFTQGRVLEKKSPIQVIIPKARVEEEEFLFSMKLRVGG 480
Query: 481 KPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLT 540
KPSGSA+ILS DGTATVSYFWRN+TRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLT
Sbjct: 481 KPSGSAIILSGDGTATVSYFWRNSTRLYQFDLPYDAGKVIDASVFPSTDDTEEGAWVVLT 540
Query: 541 EKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEE-KNLLFSAGVSPGKAGVDARD 600
EKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEK+ EE KNLL +AGVSPGKAG+DARD
Sbjct: 541 EKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKSPAEENKNLLNAAGVSPGKAGLDARD 600
Query: 601 IDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDAETNA 660
+ QR+GFTGV G+NAQDEEAEALLNRLFHDFLLSGQVDG LEKL N RAFERD ETN
Sbjct: 601 VGGKQRSGFTGVAGKNAQDEEAEALLNRLFHDFLLSGQVDGSLEKLKNVRAFERDGETNV 660
Query: 661 FARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEELCMK 720
FARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQL EKQQKHEKFLQFLALSKCHEELC++
Sbjct: 661 FARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLIEKQQKHEKFLQFLALSKCHEELCLE 720
Query: 721 QRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQIVGE 780
QRHALQIILEDGEKL G+IQLRELQNMI+QS S S Q SSAN VSGSLWDLIQIVGE
Sbjct: 721 QRHALQIILEDGEKLIGMIQLRELQNMISQSHS---SSQHSSANAGVSGSLWDLIQIVGE 780
Query: 781 RARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCELSKAC 840
RARRN VLLMDRDNAEVFYSKISNLEEVFYCLD+HLD+I+NVGQPFK QVQRVCELSKAC
Sbjct: 781 RARRNAVLLMDRDNAEVFYSKISNLEEVFYCLDKHLDYILNVGQPFKAQVQRVCELSKAC 840
Query: 841 VTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDLSSKS 900
VTL+R AMHY+NEHH+WYPPPDGLVPWYS PVVRDG+WS+ASFMIKILDDSSGIDLS KS
Sbjct: 841 VTLIRSAMHYRNEHHVWYPPPDGLVPWYSQPVVRDGIWSVASFMIKILDDSSGIDLSIKS 900
Query: 901 DICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVFVNAR 960
DICSHLEVLADVVLEAYVGAITAK+ERGEE+KGLSDEY RKRD LLDSLYQQIKVFVNAR
Sbjct: 901 DICSHLEVLADVVLEAYVGAITAKIERGEEYKGLSDEYSRKRDALLDSLYQQIKVFVNAR 960
Query: 961 FQDSVVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKSMGARG 1020
FQDS EKEQK++ILRELSSNLLSVARRHEAY TLWNICCDLNDS LLK LM++SMG RG
Sbjct: 961 FQDSGDEKEQKDMILRELSSNLLSVARRHEAYRTLWNICCDLNDSALLKDLMNESMGPRG 1020
Query: 1021 GFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGASKTLHG 1080
GFSYFVF+Q+ ENGQFPKLLRLGEEFPQEL FLKDH DL+W HEVFLHQYSGASKTLHG
Sbjct: 1021 GFSYFVFEQLNENGQFPKLLRLGEEFPQELVIFLKDHSDLMWLHEVFLHQYSGASKTLHG 1080
Query: 1081 LALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIERIDAD 1140
LALSGDE FSAA KE ELD TLP+RKRLLHLSKIAAIAGKDA YEA+IERIDAD
Sbjct: 1081 LALSGDE-MFSAAEKETELDLRKTVYTLPQRKRLLHLSKIAAIAGKDADYEADIERIDAD 1140
Query: 1141 LKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAWTSSSF 1200
LKILKSQEDILNL PDD+EKQKIG RLLPPWDL++LCLKGQTPELLLRAFDVFAWTSSSF
Sbjct: 1141 LKILKSQEDILNLCPDDMEKQKIGHRLLPPWDLVRLCLKGQTPELLLRAFDVFAWTSSSF 1200
Query: 1201 RKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCYGPESE 1260
R SNK+LLEECW+ AVD DDW+KLY FSIAEGWSDEDTIRVLQKTMLF AARRCYG ESE
Sbjct: 1201 RNSNKSLLEECWKSAVDHDDWEKLYHFSIAEGWSDEDTIRVLQKTMLFQAARRCYGQESE 1260
Query: 1261 MYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSEQVDIG 1320
MYDGGFDEVLPLRQE++ELP+M+DSGSSVE ILMQHKD+SDAGKLMLT LM GS QVD+G
Sbjct: 1261 MYDGGFDEVLPLRQENTELPSMKDSGSSVEAILMQHKDYSDAGKLMLTTLMWGSAQVDVG 1320
Query: 1321 GADGPIPME 1326
G DGP+ ME
Sbjct: 1321 GKDGPVSME 1324
BLAST of Spo05021.1 vs. UniProtKB/TrEMBL
Match:
F6HHM2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0057g00610 PE=4 SV=1)
HSP 1 Score: 1606.7 bits (4159), Expect = 0.000e+0
Identity = 831/1337 (62.15%), Positives = 1029/1337 (76.96%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+P+T K+ N+S+ K+ N Q + +SPITPL NR SL + SIPNRP TGTPAP
Sbjct: 1 MFSPAT--KRPNFSSRKDRNLGQ----AVPNSPITPLTENRRSLNENSIPNRPSTGTPAP 60
Query: 61 WASRLSVLARIPPAKKSE--DNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
W SRLSV ARIP KKSE D + +PVYVG+FPQ V DEQA+ ++ VPGDA I GGM
Sbjct: 61 WTSRLSVYARIPQLKKSEKGDEIDPVQPVYVGEFPQVVRDEQASFLQKRVPGDASIFGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWE 180
DK LSW+ICG+KLF+WSYL+ ASK+CV+L+LPSD + NRN Y ++WLLC+++W
Sbjct: 121 DKGTALSWIICGNKLFIWSYLTSVASKKCVVLELPSDENGDVNRNNYHANSWLLCVVDWH 180
Query: 181 NVHQSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISL---ASPEELGISSMG 240
+S +Q SAGVVLCNQ++R +VYW P I +P SS G
Sbjct: 181 GTFRSVG-KQQGNSAGVVLCNQKTRTVVYW----------PDIYAQGDVAPVVSFASSDG 240
Query: 241 NGK--SPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSP 300
+ SP + N+L + S S V + S++++ +A A
Sbjct: 241 SELNFSPGNGKITPNKL----WQHSRLGSNSVGS--SSFNSLIASAVPDT---------- 300
Query: 301 AGVLRQNVCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFS 360
Q+ C L++S S GYP+SL W S +E S+RQFFLLTD++IQCF
Sbjct: 301 -----QHKCIALASS---------SNGYPKSLTWHHSSFSLEKSNRQFFLLTDNEIQCFR 360
Query: 361 IELSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKD 420
+ SP+ ++KLWSHEIIGTD DLGIKKDLAGQK+IWPLD+QVD GK +TILVA FCKD
Sbjct: 361 VNFSPDLNVTKLWSHEIIGTDGDLGIKKDLAGQKRIWPLDVQVDAHGKVITILVATFCKD 420
Query: 421 RLSSSSYTQYSLLTMQYKSGLD--ETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSM 480
R+SSSSYTQYSLLTMQYKSG++ E+++ + +VLEKKSPVQVIIPKARVE+E+FL SM
Sbjct: 421 RVSSSSYTQYSLLTMQYKSGINISESVEPI-HETVLEKKSPVQVIIPKARVEKEDFLFSM 480
Query: 481 RLRAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEG 540
+LR GGKPSGSAVILS DGTATVS+++ N+TRLYQFDLPYDAGKV+DASVFPSTDD E+G
Sbjct: 481 KLRVGGKPSGSAVILSEDGTATVSHYYGNSTRLYQFDLPYDAGKVLDASVFPSTDDGEDG 540
Query: 541 AWVVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEK-NLLFSAGVSPGKA 600
AWVVLTEKAGVWAIPEKAVL GGVEPPERSLSRKGSSNE ++ EE+ NL F+ ++P +A
Sbjct: 541 AWVVLTEKAGVWAIPEKAVLLGGVEPPERSLSRKGSSNEGSAQEERRNLAFATNIAPRRA 600
Query: 601 GVDARDIDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFER 660
+A D QRA TGV R A+DEE+EALL+ LFHDFLLSGQVD LEKL N AFER
Sbjct: 601 SSEAWDAGDRQRAALTGVARRTARDEESEALLSHLFHDFLLSGQVDDSLEKLRNCGAFER 660
Query: 661 DAETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCH 720
D ETN F R SKSIVD+LAKHWTTTRGAEIV++AVVS QL++KQQKH+KFLQFLALS+CH
Sbjct: 661 DGETNVFVRTSKSIVDTLAKHWTTTRGAEIVAMAVVSTQLSDKQQKHKKFLQFLALSRCH 720
Query: 721 EELCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDL 780
EELC KQR +LQII+E GEKL G+IQLRELQNMI+Q+R G SS+ +SGSLWDL
Sbjct: 721 EELCSKQRESLQIIMEHGEKLIGMIQLRELQNMISQNRLAGAGSPYSSSESGISGSLWDL 780
Query: 781 IQIVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVC 840
IQ+VGERARRNTVLLMDRDNAEVFYSK+S++EEVFYCLDR L+++++ P VQ+QR C
Sbjct: 781 IQLVGERARRNTVLLMDRDNAEVFYSKVSDIEEVFYCLDRQLEYVISAELPLMVQIQRAC 840
Query: 841 ELSKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGI 900
ELS ACVTL++ A HYKNE+H+WYP P+GL PWY PVVR+G WS+ASFM+++L+D +G+
Sbjct: 841 ELSNACVTLIQAATHYKNENHIWYPSPEGLTPWYCQPVVRNGQWSVASFMLQLLNDRTGL 900
Query: 901 DLSSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIK 960
D+S KSD+ S+LE LA+V+LEAY GAITAKVERGEEHKGL +EYW +RDTLL+SLYQ +K
Sbjct: 901 DMSLKSDLYSNLEALAEVLLEAYTGAITAKVERGEEHKGLLNEYWNRRDTLLNSLYQVVK 960
Query: 961 VFVNARFQDSVVE-KEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMH 1020
FV + +QDS +EQKEVIL++LSS+LLS+A+RHE Y TLWNICCDLND+ LL+ +MH
Sbjct: 961 GFVESGYQDSNEGIEEQKEVILKKLSSSLLSIAKRHEGYLTLWNICCDLNDAVLLRNIMH 1020
Query: 1021 KSMGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSG 1080
+SMG + GFSYFVF+Q+YE+ QF KLLRLGEEF ++L FL++H DL W HE+FLHQ+S
Sbjct: 1021 ESMGPKAGFSYFVFRQLYESRQFSKLLRLGEEFQEDLSIFLQEHQDLRWLHELFLHQFSS 1080
Query: 1081 ASKTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAE 1140
AS+TL LALS D S S+A K DSGT L ER+RLL+LSKIA +AGKDA YE +
Sbjct: 1081 ASETLQLLALSQDGSSISSAEKGINPDSGTSGKKLVERRRLLNLSKIAVLAGKDADYETK 1140
Query: 1141 IERIDADLKILKSQEDILNLYP-DDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDV 1200
I+RI+ADLKILK QE+I+ L P D++ ++ + +RLLPP DLI+LCLK + PEL L AF+V
Sbjct: 1141 IKRIEADLKILKLQEEIIRLLPSDEVVEKGMEQRLLPPRDLIELCLKAEIPELPLLAFEV 1200
Query: 1201 FAWTSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAAR 1260
AWTSSSFRK+N++LLEECW+ A +QDDW KLY+ S+AEGWSDEDT+RVL++TMLF A+
Sbjct: 1201 LAWTSSSFRKANRSLLEECWKCAANQDDWGKLYEASVAEGWSDEDTLRVLRETMLFQASN 1260
Query: 1261 RCYGPESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMR 1320
RCYGP +E ++GGFDEVL LRQE+ E+PN+++SGSSVE ILMQHKDF DAGKLMLTA+M
Sbjct: 1261 RCYGPGTETFEGGFDEVLVLRQENMEIPNLKESGSSVETILMQHKDFPDAGKLMLTAVMM 1289
Query: 1321 GSEQVDIGGADGPIPME 1326
GS ++D+ +GP PME
Sbjct: 1321 GSVEIDVRSYEGPSPME 1289
BLAST of Spo05021.1 vs. UniProtKB/TrEMBL
Match:
W9SLZ9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_020008 PE=4 SV=1)
HSP 1 Score: 1602.0 bits (4147), Expect = 0.000e+0
Identity = 826/1334 (61.92%), Positives = 1021/1334 (76.54%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+P T K+S+ S+ ++P+ T +SP+TPLA NR S +D +P+RP TGTPAP
Sbjct: 1 MFSPGT--KRSHGSSRRDPSLGHAAT----ASPVTPLAENRRSSSDNLVPHRPATGTPAP 60
Query: 61 WASRLSVLARIPPAKKSE--DNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
WA RLSVLARIP K+E D+ + +PVYVG+FPQ V DEQ +++ VPG+A I GGM
Sbjct: 61 WAPRLSVLARIPIVNKNEKGDDIDPIKPVYVGEFPQVVRDEQTKLLQKRVPGEAFIYGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISNRNAYLGDTWLLCLINWE 180
+K ++W+ICGS+LF+WSYLSPAAS +CV+L++PS+V E + GDTW LC +NW+
Sbjct: 121 EKGKCIAWIICGSRLFIWSYLSPAASMKCVVLEIPSNVLENGDIRRSDGDTWSLCAVNWD 180
Query: 181 NVH-QSSSFVKQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMGNG 240
++ V+ A +VLCNQ++RA++YW DIYS + P IS AS +EL +
Sbjct: 181 MTSSRTKKVVEHNNYAAIVLCNQKTRAVIYWRDIYSKVKTAPVISTASSDELEVIF---- 240
Query: 241 KSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAGVL 300
+ +R Q +R S FNSLI SA+P++ H CVA+A+SSNGE WQF+CSP+G+
Sbjct: 241 -TTLSRQQHSSRQRSGLTELYSFNSLIASAVPNSQHVCVAIASSSNGELWQFLCSPSGIK 300
Query: 301 RQNVCHILSTSISGCS----HPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFS 360
RQ V H ++S++ H + SKGYPRSL+WRF + S+RQFFLLTDH+I CF+
Sbjct: 301 RQKV-HWNTSSLTSQGGDNGHVTGSKGYPRSLIWRFSHSSVHESNRQFFLLTDHEIHCFN 360
Query: 361 IELSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKD 420
+EL + +SK+WSHEIIGTD DLGIKKDLAGQK++WPLD+QVD GK +TILVA FCKD
Sbjct: 361 VELFLDINVSKVWSHEIIGTDGDLGIKKDLAGQKRVWPLDVQVDIYGKVITILVATFCKD 420
Query: 421 RLSSSSYTQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRL 480
R+SSSSYTQYSLLTMQYKSG+ + + +LEKK+P+QVIIPKARVE+E+FL SMRL
Sbjct: 421 RVSSSSYTQYSLLTMQYKSGVSTEVG---HERILEKKAPIQVIIPKARVEDEDFLFSMRL 480
Query: 481 RAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAW 540
R GGKPSGS +ILS DGTATVS+++RN TRLYQFDLPYDAGKV+DASV PSTDD E GAW
Sbjct: 481 RVGGKPSGSTIILSNDGTATVSHYYRNFTRLYQFDLPYDAGKVLDASVLPSTDDGE-GAW 540
Query: 541 VVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEE-KNLLFSAGVSPGKAGV 600
VVLTEKAG+WAIPEKAV+ GGVEPPERSLSRKGSSNE ++ EE KNL F ++P +A
Sbjct: 541 VVLTEKAGIWAIPEKAVILGGVEPPERSLSRKGSSNEGSAQEERKNLTFGGNMAPRRASS 600
Query: 601 DARDIDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDA 660
+A++ Q+A + RN DEE+E LL +LFHDF LSGQV+G LEKL +RAFER
Sbjct: 601 EAQEPVDRQKAVKGVIARRNTLDEESETLLGQLFHDFQLSGQVEGSLEKLQKSRAFERGE 660
Query: 661 ETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEE 720
ETN FAR SKSIVD+LAKHWTTTRGAEI+++AVVS+QL +KQQKHEKFLQFLALSKCHEE
Sbjct: 661 ETNVFARLSKSIVDTLAKHWTTTRGAEILAMAVVSSQLLDKQQKHEKFLQFLALSKCHEE 720
Query: 721 LCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQ 780
LC +QRH+LQIILE GEKL G+IQLRELQN I+Q+RS G+ SS + SG+LWDLIQ
Sbjct: 721 LCSRQRHSLQIILEHGEKLAGMIQLRELQNAISQNRSAGIGSSHSSQEIQTSGALWDLIQ 780
Query: 781 IVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCEL 840
+VGERARR+TVLLMDRDNAEVFYSKIS+LEEVFYCLDR LD+I++ QPF VQ QR CEL
Sbjct: 781 LVGERARRSTVLLMDRDNAEVFYSKISDLEEVFYCLDRQLDYIISTEQPFGVQNQRACEL 840
Query: 841 SKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDL 900
S ACV +++ AMHYKNEHH+WYPPP+GL PWY VVR G+WSIASFM+++L ++S +D+
Sbjct: 841 SNACVAIVQTAMHYKNEHHLWYPPPEGLTPWYCKHVVRSGIWSIASFMLQLLKEASTLDV 900
Query: 901 SSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVF 960
S+KSD+ +HLE LA+++LEAY GAI AKVE GE+HKGL DEYW +RD LLDSLYQQ+K F
Sbjct: 901 SAKSDLYTHLEALAEILLEAYAGAIKAKVELGEDHKGLLDEYWCRRDLLLDSLYQQVKEF 960
Query: 961 VNARFQD-SVVEKEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKS 1020
V QD S E K+ L++ SS LLS+A RHE Y TLW ICCDLNDS LL+ LM +S
Sbjct: 961 VEDGHQDISEETSEHKKDSLKKFSSQLLSIANRHECYNTLWKICCDLNDSELLRNLMRES 1020
Query: 1021 MGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGAS 1080
MG GGFSYFVFKQ+Y++ QF KLLRLGEEF +EL FLK H DLLW HE+FLHQ+S AS
Sbjct: 1021 MGPNGGFSYFVFKQLYKSRQFSKLLRLGEEFLEELSIFLKRHQDLLWLHELFLHQFSLAS 1080
Query: 1081 KTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIE 1140
+TLH LALS ERS S + + GT L +RKRLL+LSKIAAIAGK EA ++
Sbjct: 1081 ETLHLLALSQHERSMSET-EGTDPHYGTMVPKLQDRKRLLNLSKIAAIAGKGE--EANVK 1140
Query: 1141 RIDADLKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAW 1200
RI+ADLKILK QE+I+ DD KQ +G RLL P +LIKLCL+ ++PEL L AFDVFAW
Sbjct: 1141 RIEADLKILKLQEEIVKFLSDDGTKQSVGERLLNPEELIKLCLEMKSPELALCAFDVFAW 1200
Query: 1201 TSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCY 1260
TSSSFRK++K LLEECW+ A +QDDW KLYQ S EGW+DE+T++ L+ TMLF A+ RCY
Sbjct: 1201 TSSSFRKAHKNLLEECWKNAAEQDDWSKLYQASTIEGWTDEETLQNLKHTMLFKASSRCY 1260
Query: 1261 GPESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSE 1320
GP +E + GFD+VLPLRQE+SE P M+DSGSSV LMQHKD+ +AGKL+LTA+M GS
Sbjct: 1261 GPLAETFGEGFDQVLPLRQETSEPPIMKDSGSSVLANLMQHKDYPEAGKLLLTAIMLGSL 1315
Query: 1321 QVDIGGADGPIPME 1326
+ D G +G PME
Sbjct: 1321 EDDTGEEEGTTPME 1315
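The percentages in each HSP summary line appear to be the match counts divided by the alignment length. The sketch below recomputes them from the count/length pairs as a sanity check. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern accepts both spellings of the positives label seen in reports like this one.

```python
import re

# Matches "Identity = 826/1334" and "Positives = 1021/1334" style fields.
# The misspelled label "Postives" is accepted too (assumption: some reports emit it).
HSP_STATS = re.compile(r"(Identity|Positives|Postives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line):
    """Recompute percentages from the count/length pairs in an HSP summary line."""
    result = {}
    for label, hits, length in HSP_STATS.findall(line):
        key = "Positives" if label.startswith("Pos") else label
        result[key] = round(100 * int(hits) / int(length), 2)
    return result

line = "Identity = 826/1334 (61.92%), Positives = 1021/1334 (76.54%), Query Frame = 1"
print(hsp_percentages(line))
```

For the Morus notabilis hit above this gives 61.92 for identity and 76.54 for positives, matching the reported values.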
BLAST of Spo05021.1 vs. UniProtKB/TrEMBL
Match:
A0A061DN01_THECC (Nucleoporin, Nup133/Nup155-like, putative isoform 1 OS=Theobroma cacao GN=TCM_002334 PE=4 SV=1)
HSP 1 Score: 1592.0 bits (4121), Expect = 0.000e+0
Identity = 803/1333 (60.24%), Positives = 1017/1333 (76.29%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+P K+S S+ K N Q ++ SP+TP NR S + SIP+RP TGTPAP
Sbjct: 1 MFSPGL--KRSKLSSRKERNLGQN--LATPDSPVTPYTVNRKSAHETSIPDRPNTGTPAP 60
Query: 61 WASRLSVLARIPPAKKSE--DNRESAEPVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
WA RLSVLARIPPA K+E D + +PV+VG+FPQ VHDEQ + +R +P D CI+GGM
Sbjct: 61 WAPRLSVLARIPPANKNEKGDELDPIKPVFVGEFPQVVHDEQTSFLRKCLPADVCISGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDVSEISN--RNAYLGDTWLLCLIN 180
+K LSW+ICG+K+F+WSYLS AASK+C+ L+LPSDV E ++ RN+Y + WLL ++N
Sbjct: 121 EKGTCLSWIICGNKIFIWSYLSSAASKKCITLELPSDVLENADVGRNSYHCNNWLLTVVN 180
Query: 181 WENVHQSSSFV-KQLTSAGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEE-LGISSM 240
W + + ++ V K SAG+VLCNQ++RA+VYW DI++D P S AS +E L SS
Sbjct: 181 WNSTSKGTNKVPKDCYSAGIVLCNQKTRAVVYWSDIFADVGNAPVTSFASSDESLVTSSP 240
Query: 241 GNGKSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPA 300
+G + +R Q +R S FNSLI SAIP H CVALA SS+GE WQF CSP+
Sbjct: 241 IDGNNTTSRQQQRSRHGMSFIGSSSFNSLIASAIPGTQHVCVALACSSSGELWQFYCSPS 300
Query: 301 GVLRQNVC-HILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFS 360
G+ V +I ++ +G SKGYPRS++WR + + +RQF LLTD +IQCF+
Sbjct: 301 GIQCDKVYQNIQNSQGTGIGQLVGSKGYPRSMIWRLRYFSVSDHNRQFLLLTDREIQCFN 360
Query: 361 IELSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKD 420
I+L P+ +SKLWS EI+G D DLGIKKDLAGQK+IWPLD+QVD+ GK +T+LVA FCKD
Sbjct: 361 IKLCPDIEVSKLWSQEIVGNDGDLGIKKDLAGQKRIWPLDLQVDDPGKVITVLVATFCKD 420
Query: 421 RLSSSSYTQYSLLTMQYKSGLDETLKGFTYGSVLEKKSPVQVIIPKARVEEEEFLLSMRL 480
R+SSSSYTQYSLLTMQ+KSG+ ++ + VLEKK+P+QVIIPKARVE+E+FL SMRL
Sbjct: 421 RVSSSSYTQYSLLTMQHKSGVRVSISSDVHERVLEKKAPIQVIIPKARVEDEDFLFSMRL 480
Query: 481 RAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGAW 540
+ GGKPSGS +ILS DGTATVS+++RN+TRLYQFDLPYDAGKV+DASV PSTDD E+GAW
Sbjct: 481 QVGGKPSGSTIILSGDGTATVSHYYRNSTRLYQFDLPYDAGKVLDASVLPSTDDGEDGAW 540
Query: 541 VVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEK-NLLFSAGVSPGKAGV 600
VVLTEKAG+WAIPEKAV+ GGVEPPERSLSRKGSSNE ++ EE+ NL+F+ V+P +A
Sbjct: 541 VVLTEKAGIWAIPEKAVVLGGVEPPERSLSRKGSSNEGSAQEERRNLMFAGNVAPRRASS 600
Query: 601 DARDIDAMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFERDA 660
DA D Q TG++ R AQDEE+EALL + FH+FL+SG+VDG LEKL N+ AFERD
Sbjct: 601 DAWDAGDRQPPVMTGIIRRTAQDEESEALLGQFFHEFLISGKVDGSLEKLKNSGAFERDG 660
Query: 661 ETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKCHEE 720
ET+ F R SKSIVD+LAKHWTTTRGAEIVSL ++S QL +KQQKH+KFLQFLALSKCHEE
Sbjct: 661 ETSIFVRTSKSIVDTLAKHWTTTRGAEIVSLGIISAQLMDKQQKHQKFLQFLALSKCHEE 720
Query: 721 LCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWDLIQ 780
LC QRH+LQIILE GEKL+ IIQLRELQN+I+Q+RS GV S+ +SG+LWDLIQ
Sbjct: 721 LCSGQRHSLQIILEHGEKLSAIIQLRELQNVISQNRSTGVGSTHLSSETLISGALWDLIQ 780
Query: 781 IVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRVCEL 840
+VGERARRNTVLLMDRDNAEVFYSK+S+ ++VFYCL+RHL++I+++ QP ++Q+QR CEL
Sbjct: 781 LVGERARRNTVLLMDRDNAEVFYSKVSDFDQVFYCLERHLEYIISLEQPVEIQIQRSCEL 840
Query: 841 SKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSGIDL 900
S ACVT+ R AM YKNE+H+WYPPP+GL PWY VVR+GLWSIASFM+++L ++S +D+
Sbjct: 841 SNACVTIFRAAMDYKNEYHLWYPPPEGLTPWYCQLVVRNGLWSIASFMLQLLKETSELDV 900
Query: 901 SSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQIKVF 960
S+KS++ SHLE L +V+LE GAITAK+ERGEEHKGL +EYW +RD LLDSLYQQ+K
Sbjct: 901 SAKSELYSHLEALTEVLLEVSSGAITAKIERGEEHKGLLNEYWSRRDALLDSLYQQVKGL 960
Query: 961 VNARFQDSVVE-KEQKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLMHKS 1020
V A QD +E + ILR+LSS+LLS +++HEAY T+WNICCDLNDS LL+ LMH+S
Sbjct: 961 VEAGNQDITESIEENNQEILRKLSSSLLSTSKQHEAYQTMWNICCDLNDSGLLRNLMHES 1020
Query: 1021 MGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYSGAS 1080
+G RGGFSYFVFKQ+YE QF KLLRLGEEF ++L FL H DLLW HEVFLHQ+S AS
Sbjct: 1021 VGPRGGFSYFVFKQLYEKKQFSKLLRLGEEFQEDLSNFLNHHRDLLWLHEVFLHQFSAAS 1080
Query: 1081 KTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEAEIE 1140
+TLH LALS +E S S E + D TL +R+R+L+LS IAA AGKD + +++
Sbjct: 1081 ETLHILALSQEEDSISTTEDETDADHANPVPTLADRRRILNLSMIAAFAGKDPDSQPKVK 1140
Query: 1141 RIDADLKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDVFAW 1200
RI+ADLKIL+ QE+I+ + P D Q + + LL P +LI+LCL+ ++ EL L+ FDVFAW
Sbjct: 1141 RIEADLKILRLQEEIMEVLPTDDTMQHVEKHLLRPEELIELCLQSRSRELALQVFDVFAW 1200
Query: 1201 TSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAARRCY 1260
TSSSFRKS++ LLEECW+ A DQD W +LY+ S+ EGWSDE+T++ L +T+LF A+ RCY
Sbjct: 1201 TSSSFRKSHRNLLEECWKNAADQDPWSQLYEASVTEGWSDEETLQQLSQTILFQASNRCY 1260
Query: 1261 GPESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALMRGSE 1320
GP++E + GFDEVLPLRQE+ E ++ D SSVE ILMQH+DF AGKLMLTA+M G
Sbjct: 1261 GPKAETIEEGFDEVLPLRQENLEAASLNDKRSSVEAILMQHRDFPYAGKLMLTAIMLGCV 1320
Query: 1321 QVDIGGADGPIPM 1325
Q +G P+
Sbjct: 1321 QDHAKKEEGLSPV 1329
BLAST of Spo05021.1 vs. ExPASy Swiss-Prot
Match:
NU133_ARATH (Nuclear pore complex protein NUP133 OS=Arabidopsis thaliana GN=NUP133 PE=1 SV=1)
HSP 1 Score: 1347.4 bits (3486), Expect = 0.000e+0
Identity = 723/1339 (54.00%), Positives = 927/1339 (69.23%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+P T + K + K P + + SP+TP NRN+ I +RP TGTPAP
Sbjct: 1 MFSPLTKRAKQSSRNEKTP----RNRVPPPDSPVTPATQNRNNF----ISDRPATGTPAP 60
Query: 61 WASRLSVLARIPPAKKSEDNRESAE--PVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
WA RLSVLAR+ P + +S + PV+VG+FPQ + DEQ S PGDAC++GGM
Sbjct: 61 WAPRLSVLARVSPGNNGDKGVDSDQLKPVFVGEFPQLLRDEQ------SYPGDACVSGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDV--SEISNRNAYLGDTWLLCLIN 180
DKE LSW I GSK+FVWS+L+ S++CV+L+LP V +E S G +WL+ +++
Sbjct: 121 DKETCLSWFITGSKVFVWSHLTTLPSRKCVVLELPVVVLVNEESGSGLQDGKSWLVNVVS 180
Query: 181 WENVHQSSSFVKQLTS-AGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMG 240
W+ +++ + S GVV+CN+++RA+VYW DI+S P A I
Sbjct: 181 WDTSAGAATRASRSRSPVGVVMCNRKTRAVVYWSDIFSGQEAAP----AEKARHLIKRQS 240
Query: 241 NGKSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAG 300
NG + S S NSLI +A+ + C+A+A SSNGE WQF CSP G
Sbjct: 241 NG------------IRSSRAENSDLNSLITTAVAAAERLCIAIACSSNGELWQFTCSPTG 300
Query: 301 VLRQNVCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIE 360
V V +S+S SVS+GYPRSL+WRF S +F +LTD I CF+IE
Sbjct: 301 VKSNQVQLNISSS-------SVSEGYPRSLIWRFSQGLARESCWEFLMLTDCDIHCFTIE 360
Query: 361 LSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRL 420
P+ T+S++W HEI+GTD D GIKKD+A QK+IWPLD+QVD++GK +T+LVA C DR
Sbjct: 361 PYPDLTVSEVWQHEIVGTDGDSGIKKDIASQKQIWPLDLQVDDQGKVITVLVATICMDRA 420
Query: 421 SSSSYTQYSLLTMQYKSGLDETLKGFTYG---SVLEKKSPVQVIIPKARVEEEEFLLSMR 480
SSSSYTQYSLLT+Q+KS + F G VLEK+ P+QVIIPKARVE+++FL SMR
Sbjct: 421 SSSSYTQYSLLTLQHKSEM-----RFADGREEKVLEKQGPIQVIIPKARVEDKDFLFSMR 480
Query: 481 LRAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGA 540
LR GG+P GSA+ILS DGTATV Y ++TRLY+FDLPYDAGKV+DASV STD+ E GA
Sbjct: 481 LRVGGRPPGSAIILSGDGTATVCYCHGSSTRLYKFDLPYDAGKVLDASVLSSTDEHEYGA 540
Query: 541 WVVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLL-FSAGVSPGKAG 600
W VLTEKAGVWAIPEKAV+ GGVEPPERSLSRK SSNE+++ +E + + + G+
Sbjct: 541 WTVLTEKAGVWAIPEKAVVLGGVEPPERSLSRKNSSNERSTRDETRVTPYGVDRTAGREN 600
Query: 601 VDARDID--AMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFE 660
D ++I+ + GFT + A+DEE+EALL +LF FLLSG+VDG LEKL+ + AF+
Sbjct: 601 SDIQNIEDKGNPKMGFT---RQTARDEESEALLGQLFEGFLLSGKVDGSLEKLSQSGAFD 660
Query: 661 RDAETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKC 720
RD E N FAR SKSIVD+LAKHWTTTRGAEIV++ V+S+QL EKQQKHE FL FLALSKC
Sbjct: 661 RDGEANVFARKSKSIVDTLAKHWTTTRGAEIVAMTVISSQLVEKQQKHENFLHFLALSKC 720
Query: 721 HEELCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWD 780
HEELC KQRH+LQIILE+GEKL +IQLRELQNMINQ+RS + + +VS +LWD
Sbjct: 721 HEELCSKQRHSLQIILENGEKLAAMIQLRELQNMINQNRSARFGSPQAGSEDQVSCALWD 780
Query: 781 LIQIVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRV 840
LIQ VGERARRNTVLLMDRDNAEVFYSK+S LEEVFYCL+R L++I+ QP Q+QR
Sbjct: 781 LIQFVGERARRNTVLLMDRDNAEVFYSKVSELEEVFYCLNRQLEYIIRADQPLGTQLQRA 840
Query: 841 CELSKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSG 900
CELS ACVT+L+ A+ YKNEH MWYPP +GL+PW+S VV +GLW IASFM+ +L ++S
Sbjct: 841 CELSNACVTILQTALDYKNEHQMWYPPLEGLIPWHSQTVVCNGLWCIASFMLHLLTEASR 900
Query: 901 IDLSSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQI 960
ID+S+KSDI +HLEVL +V+LEA G+ AK+ER EE+KGL +EYW +RDT+ DSLY+Q
Sbjct: 901 IDISAKSDIYTHLEVLTEVLLEACAGSTFAKLEREEENKGLLNEYWTRRDTIFDSLYRQA 960
Query: 961 KVFVNARFQDSVVEKE-QKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLM 1020
K F+ A Q E E I R SNL+S+A+RH Y +W IC DLND+ LL+ LM
Sbjct: 961 KEFMEAEIQGIRERTEATDEDIFRNRCSNLISIAKRHAGYKIMWKICYDLNDTGLLRNLM 1020
Query: 1021 HKSMGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYS 1080
H+ +G +GGFSYFVF+Q+Y+ QF KLLRLGEEF EL FLK H DL+W H+VFLHQ+S
Sbjct: 1021 HEGVGPQGGFSYFVFQQLYDMKQFSKLLRLGEEFQDELLIFLKRHSDLVWLHQVFLHQFS 1080
Query: 1081 GASKTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEA 1140
AS TLH LALS DE S + + + T +RKR L+LSKIA +A KDA E+
Sbjct: 1081 SASDTLHTLALSQDEESMTTVEERTGPEPEDVQPTFADRKRFLNLSKIAYVADKDADSES 1140
Query: 1141 EIERIDADLKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDV 1200
+++RI+ADL +LK QE+I P+ + RL P +LI+ CL Q ++AF+V
Sbjct: 1141 KVKRIEADLNLLKLQEEITKALPNG----EARNRLFRPEELIETCLNIQGRWTAIKAFEV 1200
Query: 1201 FAWTSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAAR 1260
FAWTSSSFR+++++LLEECWR A DQDDW + +Q S EGWS+E+T++ L+ T LF A++
Sbjct: 1201 FAWTSSSFRENHRSLLEECWRNAADQDDWDRHHQASTNEGWSEEETLQNLRNTALFQASK 1260
Query: 1261 RCYGP-ESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALM 1320
RCYGP +DG F +VLPLR+E N +DS SSVE +LM HKDF++AGKLMLTA+M
Sbjct: 1261 RCYGPTRVNTFDGDFAQVLPLRRE-----NPEDSTSSVEDVLMSHKDFAEAGKLMLTAIM 1285
Query: 1321 RGS-EQVDIGGADGPIPME 1326
G E+ I + PME
Sbjct: 1321 LGCVEEEGIVAEEFSSPME 1285
BLAST of Spo05021.1 vs. TAIR (Arabidopsis)
Match:
AT2G05120.1 (Nucleoporin, Nup133/Nup155-like)
HSP 1 Score: 1347.4 bits (3486), Expect = 0.000e+0
Identity = 723/1339 (54.00%), Positives = 927/1339 (69.23%), Query Frame = 1
Query: 1 MFTPSTTKKKSNYSTFKNPNATQQKTISFDSSPITPLAANRNSLTDGSIPNRPQTGTPAP 60
MF+P T + K + K P + + SP+TP NRN+ I +RP TGTPAP
Sbjct: 1 MFSPLTKRAKQSSRNEKTP----RNRVPPPDSPVTPATQNRNNF----ISDRPATGTPAP 60
Query: 61 WASRLSVLARIPPAKKSEDNRESAE--PVYVGDFPQAVHDEQANVMRNSVPGDACIAGGM 120
WA RLSVLAR+ P + +S + PV+VG+FPQ + DEQ S PGDAC++GGM
Sbjct: 61 WAPRLSVLARVSPGNNGDKGVDSDQLKPVFVGEFPQLLRDEQ------SYPGDACVSGGM 120
Query: 121 DKEYGLSWMICGSKLFVWSYLSPAASKRCVILDLPSDV--SEISNRNAYLGDTWLLCLIN 180
DKE LSW I GSK+FVWS+L+ S++CV+L+LP V +E S G +WL+ +++
Sbjct: 121 DKETCLSWFITGSKVFVWSHLTTLPSRKCVVLELPVVVLVNEESGSGLQDGKSWLVNVVS 180
Query: 181 WENVHQSSSFVKQLTS-AGVVLCNQRSRAIVYWHDIYSDDRVTPRISLASPEELGISSMG 240
W+ +++ + S GVV+CN+++RA+VYW DI+S P A I
Sbjct: 181 WDTSAGAATRASRSRSPVGVVMCNRKTRAVVYWSDIFSGQEAAP----AEKARHLIKRQS 240
Query: 241 NGKSPPTRLQSDNRLESLSYNQSVFNSLIVSAIPSNYHACVALATSSNGEFWQFICSPAG 300
NG + S S NSLI +A+ + C+A+A SSNGE WQF CSP G
Sbjct: 241 NG------------IRSSRAENSDLNSLITTAVAAAERLCIAIACSSNGELWQFTCSPTG 300
Query: 301 VLRQNVCHILSTSISGCSHPSVSKGYPRSLLWRFPSLQMENSHRQFFLLTDHQIQCFSIE 360
V V +S+S SVS+GYPRSL+WRF S +F +LTD I CF+IE
Sbjct: 301 VKSNQVQLNISSS-------SVSEGYPRSLIWRFSQGLARESCWEFLMLTDCDIHCFTIE 360
Query: 361 LSPNPTLSKLWSHEIIGTDNDLGIKKDLAGQKKIWPLDIQVDERGKELTILVAIFCKDRL 420
P+ T+S++W HEI+GTD D GIKKD+A QK+IWPLD+QVD++GK +T+LVA C DR
Sbjct: 361 PYPDLTVSEVWQHEIVGTDGDSGIKKDIASQKQIWPLDLQVDDQGKVITVLVATICMDRA 420
Query: 421 SSSSYTQYSLLTMQYKSGLDETLKGFTYG---SVLEKKSPVQVIIPKARVEEEEFLLSMR 480
SSSSYTQYSLLT+Q+KS + F G VLEK+ P+QVIIPKARVE+++FL SMR
Sbjct: 421 SSSSYTQYSLLTLQHKSEM-----RFADGREEKVLEKQGPIQVIIPKARVEDKDFLFSMR 480
Query: 481 LRAGGKPSGSAVILSRDGTATVSYFWRNATRLYQFDLPYDAGKVIDASVFPSTDDTEEGA 540
LR GG+P GSA+ILS DGTATV Y ++TRLY+FDLPYDAGKV+DASV STD+ E GA
Sbjct: 481 LRVGGRPPGSAIILSGDGTATVCYCHGSSTRLYKFDLPYDAGKVLDASVLSSTDEHEYGA 540
Query: 541 WVVLTEKAGVWAIPEKAVLFGGVEPPERSLSRKGSSNEKTSGEEKNLL-FSAGVSPGKAG 600
W VLTEKAGVWAIPEKAV+ GGVEPPERSLSRK SSNE+++ +E + + + G+
Sbjct: 541 WTVLTEKAGVWAIPEKAVVLGGVEPPERSLSRKNSSNERSTRDETRVTPYGVDRTAGREN 600
Query: 601 VDARDID--AMQRAGFTGVVGRNAQDEEAEALLNRLFHDFLLSGQVDGFLEKLNNARAFE 660
D ++I+ + GFT + A+DEE+EALL +LF FLLSG+VDG LEKL+ + AF+
Sbjct: 601 SDIQNIEDKGNPKMGFT---RQTARDEESEALLGQLFEGFLLSGKVDGSLEKLSQSGAFD 660
Query: 661 RDAETNAFARASKSIVDSLAKHWTTTRGAEIVSLAVVSNQLAEKQQKHEKFLQFLALSKC 720
RD E N FAR SKSIVD+LAKHWTTTRGAEIV++ V+S+QL EKQQKHE FL FLALSKC
Sbjct: 661 RDGEANVFARKSKSIVDTLAKHWTTTRGAEIVAMTVISSQLVEKQQKHENFLHFLALSKC 720
Query: 721 HEELCMKQRHALQIILEDGEKLTGIIQLRELQNMINQSRSIGVSPQLSSANGEVSGSLWD 780
HEELC KQRH+LQIILE+GEKL +IQLRELQNMINQ+RS + + +VS +LWD
Sbjct: 721 HEELCSKQRHSLQIILENGEKLAAMIQLRELQNMINQNRSARFGSPQAGSEDQVSCALWD 780
Query: 781 LIQIVGERARRNTVLLMDRDNAEVFYSKISNLEEVFYCLDRHLDHIVNVGQPFKVQVQRV 840
LIQ VGERARRNTVLLMDRDNAEVFYSK+S LEEVFYCL+R L++I+ QP Q+QR
Sbjct: 781 LIQFVGERARRNTVLLMDRDNAEVFYSKVSELEEVFYCLNRQLEYIIRADQPLGTQLQRA 840
Query: 841 CELSKACVTLLRCAMHYKNEHHMWYPPPDGLVPWYSHPVVRDGLWSIASFMIKILDDSSG 900
CELS ACVT+L+ A+ YKNEH MWYPP +GL+PW+S VV +GLW IASFM+ +L ++S
Sbjct: 841 CELSNACVTILQTALDYKNEHQMWYPPLEGLIPWHSQTVVCNGLWCIASFMLHLLTEASR 900
Query: 901 IDLSSKSDICSHLEVLADVVLEAYVGAITAKVERGEEHKGLSDEYWRKRDTLLDSLYQQI 960
ID+S+KSDI +HLEVL +V+LEA G+ AK+ER EE+KGL +EYW +RDT+ DSLY+Q
Sbjct: 901 IDISAKSDIYTHLEVLTEVLLEACAGSTFAKLEREEENKGLLNEYWTRRDTIFDSLYRQA 960
Query: 961 KVFVNARFQDSVVEKE-QKEVILRELSSNLLSVARRHEAYCTLWNICCDLNDSTLLKQLM 1020
K F+ A Q E E I R SNL+S+A+RH Y +W IC DLND+ LL+ LM
Sbjct: 961 KEFMEAEIQGIRERTEATDEDIFRNRCSNLISIAKRHAGYKIMWKICYDLNDTGLLRNLM 1020
Query: 1021 HKSMGARGGFSYFVFKQMYENGQFPKLLRLGEEFPQELETFLKDHPDLLWFHEVFLHQYS 1080
H+ +G +GGFSYFVF+Q+Y+ QF KLLRLGEEF EL FLK H DL+W H+VFLHQ+S
Sbjct: 1021 HEGVGPQGGFSYFVFQQLYDMKQFSKLLRLGEEFQDELLIFLKRHSDLVWLHQVFLHQFS 1080
Query: 1081 GASKTLHGLALSGDERSFSAAGKEAELDSGTKDLTLPERKRLLHLSKIAAIAGKDAVYEA 1140
AS TLH LALS DE S + + + T +RKR L+LSKIA +A KDA E+
Sbjct: 1081 SASDTLHTLALSQDEESMTTVEERTGPEPEDVQPTFADRKRFLNLSKIAYVADKDADSES 1140
Query: 1141 EIERIDADLKILKSQEDILNLYPDDLEKQKIGRRLLPPWDLIKLCLKGQTPELLLRAFDV 1200
+++RI+ADL +LK QE+I P+ + RL P +LI+ CL Q ++AF+V
Sbjct: 1141 KVKRIEADLNLLKLQEEITKALPNG----EARNRLFRPEELIETCLNIQGRWTAIKAFEV 1200
Query: 1201 FAWTSSSFRKSNKTLLEECWRGAVDQDDWQKLYQFSIAEGWSDEDTIRVLQKTMLFLAAR 1260
FAWTSSSFR+++++LLEECWR A DQDDW + +Q S EGWS+E+T++ L+ T LF A++
Sbjct: 1201 FAWTSSSFRENHRSLLEECWRNAADQDDWDRHHQASTNEGWSEEETLQNLRNTALFQASK 1260
Query: 1261 RCYGP-ESEMYDGGFDEVLPLRQESSELPNMQDSGSSVEVILMQHKDFSDAGKLMLTALM 1320
RCYGP +DG F +VLPLR+E N +DS SSVE +LM HKDF++AGKLMLTA+M
Sbjct: 1261 RCYGPTRVNTFDGDFAQVLPLRRE-----NPEDSTSSVEDVLMSHKDFAEAGKLMLTAIM 1285
Query: 1321 RGS-EQVDIGGADGPIPME 1326
G E+ I + PME
Sbjct: 1321 LGCVEEEGIVAEEFSSPME 1285
The following BLAST results are available for this feature: