Spo18849.1 (mRNA)

Overview
Name: Spo18849.1
Type: mRNA
Organism: Spinacia oleracea (Spinach)
Description: Hydroxyproline-rich glycoprotein
Location: chr2 : 48021594 .. 48031171 (+)
Sequence length: 1655
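
The location above spans far more bases than the 1655-nt mRNA, because the genomic interval includes introns. A minimal sketch of how that span can be computed, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on this page); `parse_location` is a hypothetical helper name, not part of any database API:

```python
import re

def parse_location(text):
    """Parse 'chr2 : 48021594 .. 48031171 (+)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("chr2 : 48021594 .. 48031171 (+)")

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
genomic_span = end - start + 1
print(seqid, strand, genomic_span)
```

Under that assumption the genomic span is 9578 bp, of which 1655 nt remain in the spliced mRNA.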
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGTGCACCCAAATTGTAATCGTATAAACAAATTTAACATTCGAAATACAAATACCCCATACTCCAAATGTGGCAAATGCGGGAACGCATCTCCTTCCCCCTTCCTCCTCAGTCTTCCCGCATCTCCTTCCTTCTCCTCTCGAATCTCCTCCTCTTAAACCCTAACAACAAAACCAGCATCTTGTAGTTGTAGTCGGCGGCACCTCTGTCAACCACCTCTCCTCTGCTGCTGCTATTCTCGGTAATTATCTACAAGTATTTTTGTATTGATTTTTCTGTTGATTTTATTTGTATTTTTATTGATTATAATTCTCTTAATTGTCATATACATTGTGTTGGTTTTTAGTTAGCTTTGTTTTGATACGATTCTAATAAGATTGATTAGGATTAAAGATTCAATTTTTCTGATATGATGGGTTTCAAATTGTTAAATATATTCAGTTGGGTTAGTTTTGGTGTTGTTTTATGCCTAATTAGTGTATACCCACTTCATGATTCTACTTCACATTCGTATAATTCCACTTCATAATTCTACTTGGAATTCGCATAATCCTTTTTGATATGGTTTGATTTTTCTGACAATCACTGTTCATCTCTCTTGGCCCAGGGTGGGCGCCCCTCCTGCGCTAGCTCATGGACGGGACTGGACAGCATCAGCCTGGGTTATTAGCAGCTCCAACCACACCTCAAACTCAAACCTACTTGCATCCTCGCTTGGCCAATGCTTCAGATTCGATTCGATATCCATTAGCATCTACCGGTCGTGGATTGCTTCCGGCACCTAGAGGCATATTGCCTGATCCAGTGGCAAGTTCCATTGCCCCTAACTCGTGGGCTCTTCAGCAGTATCAACATCAGCACCATCACTCTCGATTGTCTGCTCCCTCTAGTCATCTTGTTCGCGGTCCTTTTTTGCATCCTTCTCAGGCTCATATATCACCTGTTTCATTCTCTGCTGCTGCTAGAGGCGTTCTATTCAACAATCGCCCTCAGACCATGGTTTCCTTTATTCCTTTCTTTTTCTTTCCTGCTGTCTTAACCTTTGTATATTTCTTAACTGTCTAAGCTCTAGGATTTGTTTTGTGATGCTATGTTGCTGCTTGTTTCATGGTTTTTTGGAAGCTTTTTTTTTAGAAGTTATAACTTTTTTCTGGTCGGAATGGGTCTTTTCTGGCTAGGTTTTATGGGTATTACATTGTCATATCACATGTGCCTCATTCGAGACCAAAATTCAACTATCCTGACTGGTCTGGTTTAAGGTGCAAAAGTGGGTAGACTGTTTGTGCACATAATTTCTTATTGAGGATTAAGGAGCCCTCATGCACACCAATGTGACCATTTTGGAACAACCTTTCTAATTTACTCTTTATATATCCATCTTAATTTTTGTTTTCTGGAACAATCTGTCTACTACTTTAGATTGTTGCGGTAGCACTGCATACTTATGGCCTATTATCTTCTCTTGTTCAGGTTGGTTCATCTCCCTACTCTGCTCCTGATAGTAATGATCCTAAAGATCCCATGTAAAACATCTACTCTGGCCTATTTCCTTCCTTGTGTGAACTTCGGTGTCATGGTTGCACACTTGTAGTTACGGTTCTTGTTTTGACTTGTGCAGAGACAAAGGTACTGATGATACGTTGGTCACTCTTAGAGATCGAAAGGTAAGGTTAAGGTCTACTATTCTCTTTAAGAATCATACTATAGATATTCATTGATTCCCAAGTTTCTTTAATAATTCATCTTTGCCGTGCATATTCCCTAAGAGATATCAATAATAATGAGAAATGTATGAGTAGTCTATATTATAATTTTTCCTGTCTTTTGGGGATGGAAGGAATTAACCTCTCGCTGATTTTTTTCCATTATATCGAAAGAACTGAAGTTCGTTACAGGTAATGAGCTGAAGCTCATGACACCAACTGAGCCAAGGCTAATAGCAACTAGCAAAACAGGGCTTCTAAAACCTTGTGACTCGATGCTTGTAATGTGCTAATGAGT
GCATATTGTTTTATGTGTTGGTTTAATTTGTGCTCTAGGTACTTATTCTTGTGATTCAGGTGGCTTAAGTGTAGTTTGGAAGTTGAGCTGTTTATGGAAACTTTTCATGCTTCTGATATTTTATATAGTAATATTCACCTTGTCGTTTTGTTTGAATAATTTATATGCTTAAGGTGTGGCCTGGCTTGTTTTTTTACGTAGGTCTTTACAATGAAACAATTAGTACATATTCCCTCATTAGAAACATTGTCTTCTTGTGACTTACTGGAAGTCTGGAACTGATTCTGGATAAGTGCTTTAATAGTTTTGAGAGTGGAACCTATTCCTGCTATCAGATTACCTTTTCGGAGAACAGGTAGTTTGGAAGTTGTTGGGATGAGTTGAGAAGGTGTACGATTTTATTCATATTATTTTGTGGGGAATAGTTGGGTAAAGTGATGGACACTTGGACAGCAAGGGTTTTAGACTTACTTTTAGTCCCAGTGGAGCAAATCTTTTGTGTATCAGAGTGTGAATTTTGTTCTGAAATAGCAAACCAAAGTTATAATTACGAAAAGTTCAACACTCGAACCGTAAAGAATAAAGACCCACTCACTAAATTCAAGATAAAGACTCTGTGATTCTTTTATATTTTAACATAAATAACATAGTTCCTGGGCTAAGTTTTGGCTATCATAGACTATTAACCTCGGTATTGCGAAATGTCCTTATTTTTGAAAAATACAAAGTCTACACTAATTTTCATGTAAAAATTCCCGATGAATGGCTTATTTGTGCTTGTAATATGGACATGAGTGCACAGTGCTCCTTATGCACCAAAAAGCACCGATGCATATGAAAAAGAAAATCCAATATAACAATCTGTACCAGTGAGTTTAATGGTCCTTTATGTGAGAGAAGATGTGACCTCAAATCTACTTTTCTTTGGTTAACTAGGTTGAGCACAAAGTTGAAAATAAGAACACAACACTTTTCAGTGCACAAGGCTTGTTGACTTTGTTCCTCTTAAGAAACCAGGGTAGATGGCAAAAGATTTCTAAATTATGAGCTATTCACTATCATTGCGGTTCAACAATCTCCTCCTTACTAGAAGCAACTTTTACATGAAACCCTTCTCTTTAAAGCTGGTTTGACTTGTCAAGTTCCTCACTAACTTATTTCTTGTAGGATCATAGTATGCCTGTTATGGAAGTCACTTTGCACAGACTCATAACCATAACACTTGAAGCCATGCACCTTATCCATCTTGATCTCTTTTGTTGGTTTCTCATGCAAAGTTACTTCCTTGTCATCATTAGTCAAAGAATCAGATTTCGGTATACAGATTTTCATTGAAGAAGACTCCTTAATTAACCGTTTTACACGCTCGTGAACTTGCTTTCACAGTTTCAGGGCCAAAGTGCACAATTGCACATTCATCAAAACTCTTCCAAGGTAAGAGTACGCTGAAATACCAAATTATGAAATTTGGCTGATAAATATGAACTGATATAACCCAGTAAAGTAAATTCTACTTGTCAATACTTAAAGACTAGGCAATTATTACCCACTTTGATGTAAAGGATGGAGCACCCGCTTTGAGAAATTTAAAGATTACTATCTGTTGTATAACCGTTGACCTGTAAATAATTAAATTTAATTGGTCTTTTTCTGTTTTGTGATGTGTCAATAAGATCACTAATTCGAAATCTAGTTTACAACGCGAAAGTAACCGATATTTCTCCTTGTGGAACACTCCTGAACTCACAATTCGAATCCCCAAGCCTAGAAGATCATTCCTGAAGCCCCTAAGTATTAGATCTCTAAGTCCTAGAAACACTCCTGAATCTAAGCCTAGATTTACTTACGTCCAAACCACTCCTGAAATTCAATCCAAGCAACGAAAGAATAAAACTTTCAATTTAATTAAGCAAAGCTCGCGAATTTTACTTAATTCTCAAAAATCTGAATAATAGAAATTACAAGTCTTCACTATATAGGAGTATGACTAACAGTAGTCC
TGTATAAATGGAACTCTAAAAAGCTATTCATAAAAGGTCTTAGGTAGATTTGTTTTAGCTTGTGTATTATTGCTGCAATTACGGTGTCTCGTGTGAAGAATTGTAATCGCATGCCAGGAATTTTGCAATGATAATGAAATACTGGCAACATATTATTAATGGCTCAAAGTGCAACACGTTTTGGGCTTGTAGTTGCAGTAGTATGTGTTTTCCAATTCAGCAAAATAACTTGATGATGAATTATTCTGACCCTTCACTTGGGATTATGCTGAAACGAGAAGGCATTTTAGTTGGTTGTTGTTAGAGAGTGCAAGTATGTTAAGGGCAGGTGTTTAGGTATGTGTTTCGATATATGTTCTTGTATTTGACTTATCCACTTGGCGTTTCAGCTTGTGCAGTTAGGTGTTTTCTGAAAATTTGCAGTGATTATAACTTGAAACCAAATCATGTTTTGGTCTTTCATCAGATTTCTACCCTAAGTCTCTCTTTGGATACTTTTTGTAGGTCAGGATATCAGACGGGGCATCACTGTATGCTCTCTGTCGGTCGTGGTTGAGGAATGGATGTGTAGAAGATAATCAAGTACGTAGCATTTGGTCATATCTACTGTTATTTTTATTATCTGTAGAATACAATTCATCTATGGAGCTTTCTAAATTCATGCAACCATATGCAGCTGACTTCTATATATCTGTGTATCTTGAAGCAATTGTTTGTTTGATCAAAACCATCTTTGGGTTCAGACTGGTAACAAATATTCCAAGTTTTTATATTTAACATAGGAGTTGCCATTTGTTGACATATACCCACAGACACACACATGGATAGCTTGTTGACATTCTTAATTAATAGGTCAGAAATGGAAGACTCAATTCTCCAACCGGACTGTCTACCTTACAAGAACTATTAACTTTCTTCCTCATTACTTGTCCAGTAAAACATGAAGGGTTTAGAATTAGAATGTACATAATATATCAGTATTTTATAGTTAAAAGTTTCATTTTTTTGGGCTAACATATGAATATATCAGCATAAGCAGATCACCTCCTCTCTGGTTCTCCACCTTCAAGTAGCTTATAGTTATAATCACAATACGATCATTGGAAAACACAAGTATACTAATCTTGTTCCATGCTAAGACTATTGAAGTACTTGTGACAAACTGAGTAAGATTGCTGGATTTTGATTCAACAAAGTTGAGGGAGGAGGCCCAAAAAATTGTTGAGAATTTGTTAATCACCTGTGCGGCTGTGCCTTGAGTGTGATTCTAAATTATAGAGGAGAAGAGCTTAACTATTATTTTTGTTTTTATAAAAATGGGAGCAGTAGGAAGATGAAAGGACAGATCGAGGAACAGAATCCCAACTTGTCCAATATGGAATGAGAGAAATCAAAAGCAGGAAAGAGAAATGGTCCTTTGACCACTTTAGGAAGGTGTGAATCATTATTTACCTTTTTTAAATAACATAGGTGTCATTATTTACCTTTTTTAAGTCGTAAGTCTTGTAACAGGTGTCAGACTTGTTATTTTCCTTTCCCTAAATCCTTCACTGTTCAGGTCCAAATTCAGTTCTGCTCCGGTTTGTGCTCACTCCCACTTTCAATTAGCAACTAGTTATAATCTCTATATGATCCTTGGCCATAGAATCATTATGCTGTTGGACGTTGAAGTAGCAATGAAACTATATGTTGCGTACTTGGTCAGAGTTCTTCTCTTTCATTTTTCTAAATTGTTTAAAAACTTGCAGCCACAGTATGGAGATGTTCTAAAGTCTCATCCGAAACCCTTTCATCTGCCTCTTCCTCTGCCTCCGCATGAATATTCCTCACCGAAGAGAAAGGAGAGAGATGATGAAACGGAAGTTGACGAGGTTGGTTGTTCTTTGACTGTAATATTAGTACTATTGTCTATTTTTCTTAATTATTCAAAACTGTTTAAGGGTGGTGTTTTGAGATTTTTCAATGTTTTTCTTTTTTCTTGTGCAAGGTCATAACTTCTA
CTCCCTCCGTCCACAAAGATTGCTCATACCCTTTGTTTCTTTTTTTGAAAAGTTGGTTCATTTAAAAGTGCGTGAAGCTCTCCACCCCCCCCCCCCCCCCNCCCCCCCCTTTTGTAGTTGCACATGGGTTGTTGAGTGGGGAAAAATCAATGGGACATCCATTTTAGGAACGTGGAGCAATCTTAGTGGGACGGAGGGAGTATATATGTGGAAGATGGTGTAACTTATCAATTCATTTTACTTGTTTAAAAGCAAGGTTCAGTTGGTAAGGTAATCAATAGCCTTTTGATGCCGTTCAATGTTAGCAGTATTCATGATAAATGATGTTGTTAGATCATTCACGATAAATTTTGGGTGGTAGAATTCATGACAAATATTGGTTATCTGTTAGAGTATTCATGATAAAACTTTGATGCCATAAGAGCATTCAAGATAACGTAGTATTGTGGTAGGTAGTATTAATTCTGCTTTTTTTCTTTTCTTTTCTGGATGGTAGTGTTACTAGTGTAGCCATGCATTACTTGGAATGCTGATATTTTTGACTTTTTTCTTTTTAGTTCTCTTTCATACTACGAAATTTTCTTCTGTTAGAATTGGAAAATTCCTGGTGGATACCATGAGTTCCAATGAAATTACAATGAGATACTTCAACATTTTATAAAATCTCTATCATCCCCTCCAATGTTTAAACTAGAGTCAGTAGCGTAGAGTTGTGAGAATTAGTTGAGAATGAGAAAGAAAGAGGGAGGGAGGGAGAATACTAACTTATAAAAACTTGAGAGTGCATCGTATGGCATCAAATGAGATGGCATTTATAGGACTTGTTACAAAGGATCCTTTTATTTTTGTGGTCGAGTTGCTTGGCGATTTGCTATCTCCTCTGGGCTCCGGGGGCTGCTGATTTATTTTTTCAATGTATTTTATGTTGGACTATTACTTGTAGCGTTGGGTAGGGCTTGAGTAGTTAAGTAATTCTTTTGGGCATTGGGTTGTGTCTTGTCTATTTTTCAGTTGAAAACAACTATTACTCTATTAGTGGTTTGAAGATGTGTCTCGTTTGTACTATTCTTTTGGACCAATTGTGGAATAGTAAGATAAGTGGATTTGGGCTGGCAACTCAATAAACACCCAAAAGAGTTGTGGTGGTTGTGTATGTGTGTTTATGTTTTACACTTTCTCCTAAATACTTATGCCCAGCTCAAATGGGAAGTTATTTGGGGAGGTAGGGAGGGAATGTCATTAAGGGATAAGTTGATTGTCATGATGTAACCCATTTTCAGGAGTCCTTATTCCACTATTTTTGCTTGCAGGACGAGAATTTAGTTGATATTTCAAATGCTCAAGAGTTGCTAACAAGCCATGTTAAGCAAGCTAAGAGAGTTCGAGCAAGGTATGTCCCTCCTGATTTTGTGTCGCCTGACATTGTTGTTTTTATTAGCTATGATTGAATCTTCAGTTGGAATTTTTCACATCAAATGCTATCTCTAAATCACCTGTACAGGCTCAATGTTGATAACCGTAGTATGAAACAGAGAAACTGTTATCTTGAATACAGTAGAAATGCAAACGTAAATCAAAAGAATTAATTAACATGCATATATTCATTGTATACTTTCTCCGTTTCTAGATAATTGCAACACTTTCTGTTTCTTTTATGTCCGAAAATTATTGCAATACCAAAATAACAAATTTGCCATTACATAACCAAACCACTAACATAATTAGAAAACCAAAAACTGAAAATAACCCTACCACAGCACTACCCCAACCAACTAACCCCCTTCTACCTTCTGTCCATGCCAACTAACCCCCAACTTTCCTCCCCTCTCTCTAATCCCGTCACCTACCATCCACCAGGCACCACCATCTTCTTTCCTCCCTTTCACTTGTTGCCAGCACTTCAAACCACCAACGAACACAGCCTCACGCCTTACCCTAGACAACAGTCCACTCCAAACTACAGCCACACCTATCGGAATGCTACAATTGAAACCACAACT
AATATTTGTACTTCTGAGTTTTGATCACAAGTCCCCATAATTTGGCCAGCAAAGCTTTGAAACCTCTGGATATGATCACTGAAGGGTCAAATCTCAATTGTATGAAGATCAGCAGTAGCCACTAGGGAGTAGGGAGAGGCGAGAAAGCAGATGAGCAGAAGGGCGGTGATTCACCACATGAGATGAGAGAGAGAGTGGAAGGGAGGGCTTGTCAGTGGTGATCTAAGAGATAGGGGGAGGGGCATGCTGAGACGGGTGTGATTTGTGAAATATAGGAGGGCGGGCTTGGTGGGATTTAGGGGTCGTCGAGTGTGTGAATGACCCCTTACAACTTGCCCTACTTATGTGCTATGATTATTTAAAAAGCGTAGGAACTATGACTCAATGCAGTTGAAATAAAGTACAACTAGATCTAGAACAGAAGAATGAAAATGTGTTGGGGGGTGGGGGGGGGGGNGGGGGGGGGGAGAAAGCCCCTAAACATATCTCCTAAGCATATCTAATTACATCAAGAAAGAAAGTCTACATAGCTTCTTTGTACAAACAAGTAATTTCATTCCCATTTTCCGTTGTAACCGACCCCTAAATCAATATCCATGTGATCTTCCTTTAATCCTTCCAATGGGCTAAACTGTAACTTCCATACCCCTAAAGAAATTTAGACAGCATTTCCTACAAAAAAACAATGTTCCAATCAATTATTTTTTTTACTTCCAGTGGAATGTTAGTAATATTAGTAAATGTTAAACTTCGCATTTACCTTAGAGACATCTCTTCCCCTTGTTTATTTATTGCATTTGTAGGCTTATATAACTTTGAGCATATGAGCTAATTTGGTTAGAGGCTTAGAGCTTGCTACTGCCCAATGCTTATTACTAGGAGATGTGCATGTAATGTATGGATGGACCTTTTTGAACGTTTGCGCCTGTTTACAGATTAAGGGAAGAAAGATCACAACGCATTGAAAGATACAAAAGTAGACTTGCTCTTCTTCTTACACCTAACGGAGATGTGCAAAATCAGCAGGAATGATACAGTCCCTGGTCCCTGAGTAGTTGAGTACTACATAGCTAGTTTGTATGATTGATTTCTGGTTCCTGTTGCTTCCGTTCCTCCTGTAATATGTTACTAAAGGATGGGACGGCATCAGGTTTTGTAGATTAAGTTATGTCGCGAGTTTTGTTGTATGATTTTTCTTCAATGAGCAATCAGAGGTTTTGCCAACATTTTTCTTGGTTGGAAATATTGTTAAGAAGGGTAAGCTAAAGAGAAGTAGATTATTGTTATGTTGCTAATAGTTAGTTATTACTCGTAATAATTATTTTATGGGATATATTAATCCCACCGGGATATTTTTGGTAGAGGTCTCTCAGTTTGAGCAGAGTTGCTCACTTAATTGCTTATTGGTACTAGTCAACTGCTGGATCAAATAATTGTTCAATAATGAATCTAGTTGAAAGGAATAAAATCCTCCTACACAATTCTATCATTCTATGCACTTCTTTAATGATTAAAGAGTTATTGGATGAGCAGTAGAAACATGATTTGAGCACTTGTAATCTAGGAAATAGGAACAAG

mRNA sequence

GGTGCACCCAAATTGTAATCGTATAAACAAATTTAACATTCGAAATACAAATACCCCATACTCCAAATGTGGCAAATGCGGGAACGCATCTCCTTCCCCCTTCCTCCTCAGTCTTCCCGCATCTCCTTCCTTCTCCTCTCGAATCTCCTCCTCTTAAACCCTAACAACAAAACCAGCATCTTGTAGTTGTAGTCGGCGGCACCTCTGTCAACCACCTCTCCTCTGCTGCTGCTATTCTCGGGTGGGCGCCCCTCCTGCGCTAGCTCATGGACGGGACTGGACAGCATCAGCCTGGGTTATTAGCAGCTCCAACCACACCTCAAACTCAAACCTACTTGCATCCTCGCTTGGCCAATGCTTCAGATTCGATTCGATATCCATTAGCATCTACCGGTCGTGGATTGCTTCCGGCACCTAGAGGCATATTGCCTGATCCAGTGGCAAGTTCCATTGCCCCTAACTCGTGGGCTCTTCAGCAGTATCAACATCAGCACCATCACTCTCGATTGTCTGCTCCCTCTAGTCATCTTGTTCGCGGTCCTTTTTTGCATCCTTCTCAGGCTCATATATCACCTGTTTCATTCTCTGCTGCTGCTAGAGGCGTTCTATTCAACAATCGCCCTCAGACCATGGTTGGTTCATCTCCCTACTCTGCTCCTGATAGTAATGATCCTAAAGATCCCATAGACAAAGGTACTGATGATACGTTGGTCACTCTTAGAGATCGAAAGGTCAGGATATCAGACGGGGCATCACTGTATGCTCTCTGTCGGTCGTGGTTGAGGAATGGATGTGTAGAAGATAATCAACCACAGTATGGAGATGTTCTAAAGTCTCATCCGAAACCCTTTCATCTGCCTCTTCCTCTGCCTCCGCATGAATATTCCTCACCGAAGAGAAAGGAGAGAGATGATGAAACGGAAGTTGACGAGGACGAGAATTTAGTTGATATTTCAAATGCTCAAGAGTTGCTAACAAGCCATGTTAAGCAAGCTAAGAGAGTTCGAGCAAGATTAAGGGAAGAAAGATCACAACGCATTGAAAGATACAAAAGTAGACTTGCTCTTCTTCTTACACCTAACGGAGATGTGCAAAATCAGCAGGAATGATACAGTCCCTGGTCCCTGAGTAGTTGAGTACTACATAGCTAGTTTGTATGATTGATTTCTGGTTCCTGTTGCTTCCGTTCCTCCTGTAATATGTTACTAAAGGATGGGACGGCATCAGGTTTTGTAGATTAAGTTATGTCGCGAGTTTTGTTGTATGATTTTTCTTCAATGAGCAATCAGAGGTTTTGCCAACATTTTTCTTGGTTGGAAATATTGTTAAGAAGGGTAAGCTAAAGAGAAGTAGATTATTGTTATGTTGCTAATAGTTAGTTATTACTCGTAATAATTATTTTATGGGATATATTAATCCCACCGGGATATTTTTGGTAGAGGTCTCTCAGTTTGAGCAGAGTTGCTCACTTAATTGCTTATTGGTACTAGTCAACTGCTGGATCAAATAATTGTTCAATAATGAATCTAGTTGAAAGGAATAAAATCCTCCTACACAATTCTATCATTCTATGCACTTCTTTAATGATTAAAGAGTTATTGGATGAGCAGTAGAAACATGATTTGAGCACTTGTAATCTAGGAAATAGGAACAAG

Coding sequence (CDS)

ATGGACGGGACTGGACAGCATCAGCCTGGGTTATTAGCAGCTCCAACCACACCTCAAACTCAAACCTACTTGCATCCTCGCTTGGCCAATGCTTCAGATTCGATTCGATATCCATTAGCATCTACCGGTCGTGGATTGCTTCCGGCACCTAGAGGCATATTGCCTGATCCAGTGGCAAGTTCCATTGCCCCTAACTCGTGGGCTCTTCAGCAGTATCAACATCAGCACCATCACTCTCGATTGTCTGCTCCCTCTAGTCATCTTGTTCGCGGTCCTTTTTTGCATCCTTCTCAGGCTCATATATCACCTGTTTCATTCTCTGCTGCTGCTAGAGGCGTTCTATTCAACAATCGCCCTCAGACCATGGTTGGTTCATCTCCCTACTCTGCTCCTGATAGTAATGATCCTAAAGATCCCATAGACAAAGGTACTGATGATACGTTGGTCACTCTTAGAGATCGAAAGGTCAGGATATCAGACGGGGCATCACTGTATGCTCTCTGTCGGTCGTGGTTGAGGAATGGATGTGTAGAAGATAATCAACCACAGTATGGAGATGTTCTAAAGTCTCATCCGAAACCCTTTCATCTGCCTCTTCCTCTGCCTCCGCATGAATATTCCTCACCGAAGAGAAAGGAGAGAGATGATGAAACGGAAGTTGACGAGGACGAGAATTTAGTTGATATTTCAAATGCTCAAGAGTTGCTAACAAGCCATGTTAAGCAAGCTAAGAGAGTTCGAGCAAGATTAAGGGAAGAAAGATCACAACGCATTGAAAGATACAAAAGTAGACTTGCTCTTCTTCTTACACCTAACGGAGATGTGCAAAATCAGCAGGAATGA

Protein sequence

MDGTGQHQPGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWALQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRLALLLTPNGDVQNQQE
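
The protein sequence above is expected to be the conceptual translation of the CDS. A minimal sketch for checking this, assuming the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` and `CODON_TABLE` are illustrative names:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last four codons of the CDS listed above.
print(translate("ATGGACGGGACTGGACAGCATCAGCCTGGG"))
print(translate("CAGCAGGAATGA"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.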
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name    Type
Spo18849        Spo18849       gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name    Unique Name           Type
Spo18849.1      Spo18849.1-protein    polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
Spo18849.1.exon.1    Spo18849.1.exon.1    exon
Spo18849.1.exon.2    Spo18849.1.exon.2    exon
Spo18849.1.exon.3    Spo18849.1.exon.3    exon
Spo18849.1.exon.4    Spo18849.1.exon.4    exon
Spo18849.1.exon.5    Spo18849.1.exon.5    exon
Spo18849.1.exon.6    Spo18849.1.exon.6    exon
Spo18849.1.exon.7    Spo18849.1.exon.7    exon
Spo18849.1.exon.8    Spo18849.1.exon.8    exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo18849.1.utr5p.1    Spo18849.1.utr5p.1    five_prime_UTR
Spo18849.1.utr5p.2    Spo18849.1.utr5p.2    five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name         Type
Spo18849.1.CDS.1    Spo18849.1.CDS.1    CDS
Spo18849.1.CDS.2    Spo18849.1.CDS.2    CDS
Spo18849.1.CDS.3    Spo18849.1.CDS.3    CDS
Spo18849.1.CDS.4    Spo18849.1.CDS.4    CDS
Spo18849.1.CDS.5    Spo18849.1.CDS.5    CDS
Spo18849.1.CDS.6    Spo18849.1.CDS.6    CDS
Spo18849.1.CDS.7    Spo18849.1.CDS.7    CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Spo18849.1.utr3p.1    Spo18849.1.utr3p.1    three_prime_UTR


Homology
BLAST of Spo18849.1 vs. NCBI nr
Match: gi|902162695|gb|KNA06718.1| (hypothetical protein SOVF_178440 [Spinacia oleracea])

HSP 1 Score: 541.6 bits (1394), Expect = 8.200e-151
Identity = 272/272 (100.00%), Positives = 272/272 (100.00%), Query Frame = 1

Query: 9   PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA 68
           PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA
Sbjct: 2   PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA 61

Query: 69  LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY 128
           LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY
Sbjct: 62  LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY 121

Query: 129 SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL 188
           SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL
Sbjct: 122 SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL 181

Query: 189 KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA 248
           KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA
Sbjct: 182 KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA 241

Query: 249 RLREERSQRIERYKSRLALLLTPNGDVQNQQE 281
           RLREERSQRIERYKSRLALLLTPNGDVQNQQE
Sbjct: 242 RLREERSQRIERYKSRLALLLTPNGDVQNQQE 273
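
The percentages on each HSP statistics line are the number of identical (or positively scoring) aligned positions divided by the alignment length, gaps included. A minimal sketch that recomputes them from the raw counts; `hsp_percentages` is a hypothetical helper written for this page's line format, not a function from any BLAST library:

```python
import re

# Matches e.g. "Identity = 155/270 (57.41%), Positives = 172/270 (63.70%)".
# "Posi?tives" also accepts the spelling "Postives" in case the label is
# misspelled in a given export.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no HSP statistics found")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    pct_identity = round(100 * int(ident) / int(ident_len), 2)
    pct_positive = round(100 * int(pos) / int(pos_len), 2)
    return pct_identity, pct_positive

line = "Identity = 155/270 (57.41%), Positives = 172/270 (63.70%), Query Frame = 1"
print(hsp_percentages(line))
```

Recomputing the values this way is a cheap check that the counts and the printed percentages on a line agree.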

BLAST of Spo18849.1 vs. NCBI nr
Match: gi|731350598|ref|XP_010686585.1| (PREDICTED: uncharacterized protein LOC104900777 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 244.2 bits (622), Expect = 2.700e-61
Identity = 155/270 (57.41%), Positives = 172/270 (63.70%), Query Frame = 1

Query: 8   QPGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSW 67
           QP    APT+ Q+QT            IRYPLAS+GRG+L         P      PNS 
Sbjct: 60  QPVPFTAPTSIQSQT----------QPIRYPLASSGRGIL---------PTIGCTRPNSS 119

Query: 68  ALQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSP 127
             QQYQHQ         S+HL+    L P  AHISP    A AR +  NN PQ+  GSSP
Sbjct: 120 QHQQYQHQ--------SSAHLL----LRPLNAHISPA--PAVARPIPLNNPPQSKAGSSP 179

Query: 128 YSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDV 187
           Y   DS+D KDP+DKGTDD LVTLRDRKVRI+DGAS YALCRSWLRNG VE+NQPQYGD+
Sbjct: 180 YFIADSSDSKDPLDKGTDDMLVTLRDRKVRITDGASFYALCRSWLRNGYVEENQPQYGDI 239

Query: 188 LKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVR 247
                    LP PLP  E  SPKRKE DDETEVDE E L    NA+ELL+ HV++AKRVR
Sbjct: 240 FSR-----SLPKPLPLPESFSPKRKECDDETEVDEAEKL---GNAEELLSMHVRRAKRVR 288

Query: 248 ARLREERSQRIERYKSRLALLL-TPNGDVQ 277
            RLREER QRIERYKSRL LLL  PNG  Q
Sbjct: 300 TRLREERLQRIERYKSRLGLLLHPPNGTTQ 288

BLAST of Spo18849.1 vs. NCBI nr
Match: gi|731415441|ref|XP_010659555.1| (PREDICTED: SH3 domain-containing protein C23A1.17 [Vitis vinifera])

HSP 1 Score: 174.9 bits (442), Expect = 2.000e-40
Identity = 120/264 (45.45%), Positives = 155/264 (58.71%), Query Frame = 1

Query: 25  HPRLANASD---SIRYPLASTGRGLLPAP-RGILPDPVASSIA-------PNSWALQQYQ 84
           +P+LA   D    I YP+AS+GRG +P P R    D    ++A       P S A     
Sbjct: 75  NPQLAKPHDPPQGILYPVASSGRGFIPKPLRPQSSDHNTVTVANPGAAFPPRSAATAAAA 134

Query: 85  HQHHHSRLSAPSS------HLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSP 144
             H       P S      H +R P L PS   ++ V  SA  +G+  +  P+  V  SP
Sbjct: 135 FSHQARPFGFPQSDLNYPVHSMRMPHLLPSHVGVTAVPGSAPIKGIPVSAHPK--VAPSP 194

Query: 145 YSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDV 204
            S  D N  KD  D+  DDT VT+RDRKVRISDGAS+YALCRSWLRNG  E+ QPQ+ D 
Sbjct: 195 PSVSDCNGYKDSRDRNRDDTFVTVRDRKVRISDGASIYALCRSWLRNGFSEETQPQHYDS 254

Query: 205 LKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVR 264
           +KS P+    PLP+P  + + PK+KE D+E   +EDE  V+    Q+LL  H+K+AK+VR
Sbjct: 255 MKSLPR----PLPIPVTDPNLPKKKEDDEE---EEDEGSVENLLPQDLLQRHIKRAKKVR 314

Query: 265 ARLREERSQRIERYKSRLALLLTP 272
           ARLRE+R +RI RYK+RLALLL P
Sbjct: 315 ARLREQRLKRIARYKTRLALLLPP 329

BLAST of Spo18849.1 vs. NCBI nr
Match: gi|1009106179|ref|XP_015873160.1| (PREDICTED: uncharacterized protein LOC107410261 [Ziziphus jujuba])

HSP 1 Score: 166.0 bits (419), Expect = 9.400e-38
Identity = 126/294 (42.86%), Positives = 163/294 (55.44%), Query Frame = 1

Query: 7   HQPGLLAAPTTPQTQTYLHPRL--------ANASDSIRYPLASTGRGLLPAPRGILPDPV 66
           HQP L AA T P      + +L        ++ +  I YPLAS+GRG +P  + + P PV
Sbjct: 71  HQP-LYAAQTLPIASPNPNFQLPAKPPNDPSSPAHPISYPLASSGRGFVP--KAVRPVPV 130

Query: 67  ASSIA-----PNSWALQQY-----------QHQHHHSRLSAPSSHLVRGPFLHPSQAHIS 126
            S  A     P  +  +             QH  H + L  P  +L       P Q    
Sbjct: 131 ISDQAVTVANPGGYPPRPVVNFHHGVDGVRQHLDHAAHLMRPPHNLQHHHHYLPHQQVHR 190

Query: 127 PVSFSAAA--RGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISD 186
           P   SAA   +GV  +   +    + P S PDSN  KD  DK  DDTL  +RDRKVRI++
Sbjct: 191 PHLGSAAVPVKGVPVSAHLKV---APPSSVPDSNGYKDSRDKSRDDTLAIIRDRKVRITE 250

Query: 187 GASLYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKERDDETEV 246
           GASLYALCRSWLRNG  E++QPQYGD ++S PK    P P+       PK+KE +++ E 
Sbjct: 251 GASLYALCRSWLRNGAPEESQPQYGDAVRSLPK----PSPIHMTNTDLPKKKEGEEDGEQ 310

Query: 247 DE---DENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRLALLLTP 272
           +E   DE  V+  ++QELL  HVK+AK++RARLREER +RI RYKSRLALLL P
Sbjct: 311 EEEVKDEESVEHLSSQELLKRHVKRAKKIRARLREERLKRIARYKSRLALLLPP 354

BLAST of Spo18849.1 vs. NCBI nr
Match: gi|720084883|ref|XP_010243354.1| (PREDICTED: leucine-rich repeat extensin-like protein 5 [Nelumbo nucifera])

HSP 1 Score: 164.9 bits (416), Expect = 2.100e-37
Identity = 114/276 (41.30%), Positives = 145/276 (52.54%), Query Frame = 1

Query: 10  GLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIA------ 69
           G  + P  P     +          I YP+AS+GRG +P  +   P PV   +       
Sbjct: 55  GKTSNPNPPAQMAKIQDPSVPPPQGILYPVASSGRGFIP--KSFRPQPVDQLVTVANPGG 114

Query: 70  --PNS---WALQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNR 129
             P S   +A Q              + HL+R P + P   H+ P    A   G    + 
Sbjct: 115 FPPRSVVAFANQVRPFSFPPGDPQVQAVHLMRPPHMQPP--HLGPRHIGATVSGPPIKSI 174

Query: 130 PQTM---VGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNG 189
           P  +       P S  D N  K+  D+  DDT+VT+ DRKVR+SDGASLYALCRSW+RNG
Sbjct: 175 PLVVHPKAAQFPSSTSDFNGYKELRDRSRDDTVVTIHDRKVRLSDGASLYALCRSWVRNG 234

Query: 190 CVEDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQEL 249
             +++QPQ+G+ +K  P+    PLP    E   PK+ E DDE E  EDE  V+  +AQEL
Sbjct: 235 LPQESQPQFGEGVKLLPR----PLPTSISEIPLPKKTEGDDEDEKKEDEGSVEELSAQEL 294

Query: 250 LTSHVKQAKRVRARLREERSQRIERYKSRLALLLTP 272
           L  HVK AK+VRARLREER QRI RYK RLALLL P
Sbjct: 295 LQRHVKHAKKVRARLREERLQRIARYKQRLALLLPP 322

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A0K9QJ54_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_178440 PE=4 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 5.700e-151
Identity = 272/272 (100.00%), Positives = 272/272 (100.00%), Query Frame = 1

Query: 9   PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA 68
           PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA
Sbjct: 2   PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA 61

Query: 69  LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY 128
           LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY
Sbjct: 62  LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY 121

Query: 129 SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL 188
           SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL
Sbjct: 122 SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL 181

Query: 189 KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA 248
           KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA
Sbjct: 182 KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA 241

Query: 249 RLREERSQRIERYKSRLALLLTPNGDVQNQQE 281
           RLREERSQRIERYKSRLALLLTPNGDVQNQQE
Sbjct: 242 RLREERSQRIERYKSRLALLLTPNGDVQNQQE 273

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BT56_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g184170 PE=4 SV=1)

HSP 1 Score: 244.2 bits (622), Expect = 1.900e-61
Identity = 155/270 (57.41%), Positives = 172/270 (63.70%), Query Frame = 1

Query: 8   QPGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSW 67
           QP    APT+ Q+QT            IRYPLAS+GRG+L         P      PNS 
Sbjct: 60  QPVPFTAPTSIQSQT----------QPIRYPLASSGRGIL---------PTIGCTRPNSS 119

Query: 68  ALQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSP 127
             QQYQHQ         S+HL+    L P  AHISP    A AR +  NN PQ+  GSSP
Sbjct: 120 QHQQYQHQ--------SSAHLL----LRPLNAHISPA--PAVARPIPLNNPPQSKAGSSP 179

Query: 128 YSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDV 187
           Y   DS+D KDP+DKGTDD LVTLRDRKVRI+DGAS YALCRSWLRNG VE+NQPQYGD+
Sbjct: 180 YFIADSSDSKDPLDKGTDDMLVTLRDRKVRITDGASFYALCRSWLRNGYVEENQPQYGDI 239

Query: 188 LKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVR 247
                    LP PLP  E  SPKRKE DDETEVDE E L    NA+ELL+ HV++AKRVR
Sbjct: 240 FSR-----SLPKPLPLPESFSPKRKECDDETEVDEAEKL---GNAEELLSMHVRRAKRVR 288

Query: 248 ARLREERSQRIERYKSRLALLL-TPNGDVQ 277
            RLREER QRIERYKSRL LLL  PNG  Q
Sbjct: 300 TRLREERLQRIERYKSRLGLLLHPPNGTTQ 288

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A061E3E3_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_005945 PE=4 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 1.500e-34
Identity = 99/237 (41.77%), Positives = 140/237 (59.07%), Query Frame = 1

Query: 35  IRYPLASTGRGLLPAPRGILPDPVASSIAPNSWALQQYQHQHHHSRLSAPSSHLVRGPFL 94
           + YP+AS+GRG LP      P      + P  +    + H HH +    PS         
Sbjct: 49  VMYPVASSGRGFLPTNHPCRP------LLP--YHHHPHPHPHHFANPRPPS--------- 108

Query: 95  HPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDR 154
            PS +   P  F    + +  +  P+  V  SP S  ++N  K+  D+  DD+LV +RDR
Sbjct: 109 -PSLSLPHPTHFHPPLKALSLSLHPK--VAPSPSSLSETNGYKNVRDRTKDDSLVNVRDR 168

Query: 155 KVRISDGASLYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKER 214
           KVRI+DGAS+YALCRSWLRNG  ++ QPQYGDV KS P+P  LP+P+  +     + +E 
Sbjct: 169 KVRITDGASVYALCRSWLRNGFPDETQPQYGDVSKSLPQP--LPIPVTDNLLKDTEDEEE 228

Query: 215 DDETEVDEDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRLALLLTP 272
            ++ +  EDE  V+  +AQ+LL  H+ +AK+VR+RLR+ER +RI RYK+RLALLL P
Sbjct: 229 QEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLALLLPP 263

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A0V0I3K0_SOLCH (Putative mucin-2-like OS=Solanum chacoense PE=4 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 9.800e-34
Identity = 109/266 (40.98%), Positives = 145/266 (54.51%), Query Frame = 1

Query: 11  LLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPN-SWAL 70
           L+  P  P +Q +LH        SI YP+AS+GRG L  P      PV S +    ++ L
Sbjct: 78  LVLKPPNPDSQPHLH--------SILYPVASSGRGFLSKPSNYPXRPVVSHLGSRPTFGL 137

Query: 71  QQYQHQHHHSRLSAPS--SHLVRG--PFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGS 130
            Q       S    PS   H + G  P ++ +    S      A +G    +     + S
Sbjct: 138 NQMDPGLGQSTGVRPSHLQHALLGSSPTVNSAGPAASAGVLPGAVKGFPVVSSSHHKIAS 197

Query: 131 SPYSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYG 190
           +  S  D N  ++  D+  DDT   +RDRKVRISD ASLY LCRSWLRNG  +D Q QY 
Sbjct: 198 TQPSLSDCNGFREKRDRSKDDTFAIIRDRKVRISDNASLYTLCRSWLRNGLPDDTQSQYM 257

Query: 191 DVLKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKR 250
           D ++S P+    PL L P +  SP +KE D E E +  E++  +S  +ELL  HVK+AKR
Sbjct: 258 DGVRSLPR----PLALAPQDAESPVKKEGDKEEEXEAGESVEHLS-PKELLQRHVKRAKR 317

Query: 251 VRARLREERSQRIERYKSRLALLLTP 272
           +R+RLREER +RI RYK+RLALLL P
Sbjct: 318 IRSRLREERLRRIARYKTRLALLLPP 330

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A061DWJ9_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_005945 PE=4 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 3.700e-33
Identity = 100/238 (42.02%), Positives = 139/238 (58.40%), Query Frame = 1

Query: 35  IRYPLASTGRGLLPAPRGILPDPVASSIAPNSWALQQYQHQHHHSRLSAPSSHLVRGPFL 94
           + YP+AS+GRG LP      P      + P  +    + H HH +    PS         
Sbjct: 49  VMYPVASSGRGFLPTNHPCRP------LLP--YHHHPHPHPHHFANPRPPS--------- 108

Query: 95  HPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDR 154
            PS +   P  F    + +  +  P+  V  SP S  ++N  K+  D+  DD+LV +RDR
Sbjct: 109 -PSLSLPHPTHFHPPLKALSLSLHPK--VAPSPSSLSETNGYKNVRDRTKDDSLVNVRDR 168

Query: 155 KVRISDGASLYALCRSWLRNGCV-EDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKE 214
           KVRI+DGAS+YALCRSWLRNG   E  QPQYGDV KS P+P  LP+P+  +     + +E
Sbjct: 169 KVRITDGASVYALCRSWLRNGFPDETQQPQYGDVSKSLPQP--LPIPVTDNLLKDTEDEE 228

Query: 215 RDDETEVDEDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRLALLLTP 272
             ++ +  EDE  V+  +AQ+LL  H+ +AK+VR+RLR+ER +RI RYK+RLALLL P
Sbjct: 229 EQEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLALLLPP 264

BLAST of Spo18849.1 vs. TAIR (Arabidopsis)
Match: AT2G32840.1 (proline-rich family protein)

HSP 1 Score: 123.6 bits (309), Expect = 1.900e-28
Identity = 101/249 (40.56%), Positives = 136/249 (54.62%), Query Frame = 1

Query: 34  SIRYPLASTGRGLLPAP----RGILPDPVASSIAPNSWALQQYQHQHHHSRLSA---PSS 93
           S+ YP  S+GRG    P       + DPV S  +P  +  +   + +HH +  +   P +
Sbjct: 96  SLIYPFGSSGRGFPTRPVRQNSNSVADPVGSP-SPGGYTPRGPVYGYHHGQFVSNLDPMN 155

Query: 94  HLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDD 153
             +R    HP Q   SP   S   +GV    +P+     SP S  D++  K    +  DD
Sbjct: 156 QFMRAA--HP-QNQQSPQLGSGHMKGVPHFLQPRAT--PSPTSILDNSGHKKA--RSRDD 215

Query: 154 TLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLPLPPHEY 213
            LV +R RKVRI++GASLY+LCRSWLRNG  E  +PQ  D++   PK    PLP+   E 
Sbjct: 216 ALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIKPQRIDMMTCLPK----PLPVDKTET 275

Query: 214 SSPKRKERDDETEVD-EDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRL 273
           S PK    +   E D EDE  V   +  +LL  H+ +AK+VRARLREER +RI RYK+RL
Sbjct: 276 SLPKDLVEEAICEEDKEDEESVKHLSESDLLKRHIDRAKKVRARLREERLKRIARYKARL 332

Query: 274 ALLLTPNGD 275
           ALLL P G+
Sbjct: 336 ALLLPPFGE 332

BLAST of Spo18849.1 vs. TAIR (Arabidopsis)
Match: AT1G04930.2 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 95.5 bits (236), Expect = 5.500e-20
Identity = 60/134 (44.78%), Positives = 84/134 (62.69%), Query Frame = 1

Query: 142 KGTDDTLVTLRDRKVRISDGAS-LYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLP 201
           +  D  L  +R RKVRI++G+S LY+L RSWL+NG     QPQ   ++K  PKP  LP+ 
Sbjct: 231 RSKDGALAVVRGRKVRITEGSSSLYSLGRSWLKNGAHVGIQPQRSGIMKPLPKP--LPVD 290

Query: 202 LPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIER 261
           L   E S P   + +   E  EDE  V   + ++LL  H+++AK+VRA+LREERS+RI R
Sbjct: 291 LTT-ETSVPDDPDEESADEDKEDEEAVKQLSEKDLLKRHIERAKKVRAQLREERSRRIRR 350

Query: 262 YKSRLALLLTPNGD 275
           YK R+ L+L  + D
Sbjct: 351 YKERITLILAQSED 361

The following BLAST results are available for this feature:
BLAST of Spo18849.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                           E-value     Identity   Description
gi|902162695|gb|KNA06718.1|          8.2e-151    100.0      hypothetical protein SOVF_1784...
gi|731350598|ref|XP_010686585.1|     2.7e-61     57.4       PREDICTED: uncharacterized pro...
gi|731415441|ref|XP_010659555.1|     2.0e-40     45.4       PREDICTED: SH3 domain-containi...
gi|1009106179|ref|XP_015873160.1|    9.4e-38     42.8       PREDICTED: uncharacterized pro...
gi|720084883|ref|XP_010243354.1|     2.1e-37     41.3       PREDICTED: leucine-rich repeat...
BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniProtKB/TrEMBL)
Total hits: 5
Match Name          E-value     Identity   Description
A0A0K9QJ54_SPIOL    5.7e-151    100.0      Uncharacterized protein OS=Spi...
A0A0J8BT56_BETVU    1.9e-61     57.4       Uncharacterized protein OS=Bet...
A0A061E3E3_THECC    1.5e-34     41.7       Hydroxyproline-rich glycoprote...
A0A0V0I3K0_SOLCH    9.8e-34     40.9       Putative mucin-2-like OS=Solan...
A0A061DWJ9_THECC    3.7e-33     42.0       Hydroxyproline-rich glycoprote...
BLAST of Spo18849.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy Swiss-Prot)
Total hits: 0
Match Name    E-value    Identity    Description
BLAST of Spo18849.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 2
Match Name     E-value    Identity   Description
AT2G32840.1    1.9e-28    40.5       proline-rich family protein
AT1G04930.2    5.5e-20    44.7       hydroxyproline-rich glycoprote...
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term     IPR Description     Source     Source Term      Source Description                                 Alignment
IPR028226    Protein LIN37       PFAM       PF15306          LIN37                                              coord: 148..268, score: 3.4
None         No IPR available    PANTHER    PTHR37173        FAMILY NOT NAMED                                   coord: 10..274, score: 1.5
None         No IPR available    PANTHER    PTHR37173:SF1    HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEIN    coord: 10..274, score: 1.5

GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0007049 cell cycle
cellular_component GO:0017053 transcriptional repressor complex