Spo18849 (gene)

Overview
Name: Spo18849
Type: gene
Organism: Spinacia oleracea (Spinach)
Description: Hydroxyproline-rich glycoprotein
Location: chr2 : 48021594 .. 48031171 (+)
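The location value above can be turned into structured coordinates with a few lines of Python. This is only a sketch: the `Location` type and the `parse_location` and `span_length` helpers are hypothetical names introduced here, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed feature location."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a value such as 'chr2 : 48021594 .. 48031171 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("chr2 : 48021594 .. 48031171 (+)")
print(loc.seqid, loc.strand, span_length(loc))  # chr2 + 9578
```

Under that assumption the gene spans 9578 bp on the plus strand of chr2.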
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGTGCACCCAAATTGTAATCGTATAAACAAATTTAACATTCGAAATACAAATACCCCATACTCCAAATGTGGCAAATGCGGGAACGCATCTCCTTCCCCCTTCCTCCTCAGTCTTCCCGCATCTCCTTCCTTCTCCTCTCGAATCTCCTCCTCTTAAACCCTAACAACAAAACCAGCATCTTGTAGTTGTAGTCGGCGGCACCTCTGTCAACCACCTCTCCTCTGCTGCTGCTATTCTCGGTAATTATCTACAAGTATTTTTGTATTGATTTTTCTGTTGATTTTATTTGTATTTTTATTGATTATAATTCTCTTAATTGTCATATACATTGTGTTGGTTTTTAGTTAGCTTTGTTTTGATACGATTCTAATAAGATTGATTAGGATTAAAGATTCAATTTTTCTGATATGATGGGTTTCAAATTGTTAAATATATTCAGTTGGGTTAGTTTTGGTGTTGTTTTATGCCTAATTAGTGTATACCCACTTCATGATTCTACTTCACATTCGTATAATTCCACTTCATAATTCTACTTGGAATTCGCATAATCCTTTTTGATATGGTTTGATTTTTCTGACAATCACTGTTCATCTCTCTTGGCCCAGGGTGGGCGCCCCTCCTGCGCTAGCTCATGGACGGGACTGGACAGCATCAGCCTGGGTTATTAGCAGCTCCAACCACACCTCAAACTCAAACCTACTTGCATCCTCGCTTGGCCAATGCTTCAGATTCGATTCGATATCCATTAGCATCTACCGGTCGTGGATTGCTTCCGGCACCTAGAGGCATATTGCCTGATCCAGTGGCAAGTTCCATTGCCCCTAACTCGTGGGCTCTTCAGCAGTATCAACATCAGCACCATCACTCTCGATTGTCTGCTCCCTCTAGTCATCTTGTTCGCGGTCCTTTTTTGCATCCTTCTCAGGCTCATATATCACCTGTTTCATTCTCTGCTGCTGCTAGAGGCGTTCTATTCAACAATCGCCCTCAGACCATGGTTTCCTTTATTCCTTTCTTTTTCTTTCCTGCTGTCTTAACCTTTGTATATTTCTTAACTGTCTAAGCTCTAGGATTTGTTTTGTGATGCTATGTTGCTGCTTGTTTCATGGTTTTTTGGAAGCTTTTTTTTTAGAAGTTATAACTTTTTTCTGGTCGGAATGGGTCTTTTCTGGCTAGGTTTTATGGGTATTACATTGTCATATCACATGTGCCTCATTCGAGACCAAAATTCAACTATCCTGACTGGTCTGGTTTAAGGTGCAAAAGTGGGTAGACTGTTTGTGCACATAATTTCTTATTGAGGATTAAGGAGCCCTCATGCACACCAATGTGACCATTTTGGAACAACCTTTCTAATTTACTCTTTATATATCCATCTTAATTTTTGTTTTCTGGAACAATCTGTCTACTACTTTAGATTGTTGCGGTAGCACTGCATACTTATGGCCTATTATCTTCTCTTGTTCAGGTTGGTTCATCTCCCTACTCTGCTCCTGATAGTAATGATCCTAAAGATCCCATGTAAAACATCTACTCTGGCCTATTTCCTTCCTTGTGTGAACTTCGGTGTCATGGTTGCACACTTGTAGTTACGGTTCTTGTTTTGACTTGTGCAGAGACAAAGGTACTGATGATACGTTGGTCACTCTTAGAGATCGAAAGGTAAGGTTAAGGTCTACTATTCTCTTTAAGAATCATACTATAGATATTCATTGATTCCCAAGTTTCTTTAATAATTCATCTTTGCCGTGCATATTCCCTAAGAGATATCAATAATAATGAGAAATGTATGAGTAGTCTATATTATAATTTTTCCTGTCTTTTGGGGATGGAAGGAATTAACCTCTCGCTGATTTTTTTCCATTATATCGAAAGAACTGAAGTTCGTTACAGGTAATGAGCTGAAGCTCATGACACCAACTGAGCCAAGGCTAATAGCAACTAGCAAAACAGGGCTTCTAAAACCTTGTGACTCGATGCTTGTAATGTGCTAATGAGT
GCATATTGTTTTATGTGTTGGTTTAATTTGTGCTCTAGGTACTTATTCTTGTGATTCAGGTGGCTTAAGTGTAGTTTGGAAGTTGAGCTGTTTATGGAAACTTTTCATGCTTCTGATATTTTATATAGTAATATTCACCTTGTCGTTTTGTTTGAATAATTTATATGCTTAAGGTGTGGCCTGGCTTGTTTTTTTACGTAGGTCTTTACAATGAAACAATTAGTACATATTCCCTCATTAGAAACATTGTCTTCTTGTGACTTACTGGAAGTCTGGAACTGATTCTGGATAAGTGCTTTAATAGTTTTGAGAGTGGAACCTATTCCTGCTATCAGATTACCTTTTCGGAGAACAGGTAGTTTGGAAGTTGTTGGGATGAGTTGAGAAGGTGTACGATTTTATTCATATTATTTTGTGGGGAATAGTTGGGTAAAGTGATGGACACTTGGACAGCAAGGGTTTTAGACTTACTTTTAGTCCCAGTGGAGCAAATCTTTTGTGTATCAGAGTGTGAATTTTGTTCTGAAATAGCAAACCAAAGTTATAATTACGAAAAGTTCAACACTCGAACCGTAAAGAATAAAGACCCACTCACTAAATTCAAGATAAAGACTCTGTGATTCTTTTATATTTTAACATAAATAACATAGTTCCTGGGCTAAGTTTTGGCTATCATAGACTATTAACCTCGGTATTGCGAAATGTCCTTATTTTTGAAAAATACAAAGTCTACACTAATTTTCATGTAAAAATTCCCGATGAATGGCTTATTTGTGCTTGTAATATGGACATGAGTGCACAGTGCTCCTTATGCACCAAAAAGCACCGATGCATATGAAAAAGAAAATCCAATATAACAATCTGTACCAGTGAGTTTAATGGTCCTTTATGTGAGAGAAGATGTGACCTCAAATCTACTTTTCTTTGGTTAACTAGGTTGAGCACAAAGTTGAAAATAAGAACACAACACTTTTCAGTGCACAAGGCTTGTTGACTTTGTTCCTCTTAAGAAACCAGGGTAGATGGCAAAAGATTTCTAAATTATGAGCTATTCACTATCATTGCGGTTCAACAATCTCCTCCTTACTAGAAGCAACTTTTACATGAAACCCTTCTCTTTAAAGCTGGTTTGACTTGTCAAGTTCCTCACTAACTTATTTCTTGTAGGATCATAGTATGCCTGTTATGGAAGTCACTTTGCACAGACTCATAACCATAACACTTGAAGCCATGCACCTTATCCATCTTGATCTCTTTTGTTGGTTTCTCATGCAAAGTTACTTCCTTGTCATCATTAGTCAAAGAATCAGATTTCGGTATACAGATTTTCATTGAAGAAGACTCCTTAATTAACCGTTTTACACGCTCGTGAACTTGCTTTCACAGTTTCAGGGCCAAAGTGCACAATTGCACATTCATCAAAACTCTTCCAAGGTAAGAGTACGCTGAAATACCAAATTATGAAATTTGGCTGATAAATATGAACTGATATAACCCAGTAAAGTAAATTCTACTTGTCAATACTTAAAGACTAGGCAATTATTACCCACTTTGATGTAAAGGATGGAGCACCCGCTTTGAGAAATTTAAAGATTACTATCTGTTGTATAACCGTTGACCTGTAAATAATTAAATTTAATTGGTCTTTTTCTGTTTTGTGATGTGTCAATAAGATCACTAATTCGAAATCTAGTTTACAACGCGAAAGTAACCGATATTTCTCCTTGTGGAACACTCCTGAACTCACAATTCGAATCCCCAAGCCTAGAAGATCATTCCTGAAGCCCCTAAGTATTAGATCTCTAAGTCCTAGAAACACTCCTGAATCTAAGCCTAGATTTACTTACGTCCAAACCACTCCTGAAATTCAATCCAAGCAACGAAAGAATAAAACTTTCAATTTAATTAAGCAAAGCTCGCGAATTTTACTTAATTCTCAAAAATCTGAATAATAGAAATTACAAGTCTTCACTATATAGGAGTATGACTAACAGTAGTCC
TGTATAAATGGAACTCTAAAAAGCTATTCATAAAAGGTCTTAGGTAGATTTGTTTTAGCTTGTGTATTATTGCTGCAATTACGGTGTCTCGTGTGAAGAATTGTAATCGCATGCCAGGAATTTTGCAATGATAATGAAATACTGGCAACATATTATTAATGGCTCAAAGTGCAACACGTTTTGGGCTTGTAGTTGCAGTAGTATGTGTTTTCCAATTCAGCAAAATAACTTGATGATGAATTATTCTGACCCTTCACTTGGGATTATGCTGAAACGAGAAGGCATTTTAGTTGGTTGTTGTTAGAGAGTGCAAGTATGTTAAGGGCAGGTGTTTAGGTATGTGTTTCGATATATGTTCTTGTATTTGACTTATCCACTTGGCGTTTCAGCTTGTGCAGTTAGGTGTTTTCTGAAAATTTGCAGTGATTATAACTTGAAACCAAATCATGTTTTGGTCTTTCATCAGATTTCTACCCTAAGTCTCTCTTTGGATACTTTTTGTAGGTCAGGATATCAGACGGGGCATCACTGTATGCTCTCTGTCGGTCGTGGTTGAGGAATGGATGTGTAGAAGATAATCAAGTACGTAGCATTTGGTCATATCTACTGTTATTTTTATTATCTGTAGAATACAATTCATCTATGGAGCTTTCTAAATTCATGCAACCATATGCAGCTGACTTCTATATATCTGTGTATCTTGAAGCAATTGTTTGTTTGATCAAAACCATCTTTGGGTTCAGACTGGTAACAAATATTCCAAGTTTTTATATTTAACATAGGAGTTGCCATTTGTTGACATATACCCACAGACACACACATGGATAGCTTGTTGACATTCTTAATTAATAGGTCAGAAATGGAAGACTCAATTCTCCAACCGGACTGTCTACCTTACAAGAACTATTAACTTTCTTCCTCATTACTTGTCCAGTAAAACATGAAGGGTTTAGAATTAGAATGTACATAATATATCAGTATTTTATAGTTAAAAGTTTCATTTTTTTGGGCTAACATATGAATATATCAGCATAAGCAGATCACCTCCTCTCTGGTTCTCCACCTTCAAGTAGCTTATAGTTATAATCACAATACGATCATTGGAAAACACAAGTATACTAATCTTGTTCCATGCTAAGACTATTGAAGTACTTGTGACAAACTGAGTAAGATTGCTGGATTTTGATTCAACAAAGTTGAGGGAGGAGGCCCAAAAAATTGTTGAGAATTTGTTAATCACCTGTGCGGCTGTGCCTTGAGTGTGATTCTAAATTATAGAGGAGAAGAGCTTAACTATTATTTTTGTTTTTATAAAAATGGGAGCAGTAGGAAGATGAAAGGACAGATCGAGGAACAGAATCCCAACTTGTCCAATATGGAATGAGAGAAATCAAAAGCAGGAAAGAGAAATGGTCCTTTGACCACTTTAGGAAGGTGTGAATCATTATTTACCTTTTTTAAATAACATAGGTGTCATTATTTACCTTTTTTAAGTCGTAAGTCTTGTAACAGGTGTCAGACTTGTTATTTTCCTTTCCCTAAATCCTTCACTGTTCAGGTCCAAATTCAGTTCTGCTCCGGTTTGTGCTCACTCCCACTTTCAATTAGCAACTAGTTATAATCTCTATATGATCCTTGGCCATAGAATCATTATGCTGTTGGACGTTGAAGTAGCAATGAAACTATATGTTGCGTACTTGGTCAGAGTTCTTCTCTTTCATTTTTCTAAATTGTTTAAAAACTTGCAGCCACAGTATGGAGATGTTCTAAAGTCTCATCCGAAACCCTTTCATCTGCCTCTTCCTCTGCCTCCGCATGAATATTCCTCACCGAAGAGAAAGGAGAGAGATGATGAAACGGAAGTTGACGAGGTTGGTTGTTCTTTGACTGTAATATTAGTACTATTGTCTATTTTTCTTAATTATTCAAAACTGTTTAAGGGTGGTGTTTTGAGATTTTTCAATGTTTTTCTTTTTTCTTGTGCAAGGTCATAACTTCTA
CTCCCTCCGTCCACAAAGATTGCTCATACCCTTTGTTTCTTTTTTTGAAAAGTTGGTTCATTTAAAAGTGCGTGAAGCTCTCCACCCCCCCCCCCCCCCCNCCCCCCCCTTTTGTAGTTGCACATGGGTTGTTGAGTGGGGAAAAATCAATGGGACATCCATTTTAGGAACGTGGAGCAATCTTAGTGGGACGGAGGGAGTATATATGTGGAAGATGGTGTAACTTATCAATTCATTTTACTTGTTTAAAAGCAAGGTTCAGTTGGTAAGGTAATCAATAGCCTTTTGATGCCGTTCAATGTTAGCAGTATTCATGATAAATGATGTTGTTAGATCATTCACGATAAATTTTGGGTGGTAGAATTCATGACAAATATTGGTTATCTGTTAGAGTATTCATGATAAAACTTTGATGCCATAAGAGCATTCAAGATAACGTAGTATTGTGGTAGGTAGTATTAATTCTGCTTTTTTTCTTTTCTTTTCTGGATGGTAGTGTTACTAGTGTAGCCATGCATTACTTGGAATGCTGATATTTTTGACTTTTTTCTTTTTAGTTCTCTTTCATACTACGAAATTTTCTTCTGTTAGAATTGGAAAATTCCTGGTGGATACCATGAGTTCCAATGAAATTACAATGAGATACTTCAACATTTTATAAAATCTCTATCATCCCCTCCAATGTTTAAACTAGAGTCAGTAGCGTAGAGTTGTGAGAATTAGTTGAGAATGAGAAAGAAAGAGGGAGGGAGGGAGAATACTAACTTATAAAAACTTGAGAGTGCATCGTATGGCATCAAATGAGATGGCATTTATAGGACTTGTTACAAAGGATCCTTTTATTTTTGTGGTCGAGTTGCTTGGCGATTTGCTATCTCCTCTGGGCTCCGGGGGCTGCTGATTTATTTTTTCAATGTATTTTATGTTGGACTATTACTTGTAGCGTTGGGTAGGGCTTGAGTAGTTAAGTAATTCTTTTGGGCATTGGGTTGTGTCTTGTCTATTTTTCAGTTGAAAACAACTATTACTCTATTAGTGGTTTGAAGATGTGTCTCGTTTGTACTATTCTTTTGGACCAATTGTGGAATAGTAAGATAAGTGGATTTGGGCTGGCAACTCAATAAACACCCAAAAGAGTTGTGGTGGTTGTGTATGTGTGTTTATGTTTTACACTTTCTCCTAAATACTTATGCCCAGCTCAAATGGGAAGTTATTTGGGGAGGTAGGGAGGGAATGTCATTAAGGGATAAGTTGATTGTCATGATGTAACCCATTTTCAGGAGTCCTTATTCCACTATTTTTGCTTGCAGGACGAGAATTTAGTTGATATTTCAAATGCTCAAGAGTTGCTAACAAGCCATGTTAAGCAAGCTAAGAGAGTTCGAGCAAGGTATGTCCCTCCTGATTTTGTGTCGCCTGACATTGTTGTTTTTATTAGCTATGATTGAATCTTCAGTTGGAATTTTTCACATCAAATGCTATCTCTAAATCACCTGTACAGGCTCAATGTTGATAACCGTAGTATGAAACAGAGAAACTGTTATCTTGAATACAGTAGAAATGCAAACGTAAATCAAAAGAATTAATTAACATGCATATATTCATTGTATACTTTCTCCGTTTCTAGATAATTGCAACACTTTCTGTTTCTTTTATGTCCGAAAATTATTGCAATACCAAAATAACAAATTTGCCATTACATAACCAAACCACTAACATAATTAGAAAACCAAAAACTGAAAATAACCCTACCACAGCACTACCCCAACCAACTAACCCCCTTCTACCTTCTGTCCATGCCAACTAACCCCCAACTTTCCTCCCCTCTCTCTAATCCCGTCACCTACCATCCACCAGGCACCACCATCTTCTTTCCTCCCTTTCACTTGTTGCCAGCACTTCAAACCACCAACGAACACAGCCTCACGCCTTACCCTAGACAACAGTCCACTCCAAACTACAGCCACACCTATCGGAATGCTACAATTGAAACCACAACT
AATATTTGTACTTCTGAGTTTTGATCACAAGTCCCCATAATTTGGCCAGCAAAGCTTTGAAACCTCTGGATATGATCACTGAAGGGTCAAATCTCAATTGTATGAAGATCAGCAGTAGCCACTAGGGAGTAGGGAGAGGCGAGAAAGCAGATGAGCAGAAGGGCGGTGATTCACCACATGAGATGAGAGAGAGAGTGGAAGGGAGGGCTTGTCAGTGGTGATCTAAGAGATAGGGGGAGGGGCATGCTGAGACGGGTGTGATTTGTGAAATATAGGAGGGCGGGCTTGGTGGGATTTAGGGGTCGTCGAGTGTGTGAATGACCCCTTACAACTTGCCCTACTTATGTGCTATGATTATTTAAAAAGCGTAGGAACTATGACTCAATGCAGTTGAAATAAAGTACAACTAGATCTAGAACAGAAGAATGAAAATGTGTTGGGGGGTGGGGGGGGGGGNGGGGGGGGGGAGAAAGCCCCTAAACATATCTCCTAAGCATATCTAATTACATCAAGAAAGAAAGTCTACATAGCTTCTTTGTACAAACAAGTAATTTCATTCCCATTTTCCGTTGTAACCGACCCCTAAATCAATATCCATGTGATCTTCCTTTAATCCTTCCAATGGGCTAAACTGTAACTTCCATACCCCTAAAGAAATTTAGACAGCATTTCCTACAAAAAAACAATGTTCCAATCAATTATTTTTTTTACTTCCAGTGGAATGTTAGTAATATTAGTAAATGTTAAACTTCGCATTTACCTTAGAGACATCTCTTCCCCTTGTTTATTTATTGCATTTGTAGGCTTATATAACTTTGAGCATATGAGCTAATTTGGTTAGAGGCTTAGAGCTTGCTACTGCCCAATGCTTATTACTAGGAGATGTGCATGTAATGTATGGATGGACCTTTTTGAACGTTTGCGCCTGTTTACAGATTAAGGGAAGAAAGATCACAACGCATTGAAAGATACAAAAGTAGACTTGCTCTTCTTCTTACACCTAACGGAGATGTGCAAAATCAGCAGGAATGATACAGTCCCTGGTCCCTGAGTAGTTGAGTACTACATAGCTAGTTTGTATGATTGATTTCTGGTTCCTGTTGCTTCCGTTCCTCCTGTAATATGTTACTAAAGGATGGGACGGCATCAGGTTTTGTAGATTAAGTTATGTCGCGAGTTTTGTTGTATGATTTTTCTTCAATGAGCAATCAGAGGTTTTGCCAACATTTTTCTTGGTTGGAAATATTGTTAAGAAGGGTAAGCTAAAGAGAAGTAGATTATTGTTATGTTGCTAATAGTTAGTTATTACTCGTAATAATTATTTTATGGGATATATTAATCCCACCGGGATATTTTTGGTAGAGGTCTCTCAGTTTGAGCAGAGTTGCTCACTTAATTGCTTATTGGTACTAGTCAACTGCTGGATCAAATAATTGTTCAATAATGAATCTAGTTGAAAGGAATAAAATCCTCCTACACAATTCTATCATTCTATGCACTTCTTTAATGATTAAAGAGTTATTGGATGAGCAGTAGAAACATGATTTGAGCACTTGTAATCTAGGAAATAGGAACAAG

mRNA sequence

GGTGCACCCAAATTGTAATCGTATAAACAAATTTAACATTCGAAATACAAATACCCCATACTCCAAATGTGGCAAATGCGGGAACGCATCTCCTTCCCCCTTCCTCCTCAGTCTTCCCGCATCTCCTTCCTTCTCCTCTCGAATCTCCTCCTCTTAAACCCTAACAACAAAACCAGCATCTTGTAGTTGTAGTCGGCGGCACCTCTGTCAACCACCTCTCCTCTGCTGCTGCTATTCTCGGGTGGGCGCCCCTCCTGCGCTAGCTCATGGACGGGACTGGACAGCATCAGCCTGGGTTATTAGCAGCTCCAACCACACCTCAAACTCAAACCTACTTGCATCCTCGCTTGGCCAATGCTTCAGATTCGATTCGATATCCATTAGCATCTACCGGTCGTGGATTGCTTCCGGCACCTAGAGGCATATTGCCTGATCCAGTGGCAAGTTCCATTGCCCCTAACTCGTGGGCTCTTCAGCAGTATCAACATCAGCACCATCACTCTCGATTGTCTGCTCCCTCTAGTCATCTTGTTCGCGGTCCTTTTTTGCATCCTTCTCAGGCTCATATATCACCTGTTTCATTCTCTGCTGCTGCTAGAGGCGTTCTATTCAACAATCGCCCTCAGACCATGGTTGGTTCATCTCCCTACTCTGCTCCTGATAGTAATGATCCTAAAGATCCCATAGACAAAGGTACTGATGATACGTTGGTCACTCTTAGAGATCGAAAGGTCAGGATATCAGACGGGGCATCACTGTATGCTCTCTGTCGGTCGTGGTTGAGGAATGGATGTGTAGAAGATAATCAACCACAGTATGGAGATGTTCTAAAGTCTCATCCGAAACCCTTTCATCTGCCTCTTCCTCTGCCTCCGCATGAATATTCCTCACCGAAGAGAAAGGAGAGAGATGATGAAACGGAAGTTGACGAGGACGAGAATTTAGTTGATATTTCAAATGCTCAAGAGTTGCTAACAAGCCATGTTAAGCAAGCTAAGAGAGTTCGAGCAAGATTAAGGGAAGAAAGATCACAACGCATTGAAAGATACAAAAGTAGACTTGCTCTTCTTCTTACACCTAACGGAGATGTGCAAAATCAGCAGGAATGATACAGTCCCTGGTCCCTGAGTAGTTGAGTACTACATAGCTAGTTTGTATGATTGATTTCTGGTTCCTGTTGCTTCCGTTCCTCCTGTAATATGTTACTAAAGGATGGGACGGCATCAGGTTTTGTAGATTAAGTTATGTCGCGAGTTTTGTTGTATGATTTTTCTTCAATGAGCAATCAGAGGTTTTGCCAACATTTTTCTTGGTTGGAAATATTGTTAAGAAGGGTAAGCTAAAGAGAAGTAGATTATTGTTATGTTGCTAATAGTTAGTTATTACTCGTAATAATTATTTTATGGGATATATTAATCCCACCGGGATATTTTTGGTAGAGGTCTCTCAGTTTGAGCAGAGTTGCTCACTTAATTGCTTATTGGTACTAGTCAACTGCTGGATCAAATAATTGTTCAATAATGAATCTAGTTGAAAGGAATAAAATCCTCCTACACAATTCTATCATTCTATGCACTTCTTTAATGATTAAAGAGTTATTGGATGAGCAGTAGAAACATGATTTGAGCACTTGTAATCTAGGAAATAGGAACAAG

Coding sequence (CDS)

ATGGACGGGACTGGACAGCATCAGCCTGGGTTATTAGCAGCTCCAACCACACCTCAAACTCAAACCTACTTGCATCCTCGCTTGGCCAATGCTTCAGATTCGATTCGATATCCATTAGCATCTACCGGTCGTGGATTGCTTCCGGCACCTAGAGGCATATTGCCTGATCCAGTGGCAAGTTCCATTGCCCCTAACTCGTGGGCTCTTCAGCAGTATCAACATCAGCACCATCACTCTCGATTGTCTGCTCCCTCTAGTCATCTTGTTCGCGGTCCTTTTTTGCATCCTTCTCAGGCTCATATATCACCTGTTTCATTCTCTGCTGCTGCTAGAGGCGTTCTATTCAACAATCGCCCTCAGACCATGGTTGGTTCATCTCCCTACTCTGCTCCTGATAGTAATGATCCTAAAGATCCCATAGACAAAGGTACTGATGATACGTTGGTCACTCTTAGAGATCGAAAGGTCAGGATATCAGACGGGGCATCACTGTATGCTCTCTGTCGGTCGTGGTTGAGGAATGGATGTGTAGAAGATAATCAACCACAGTATGGAGATGTTCTAAAGTCTCATCCGAAACCCTTTCATCTGCCTCTTCCTCTGCCTCCGCATGAATATTCCTCACCGAAGAGAAAGGAGAGAGATGATGAAACGGAAGTTGACGAGGACGAGAATTTAGTTGATATTTCAAATGCTCAAGAGTTGCTAACAAGCCATGTTAAGCAAGCTAAGAGAGTTCGAGCAAGATTAAGGGAAGAAAGATCACAACGCATTGAAAGATACAAAAGTAGACTTGCTCTTCTTCTTACACCTAACGGAGATGTGCAAAATCAGCAGGAATGA

Protein sequence

MDGTGQHQPGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWALQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRLALLLTPNGDVQNQQE
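The CDS and protein sequences above are related by translation with the standard genetic code. The sketch below checks this on the first and last codons of the CDS only; the `translate` helper is a hypothetical name, and using the standard code (NCBI translation table 1) is an assumption, although it is the usual choice for a nuclear plant gene.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGACGGGACTGGACAGCATCAGCCTGGG"))  # MDGTGQHQPG
# Last five codons of the CDS, ending in the TGA stop codon.
print(translate("AATCAGCAGGAATGA"))  # NQQE
```

Both results agree with the start (`MDGTGQHQPG...`) and end (`...NQQE`) of the protein sequence listed above.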
Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Spo18849.1    Spo18849.1   mRNA


Homology
BLAST of Spo18849.1 vs. NCBI nr
Match: gi|902162695|gb|KNA06718.1| (hypothetical protein SOVF_178440 [Spinacia oleracea])

HSP 1 Score: 541.6 bits (1394), Expect = 8.200e-151
Identity = 272/272 (100.00%), Positives = 272/272 (100.00%), Query Frame = 1

		  

Query: 9   PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA 68
           PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA
Sbjct: 2   PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA 61

Query: 69  LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY 128
           LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY
Sbjct: 62  LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY 121

Query: 129 SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL 188
           SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL
Sbjct: 122 SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL 181

Query: 189 KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA 248
           KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA
Sbjct: 182 KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA 241

Query: 249 RLREERSQRIERYKSRLALLLTPNGDVQNQQE 281
           RLREERSQRIERYKSRLALLLTPNGDVQNQQE
Sbjct: 242 RLREERSQRIERYKSRLALLLTPNGDVQNQQE 273

BLAST of Spo18849.1 vs. NCBI nr
Match: gi|731350598|ref|XP_010686585.1| (PREDICTED: uncharacterized protein LOC104900777 [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 244.2 bits (622), Expect = 2.700e-61
Identity = 155/270 (57.41%), Positives = 172/270 (63.70%), Query Frame = 1

		  

Query: 8   QPGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSW 67
           QP    APT+ Q+QT            IRYPLAS+GRG+L         P      PNS 
Sbjct: 60  QPVPFTAPTSIQSQT----------QPIRYPLASSGRGIL---------PTIGCTRPNSS 119

Query: 68  ALQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSP 127
             QQYQHQ         S+HL+    L P  AHISP    A AR +  NN PQ+  GSSP
Sbjct: 120 QHQQYQHQ--------SSAHLL----LRPLNAHISPA--PAVARPIPLNNPPQSKAGSSP 179

Query: 128 YSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDV 187
           Y   DS+D KDP+DKGTDD LVTLRDRKVRI+DGAS YALCRSWLRNG VE+NQPQYGD+
Sbjct: 180 YFIADSSDSKDPLDKGTDDMLVTLRDRKVRITDGASFYALCRSWLRNGYVEENQPQYGDI 239

Query: 188 LKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVR 247
                    LP PLP  E  SPKRKE DDETEVDE E L    NA+ELL+ HV++AKRVR
Sbjct: 240 FSR-----SLPKPLPLPESFSPKRKECDDETEVDEAEKL---GNAEELLSMHVRRAKRVR 288

Query: 248 ARLREERSQRIERYKSRLALLL-TPNGDVQ 277
            RLREER QRIERYKSRL LLL  PNG  Q
Sbjct: 300 TRLREERLQRIERYKSRLGLLLHPPNGTTQ 288

BLAST of Spo18849.1 vs. NCBI nr
Match: gi|731415441|ref|XP_010659555.1| (PREDICTED: SH3 domain-containing protein C23A1.17 [Vitis vinifera])

HSP 1 Score: 174.9 bits (442), Expect = 2.000e-40
Identity = 120/264 (45.45%), Positives = 155/264 (58.71%), Query Frame = 1

		  

Query: 25  HPRLANASD---SIRYPLASTGRGLLPAP-RGILPDPVASSIA-------PNSWALQQYQ 84
           +P+LA   D    I YP+AS+GRG +P P R    D    ++A       P S A     
Sbjct: 75  NPQLAKPHDPPQGILYPVASSGRGFIPKPLRPQSSDHNTVTVANPGAAFPPRSAATAAAA 134

Query: 85  HQHHHSRLSAPSS------HLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSP 144
             H       P S      H +R P L PS   ++ V  SA  +G+  +  P+  V  SP
Sbjct: 135 FSHQARPFGFPQSDLNYPVHSMRMPHLLPSHVGVTAVPGSAPIKGIPVSAHPK--VAPSP 194

Query: 145 YSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDV 204
            S  D N  KD  D+  DDT VT+RDRKVRISDGAS+YALCRSWLRNG  E+ QPQ+ D 
Sbjct: 195 PSVSDCNGYKDSRDRNRDDTFVTVRDRKVRISDGASIYALCRSWLRNGFSEETQPQHYDS 254

Query: 205 LKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVR 264
           +KS P+    PLP+P  + + PK+KE D+E   +EDE  V+    Q+LL  H+K+AK+VR
Sbjct: 255 MKSLPR----PLPIPVTDPNLPKKKEDDEE---EEDEGSVENLLPQDLLQRHIKRAKKVR 314

Query: 265 ARLREERSQRIERYKSRLALLLTP 272
           ARLRE+R +RI RYK+RLALLL P
Sbjct: 315 ARLREQRLKRIARYKTRLALLLPP 329

BLAST of Spo18849.1 vs. NCBI nr
Match: gi|1009106179|ref|XP_015873160.1| (PREDICTED: uncharacterized protein LOC107410261 [Ziziphus jujuba])

HSP 1 Score: 166.0 bits (419), Expect = 9.400e-38
Identity = 126/294 (42.86%), Positives = 163/294 (55.44%), Query Frame = 1

		  

Query: 7   HQPGLLAAPTTPQTQTYLHPRL--------ANASDSIRYPLASTGRGLLPAPRGILPDPV 66
           HQP L AA T P      + +L        ++ +  I YPLAS+GRG +P  + + P PV
Sbjct: 71  HQP-LYAAQTLPIASPNPNFQLPAKPPNDPSSPAHPISYPLASSGRGFVP--KAVRPVPV 130

Query: 67  ASSIA-----PNSWALQQY-----------QHQHHHSRLSAPSSHLVRGPFLHPSQAHIS 126
            S  A     P  +  +             QH  H + L  P  +L       P Q    
Sbjct: 131 ISDQAVTVANPGGYPPRPVVNFHHGVDGVRQHLDHAAHLMRPPHNLQHHHHYLPHQQVHR 190

Query: 127 PVSFSAAA--RGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISD 186
           P   SAA   +GV  +   +    + P S PDSN  KD  DK  DDTL  +RDRKVRI++
Sbjct: 191 PHLGSAAVPVKGVPVSAHLKV---APPSSVPDSNGYKDSRDKSRDDTLAIIRDRKVRITE 250

Query: 187 GASLYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKERDDETEV 246
           GASLYALCRSWLRNG  E++QPQYGD ++S PK    P P+       PK+KE +++ E 
Sbjct: 251 GASLYALCRSWLRNGAPEESQPQYGDAVRSLPK----PSPIHMTNTDLPKKKEGEEDGEQ 310

Query: 247 DE---DENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRLALLLTP 272
           +E   DE  V+  ++QELL  HVK+AK++RARLREER +RI RYKSRLALLL P
Sbjct: 311 EEEVKDEESVEHLSSQELLKRHVKRAKKIRARLREERLKRIARYKSRLALLLPP 354

BLAST of Spo18849.1 vs. NCBI nr
Match: gi|720084883|ref|XP_010243354.1| (PREDICTED: leucine-rich repeat extensin-like protein 5 [Nelumbo nucifera])

HSP 1 Score: 164.9 bits (416), Expect = 2.100e-37
Identity = 114/276 (41.30%), Positives = 145/276 (52.54%), Query Frame = 1

		  

Query: 10  GLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIA------ 69
           G  + P  P     +          I YP+AS+GRG +P  +   P PV   +       
Sbjct: 55  GKTSNPNPPAQMAKIQDPSVPPPQGILYPVASSGRGFIP--KSFRPQPVDQLVTVANPGG 114

Query: 70  --PNS---WALQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNR 129
             P S   +A Q              + HL+R P + P   H+ P    A   G    + 
Sbjct: 115 FPPRSVVAFANQVRPFSFPPGDPQVQAVHLMRPPHMQPP--HLGPRHIGATVSGPPIKSI 174

Query: 130 PQTM---VGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNG 189
           P  +       P S  D N  K+  D+  DDT+VT+ DRKVR+SDGASLYALCRSW+RNG
Sbjct: 175 PLVVHPKAAQFPSSTSDFNGYKELRDRSRDDTVVTIHDRKVRLSDGASLYALCRSWVRNG 234

Query: 190 CVEDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQEL 249
             +++QPQ+G+ +K  P+    PLP    E   PK+ E DDE E  EDE  V+  +AQEL
Sbjct: 235 LPQESQPQFGEGVKLLPR----PLPTSISEIPLPKKTEGDDEDEKKEDEGSVEELSAQEL 294

Query: 250 LTSHVKQAKRVRARLREERSQRIERYKSRLALLLTP 272
           L  HVK AK+VRARLREER QRI RYK RLALLL P
Sbjct: 295 LQRHVKHAKKVRARLREERLQRIARYKQRLALLLPP 322

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A0K9QJ54_SPIOL (Uncharacterized protein OS=Spinacia oleracea GN=SOVF_178440 PE=4 SV=1)

HSP 1 Score: 541.6 bits (1394), Expect = 5.700e-151
Identity = 272/272 (100.00%), Positives = 272/272 (100.00%), Query Frame = 1

		  

Query: 9   PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA 68
           PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA
Sbjct: 2   PGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSWA 61

Query: 69  LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY 128
           LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY
Sbjct: 62  LQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPY 121

Query: 129 SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL 188
           SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL
Sbjct: 122 SAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVL 181

Query: 189 KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA 248
           KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA
Sbjct: 182 KSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRA 241

Query: 249 RLREERSQRIERYKSRLALLLTPNGDVQNQQE 281
           RLREERSQRIERYKSRLALLLTPNGDVQNQQE
Sbjct: 242 RLREERSQRIERYKSRLALLLTPNGDVQNQQE 273

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A0J8BT56_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_8g184170 PE=4 SV=1)

HSP 1 Score: 244.2 bits (622), Expect = 1.900e-61
Identity = 155/270 (57.41%), Positives = 172/270 (63.70%), Query Frame = 1

		  

Query: 8   QPGLLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPNSW 67
           QP    APT+ Q+QT            IRYPLAS+GRG+L         P      PNS 
Sbjct: 60  QPVPFTAPTSIQSQT----------QPIRYPLASSGRGIL---------PTIGCTRPNSS 119

Query: 68  ALQQYQHQHHHSRLSAPSSHLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSP 127
             QQYQHQ         S+HL+    L P  AHISP    A AR +  NN PQ+  GSSP
Sbjct: 120 QHQQYQHQ--------SSAHLL----LRPLNAHISPA--PAVARPIPLNNPPQSKAGSSP 179

Query: 128 YSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDV 187
           Y   DS+D KDP+DKGTDD LVTLRDRKVRI+DGAS YALCRSWLRNG VE+NQPQYGD+
Sbjct: 180 YFIADSSDSKDPLDKGTDDMLVTLRDRKVRITDGASFYALCRSWLRNGYVEENQPQYGDI 239

Query: 188 LKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVR 247
                    LP PLP  E  SPKRKE DDETEVDE E L    NA+ELL+ HV++AKRVR
Sbjct: 240 FSR-----SLPKPLPLPESFSPKRKECDDETEVDEAEKL---GNAEELLSMHVRRAKRVR 288

Query: 248 ARLREERSQRIERYKSRLALLL-TPNGDVQ 277
            RLREER QRIERYKSRL LLL  PNG  Q
Sbjct: 300 TRLREERLQRIERYKSRLGLLLHPPNGTTQ 288

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A061E3E3_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 2 OS=Theobroma cacao GN=TCM_005945 PE=4 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 1.500e-34
Identity = 99/237 (41.77%), Positives = 140/237 (59.07%), Query Frame = 1

		  

Query: 35  IRYPLASTGRGLLPAPRGILPDPVASSIAPNSWALQQYQHQHHHSRLSAPSSHLVRGPFL 94
           + YP+AS+GRG LP      P      + P  +    + H HH +    PS         
Sbjct: 49  VMYPVASSGRGFLPTNHPCRP------LLP--YHHHPHPHPHHFANPRPPS--------- 108

Query: 95  HPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDR 154
            PS +   P  F    + +  +  P+  V  SP S  ++N  K+  D+  DD+LV +RDR
Sbjct: 109 -PSLSLPHPTHFHPPLKALSLSLHPK--VAPSPSSLSETNGYKNVRDRTKDDSLVNVRDR 168

Query: 155 KVRISDGASLYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKER 214
           KVRI+DGAS+YALCRSWLRNG  ++ QPQYGDV KS P+P  LP+P+  +     + +E 
Sbjct: 169 KVRITDGASVYALCRSWLRNGFPDETQPQYGDVSKSLPQP--LPIPVTDNLLKDTEDEEE 228

Query: 215 DDETEVDEDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRLALLLTP 272
            ++ +  EDE  V+  +AQ+LL  H+ +AK+VR+RLR+ER +RI RYK+RLALLL P
Sbjct: 229 QEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLALLLPP 263

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A0V0I3K0_SOLCH (Putative mucin-2-like OS=Solanum chacoense PE=4 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 9.800e-34
Identity = 109/266 (40.98%), Positives = 145/266 (54.51%), Query Frame = 1

		  

Query: 11  LLAAPTTPQTQTYLHPRLANASDSIRYPLASTGRGLLPAPRGILPDPVASSIAPN-SWAL 70
           L+  P  P +Q +LH        SI YP+AS+GRG L  P      PV S +    ++ L
Sbjct: 78  LVLKPPNPDSQPHLH--------SILYPVASSGRGFLSKPSNYPXRPVVSHLGSRPTFGL 137

Query: 71  QQYQHQHHHSRLSAPS--SHLVRG--PFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGS 130
            Q       S    PS   H + G  P ++ +    S      A +G    +     + S
Sbjct: 138 NQMDPGLGQSTGVRPSHLQHALLGSSPTVNSAGPAASAGVLPGAVKGFPVVSSSHHKIAS 197

Query: 131 SPYSAPDSNDPKDPIDKGTDDTLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYG 190
           +  S  D N  ++  D+  DDT   +RDRKVRISD ASLY LCRSWLRNG  +D Q QY 
Sbjct: 198 TQPSLSDCNGFREKRDRSKDDTFAIIRDRKVRISDNASLYTLCRSWLRNGLPDDTQSQYM 257

Query: 191 DVLKSHPKPFHLPLPLPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKR 250
           D ++S P+    PL L P +  SP +KE D E E +  E++  +S  +ELL  HVK+AKR
Sbjct: 258 DGVRSLPR----PLALAPQDAESPVKKEGDKEEEXEAGESVEHLS-PKELLQRHVKRAKR 317

Query: 251 VRARLREERSQRIERYKSRLALLLTP 272
           +R+RLREER +RI RYK+RLALLL P
Sbjct: 318 IRSRLREERLRRIARYKTRLALLLPP 330

BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Match: A0A061DWJ9_THECC (Hydroxyproline-rich glycoprotein family protein, putative isoform 1 OS=Theobroma cacao GN=TCM_005945 PE=4 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 3.700e-33
Identity = 100/238 (42.02%), Positives = 139/238 (58.40%), Query Frame = 1

		  

Query: 35  IRYPLASTGRGLLPAPRGILPDPVASSIAPNSWALQQYQHQHHHSRLSAPSSHLVRGPFL 94
           + YP+AS+GRG LP      P      + P  +    + H HH +    PS         
Sbjct: 49  VMYPVASSGRGFLPTNHPCRP------LLP--YHHHPHPHPHHFANPRPPS--------- 108

Query: 95  HPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDDTLVTLRDR 154
            PS +   P  F    + +  +  P+  V  SP S  ++N  K+  D+  DD+LV +RDR
Sbjct: 109 -PSLSLPHPTHFHPPLKALSLSLHPK--VAPSPSSLSETNGYKNVRDRTKDDSLVNVRDR 168

Query: 155 KVRISDGASLYALCRSWLRNGCV-EDNQPQYGDVLKSHPKPFHLPLPLPPHEYSSPKRKE 214
           KVRI+DGAS+YALCRSWLRNG   E  QPQYGDV KS P+P  LP+P+  +     + +E
Sbjct: 169 KVRITDGASVYALCRSWLRNGFPDETQQPQYGDVSKSLPQP--LPIPVTDNLLKDTEDEE 228

Query: 215 RDDETEVDEDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRLALLLTP 272
             ++ +  EDE  V+  +AQ+LL  H+ +AK+VR+RLR+ER +RI RYK+RLALLL P
Sbjct: 229 EQEQEDKKEDEQSVENLSAQDLLKRHIDRAKKVRSRLRQERLKRIARYKTRLALLLPP 264

BLAST of Spo18849.1 vs. TAIR (Arabidopsis)
Match: AT2G32840.1 (proline-rich family protein)

HSP 1 Score: 123.6 bits (309), Expect = 1.900e-28
Identity = 101/249 (40.56%), Positives = 136/249 (54.62%), Query Frame = 1

		  

Query: 34  SIRYPLASTGRGLLPAP----RGILPDPVASSIAPNSWALQQYQHQHHHSRLSA---PSS 93
           S+ YP  S+GRG    P       + DPV S  +P  +  +   + +HH +  +   P +
Sbjct: 96  SLIYPFGSSGRGFPTRPVRQNSNSVADPVGSP-SPGGYTPRGPVYGYHHGQFVSNLDPMN 155

Query: 94  HLVRGPFLHPSQAHISPVSFSAAARGVLFNNRPQTMVGSSPYSAPDSNDPKDPIDKGTDD 153
             +R    HP Q   SP   S   +GV    +P+     SP S  D++  K    +  DD
Sbjct: 156 QFMRAA--HP-QNQQSPQLGSGHMKGVPHFLQPRAT--PSPTSILDNSGHKKA--RSRDD 215

Query: 154 TLVTLRDRKVRISDGASLYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLPLPPHEY 213
            LV +R RKVRI++GASLY+LCRSWLRNG  E  +PQ  D++   PK    PLP+   E 
Sbjct: 216 ALVLVRKRKVRITEGASLYSLCRSWLRNGAHEGIKPQRIDMMTCLPK----PLPVDKTET 275

Query: 214 SSPKRKERDDETEVD-EDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIERYKSRL 273
           S PK    +   E D EDE  V   +  +LL  H+ +AK+VRARLREER +RI RYK+RL
Sbjct: 276 SLPKDLVEEAICEEDKEDEESVKHLSESDLLKRHIDRAKKVRARLREERLKRIARYKARL 332

Query: 274 ALLLTPNGD 275
           ALLL P G+
Sbjct: 336 ALLLPPFGE 332

BLAST of Spo18849.1 vs. TAIR (Arabidopsis)
Match: AT1G04930.2 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 95.5 bits (236), Expect = 5.500e-20
Identity = 60/134 (44.78%), Positives = 84/134 (62.69%), Query Frame = 1

		  

Query: 142 KGTDDTLVTLRDRKVRISDGAS-LYALCRSWLRNGCVEDNQPQYGDVLKSHPKPFHLPLP 201
           +  D  L  +R RKVRI++G+S LY+L RSWL+NG     QPQ   ++K  PKP  LP+ 
Sbjct: 231 RSKDGALAVVRGRKVRITEGSSSLYSLGRSWLKNGAHVGIQPQRSGIMKPLPKP--LPVD 290

Query: 202 LPPHEYSSPKRKERDDETEVDEDENLVDISNAQELLTSHVKQAKRVRARLREERSQRIER 261
           L   E S P   + +   E  EDE  V   + ++LL  H+++AK+VRA+LREERS+RI R
Sbjct: 291 LTT-ETSVPDDPDEESADEDKEDEEAVKQLSEKDLLKRHIERAKKVRAQLREERSRRIRR 350

Query: 262 YKSRLALLLTPNGD 275
           YK R+ L+L  + D
Sbjct: 351 YKERITLILAQSED 361

The following BLAST results are available for this feature:
BLAST of Spo18849.1 vs. NCBI nr
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. NCBI nr)
Total hits: 5
Match Name                         E-value   Identity  Description
gi|902162695|gb|KNA06718.1|        8.2e-151  100.      hypothetical protein SOVF_1784...
gi|731350598|ref|XP_010686585.1|   2.7e-61   57.4      PREDICTED: uncharacterized pro...
gi|731415441|ref|XP_010659555.1|   2.0e-40   45.4      PREDICTED: SH3 domain-containi...
gi|1009106179|ref|XP_015873160.1|  9.4e-38   42.8      PREDICTED: uncharacterized pro...
gi|720084883|ref|XP_010243354.1|   2.1e-37   41.3      PREDICTED: leucine-rich repeat...
BLAST of Spo18849.1 vs. UniProtKB/TrEMBL
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. UniprotKB/TrEMBL)
Total hits: 5
Match Name        E-value   Identity  Description
A0A0K9QJ54_SPIOL  5.7e-151  100.      Uncharacterized protein OS=Spi...
A0A0J8BT56_BETVU  1.9e-61   57.4      Uncharacterized protein OS=Bet...
A0A061E3E3_THECC  1.5e-34   41.7      Hydroxyproline-rich glycoprote...
A0A0V0I3K0_SOLCH  9.8e-34   40.9      Putative mucin-2-like OS=Solan...
A0A061DWJ9_THECC  3.7e-33   42.0      Hydroxyproline-rich glycoprote...
BLAST of Spo18849.1 vs. ExPASy Swiss-Prot
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. ExPASy SwissProt)
Total hits: 0
Match Name  E-value  Identity  Description
BLAST of Spo18849.1 vs. TAIR (Arabidopsis)
Analysis Date: 2018-06-29 (blastp Spinacia oleracea peptides vs. TAIR)
Total hits: 2
Match Name   E-value  Identity  Description
AT2G32840.1  1.9e-28  40.5      proline-rich family protein
AT1G04930.2  5.5e-20  44.7      hydroxyproline-rich glycoprote...
InterPro
Analysis Name: InterPro Annotations of S. oleracea
Date Performed: 2018-06-29
IPR Term   IPR Description   Source   Source Term    Source Description                               Alignment
IPR028226  Protein LIN37     PFAM     PF15306        LIN37                                            coord: 148..268; score: 3.4
None       No IPR available  PANTHER  PTHR37173      FAMILY NOT NAMED                                 coord: 10..274; score: 1.5
None       No IPR available  PANTHER  PTHR37173:SF1  HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEIN  coord: 10..274; score: 1.5

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0007049 cell cycle
biological_process GO:0009098 leucine biosynthetic process
biological_process GO:0006974 cellular response to DNA damage stimulus
biological_process GO:0042742 defense response to bacterium
biological_process GO:0000023 maltose metabolic process
biological_process GO:0043085 positive regulation of catalytic activity
biological_process GO:0046777 protein autophosphorylation
biological_process GO:0010155 regulation of proton transport
biological_process GO:0009637 response to blue light
biological_process GO:0019252 starch biosynthetic process
biological_process GO:0009228 thiamine biosynthetic process
biological_process GO:0006094 gluconeogenesis
biological_process GO:0019761 glucosinolate biosynthetic process
biological_process GO:0006096 glycolytic process
biological_process GO:0046487 glyoxylate metabolic process
biological_process GO:0009097 isoleucine biosynthetic process
biological_process GO:0019643 reductive tricarboxylic acid cycle
biological_process GO:0015991 ATP hydrolysis coupled proton transport
biological_process GO:0009651 response to salt stress
biological_process GO:0009099 valine biosynthetic process
biological_process GO:0006508 proteolysis
biological_process GO:0009737 response to abscisic acid
biological_process GO:0009414 response to water deprivation
biological_process GO:0009627 systemic acquired resistance
biological_process GO:0008152 metabolic process
biological_process GO:0006121 mitochondrial electron transport, succinate to ubiquinone
biological_process GO:0006099 tricarboxylic acid cycle
biological_process GO:0007010 cytoskeleton organization
biological_process GO:0010498 proteasomal protein catabolic process
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0009853 photorespiration
biological_process GO:0015986 ATP synthesis coupled proton transport
biological_process GO:0052837 thiazole biosynthetic process
biological_process GO:0006457 protein folding
biological_process GO:0009089 lysine biosynthetic process via diaminopimelate
biological_process GO:0042254 ribosome biogenesis
biological_process GO:0001731 formation of translation preinitiation complex
biological_process GO:0006446 regulation of translational initiation
biological_process GO:0006412 translation
biological_process GO:0019877 diaminopimelate biosynthetic process
biological_process GO:0000028 ribosomal small subunit assembly
biological_process GO:0006487 protein N-linked glycosylation
biological_process GO:0080119 ER body organization
biological_process GO:0006886 intracellular protein transport
biological_process GO:0006888 ER to Golgi vesicle-mediated transport
biological_process GO:0008284 positive regulation of cell proliferation
cellular_component GO:0000139 Golgi membrane
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0019013 viral nucleocapsid
cellular_component GO:1990904 ribonucleoprotein complex
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005634 nucleus
cellular_component GO:0005576 extracellular region
cellular_component GO:0005783 endoplasmic reticulum
cellular_component GO:0005789 endoplasmic reticulum membrane
cellular_component GO:0031410 cytoplasmic vesicle
cellular_component GO:0030127 COPII vesicle coat
cellular_component GO:0005743 mitochondrial inner membrane
cellular_component GO:0045281 succinate dehydrogenase complex
cellular_component GO:0071011 precatalytic spliceosome
cellular_component GO:0033116 endoplasmic reticulum-Golgi intermediate compartment membrane
cellular_component GO:0009316 3-isopropylmalate dehydratase complex
cellular_component GO:0005840 ribosome
cellular_component GO:0005739 mitochondrion
cellular_component GO:0009507 chloroplast
cellular_component GO:0000275 mitochondrial proton-transporting ATP synthase complex, catalytic core F(1)
cellular_component GO:0005852 eukaryotic translation initiation factor 3 complex
cellular_component GO:0033290 eukaryotic 48S preinitiation complex
cellular_component GO:0016282 eukaryotic 43S preinitiation complex
cellular_component GO:0009941 chloroplast envelope
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0005829 cytosol
cellular_component GO:0017053 transcriptional repressor complex
cellular_component GO:0005747 mitochondrial respiratory chain complex I
cellular_component GO:0022627 cytosolic small ribosomal subunit
cellular_component GO:0010319 stromule
cellular_component GO:0009579 thylakoid
molecular_function GO:0051538 3 iron, 4 sulfur cluster binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0051082 unfolded protein binding
molecular_function GO:0008840 4-hydroxy-tetrahydrodipicolinate synthase
molecular_function GO:0042803 protein homodimerization activity
molecular_function GO:0046933 proton-transporting ATP synthase activity, rotational mechanism
molecular_function GO:0005515 protein binding
molecular_function GO:0000166 nucleotide binding
molecular_function GO:0008177 succinate dehydrogenase (ubiquinone) activity
molecular_function GO:0009055 electron transfer activity
molecular_function GO:0046872 metal ion binding
molecular_function GO:0051537 2 iron, 2 sulfur cluster binding
molecular_function GO:0003723 RNA binding
molecular_function GO:0003743 translation initiation factor activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003861 3-isopropylmalate dehydratase activity
molecular_function GO:0003735 structural constituent of ribosome
molecular_function GO:0003677 DNA binding
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0051539 4 iron, 4 sulfur cluster binding
molecular_function GO:0003994 aconitate hydratase activity
molecular_function GO:0050486 intramolecular transferase activity, transferring hydroxy groups
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003676 nucleic acid binding
RNA-Seq Expression