TWGFD

The tetraploid wheat gene family database

Information

Gene Family: GATA

Gene Id:TRIDC4AG001000.1

exon number:5

Chromosome number:4A

Start Position:5176002

End Position:5181181

Genome length:5179 bp

Strand positive:-

Protein length:697 aa

Molecular weight:79.23 kDa

Theoretical pI:8.8

Total number of negatively charged residues(Asp + Glu):79

Total number of positively charged residues (Arg + Lys):91

Instability index (II):51.64

Aliphatic index:67.3

Grand average of hydropathicity (GRAVY):-0.54

Ortholog groups

Protein ID Ortholog gene symbol
TRIDC4AG001000.1 - -
TRIDC4AG001000.1 Os03t0181600-02 OsGATA22

Sequence

Protein sequence:

>TRIDC4AG001000.1

MRRGPMGRRTLCNACGIAWAKGKLRKVIDSDTPIDDVPVAKMVPEVGMEFDNEDKAYEFYNRYAGHIGFSVRKSSSDKSADNITRSRTFVCSREGFRKDKKGANKVKRPRPETRIGCPARMIIKITSYNKYRIAEFVADHNHQPAPPSTMHMLRSQRVLTEVQTTECNSSEDSTTPSRFSGGSLGQQAGAFRNVNFLPADYRSSLCSKRTKNMQPGDAGGVVKYLQSMQLNNPSFFYAVQLDEDDKLTNIFWADSKSRVDFSYFSDVVCLDTTYKINAHGRPLTLFLGVNHHKQISIFGAALLYDESVESFKWLFDTFKIATDGKQPKTILTDQSIAASAAISAVWPSTIHCLCPWQVYQNTVKHLNHIFQGSKTFAKDFSRCVYDYEDEEGFLVGWRTMLEKYDLRNNEWLHKLFEDRDKWASAYNRHVFTADINGSLQLECVSNVLRKYLSPQFDFLSFFKHYERVLDEHRYAELQADFHASQSFPRIPPSKMLKQAANIYTPVVFEKFRREFEMFVDSVIYSCGESGTASDYRVAVTDRPGEHYVRFDSSDLSVACSCKKIESMGIQCCHVLKVLDFRNIKELPQKYLMRRWTKDAKSADRGNQEFLSDGALQTPSSRLNVPVPIINQPQSHLNNDHDHAASVSSFGHQALQGNANGNQAESCQARRIPHVTAHETVKSQRKCTVADPGSGTWQ

CDS sequence:

>TRIDC4AG001000.1

ATGCGTCGTGGACCGATGGGACGGCGGACTTTATGCAATGCATGTGGAATAGCATGGGCAAAGGGAAAATTGAGAAAAGTTATTGATTCTGACACCCCCATAGATGATGTTCCTGTTGCAAAAATGGTGCCTGAAGTCGGCATGGAATTTGACAATGAAGACAAAGCATATGAATTTTATAACAGGTATGCTGGACACATCGGCTTTAGTGTTCGTAAGAGTTCATCGGACAAATCAGCTGACAACATCACAAGATCAAGGACCTTTGTATGCTCGAGGGAGGGTTTCCGTAAGGACAAAAAAGGAGCTAATAAAGTTAAGAGGCCAAGGCCAGAAACAAGAATAGGATGCCCCGCACGGATGATAATTAAGATTACATCCTATAATAAATATCGCATTGCTGAATTTGTAGCAGACCATAACCATCAGCCAGCACCCCCATCAACCATGCATATGCTGAGATCTCAGAGGGTGCTTACTGAGGTACAAACAACTGAATGCAATTCCTCAGAAGATTCCACAACACCGTCAAGGTTTTCTGGTGGCTCTTTAGGACAGCAGGCAGGAGCTTTTAGAAATGTTAATTTCCTCCCTGCAGATTACAGAAGTTCCCTTTGTTCAAAGCGTACGAAAAATATGCAACCTGGTGATGCAGGAGGTGTTGTGAAGTACCTGCAGAGCATGCAGCTAAACAATCCGTCTTTCTTTTATGCTGTCCAGCTTGATGAGGATGACAAACTGACCAACATTTTCTGGGCCGATTCCAAATCTAGAGTTGATTTCAGCTACTTCAGTGACGTGGTTTGTTTGGACACAACCTACAAGATAAATGCACATGGAAGGCCATTAACTCTCTTCCTTGGAGTGAATCATCACAAGCAAATCTCCATATTTGGTGCTGCTTTGCTTTATGATGAATCAGTGGAATCGTTCAAGTGGTTGTTTGACACGTTCAAGATTGCTACAGATGGAAAGCAGCCAAAAACAATCTTGACAGATCAATCAATTGCAGCAAGTGCTGCCATAAGCGCAGTATGGCCAAGTACAATTCACTGTCTTTGCCCATGGCAAGTGTACCAAAACACTGTCAAACACCTTAATCACATCTTCCAAGGCTCTAAAACATTTGCAAAGGATTTCAGCAGATGTGTTTATGATTATGAGGATGAAGAGGGTTTCTTGGTAGGATGGAGAACCATGCTAGAGAAGTATGATCTAAGAAACAATGAATGGCTTCATAAGTTATTCGAAGATCGAGATAAATGGGCATCAGCGTACAATCGACATGTATTCACCGCAGATATAAATGGTTCATTGCAGTTAGAGTGTGTTAGCAATGTCTTGAGAAAGTACTTGAGTCCACAGTTTGATTTTTTGTCTTTCTTCAAGCACTATGAAAGAGTGTTGGATGAGCATCGCTATGCAGAGCTACAAGCTGATTTTCATGCAAGCCAAAGCTTCCCGAGAATACCTCCCTCGAAAATGCTGAAACAAGCTGCCAACATATACACACCTGTGGTTTTTGAAAAATTTCGCAGAGAGTTTGAGATGTTTGTGGATTCAGTGATCTACAGTTGTGGGGAGTCGGGAACTGCATCGGACTATAGAGTAGCAGTAACAGATAGACCTGGGGAACACTATGTTAGGTTTGACTCCAGTGACTTATCTGTAGCTTGCAGTTGTAAAAAAATCGAATCAATGGGTATCCAGTGTTGCCATGTGCTGAAAGTCCTTGATTTCAGAAATATAAAGGAGTTGCCACAAAAATATTTGATGAGAAGATGGACGAAGGATGCAAAGTCTGCAGACAGAGGCAATCAAGAGTTCTTGAGTGATGGCGCTTTGCAAACTCCAAGCTCTCGTTTAAATGTCCCTGTGCCAATTATAAATCAACCACAATCACACTTAAATAATGACCATGACCATGCCGCTTCTGTTTCTAGCTTCGGCCACCAAGCCCTTCAGGGAAATGCCAATGGAAACCAGGCAGAATCATGTCAAGCACGGCGCATTCCTCACGTCACCGCACACGAAACAGTCAAATCGCAGCGCAAATGTACCGTTGCTGATCCTGGCTCAGGCACCTGGCAGTAG