TWGFD
The tetraploid wheat gene family database
Information
Gene Id:TRIDC4AG001000.1
exon number:5
Chromosome number:4A
Start Position:5176002
End Position:5181181
Genome length:5179 bp
Strand positive:-
Protein length:697 aa
Molecular weight:79.23 kDa
Theoretical pI:8.8
Total number of negatively charged residues(Asp + Glu):79
Total number of positively charged residues (Arg + Lys):91
Instability index (II):51.64
Aliphatic index:67.3
Grand average of hydropathicity (GRAVY):-0.54
Ortholog groups
Protein ID | Ortholog | gene symbol |
---|---|---|
TRIDC4AG001000.1 | - | - |
TRIDC4AG001000.1 | Os03t0181600-02 | OsGATA22 |
Sequence
Protein sequence:
>TRIDC4AG001000.1
MRRGPMGRRTLCNACGIAWAKGKLRKVIDSDTPIDDVPVAKMVPEVGMEFDNEDKAYEFYNRYAGHIGFSVRKSSSDKSADNITRSRTFVCSREGFRKDKKGANKVKRPRPETRIGCPARMIIKITSYNKYRIAEFVADHNHQPAPPSTMHMLRSQRVLTEVQTTECNSSEDSTTPSRFSGGSLGQQAGAFRNVNFLPADYRSSLCSKRTKNMQPGDAGGVVKYLQSMQLNNPSFFYAVQLDEDDKLTNIFWADSKSRVDFSYFSDVVCLDTTYKINAHGRPLTLFLGVNHHKQISIFGAALLYDESVESFKWLFDTFKIATDGKQPKTILTDQSIAASAAISAVWPSTIHCLCPWQVYQNTVKHLNHIFQGSKTFAKDFSRCVYDYEDEEGFLVGWRTMLEKYDLRNNEWLHKLFEDRDKWASAYNRHVFTADINGSLQLECVSNVLRKYLSPQFDFLSFFKHYERVLDEHRYAELQADFHASQSFPRIPPSKMLKQAANIYTPVVFEKFRREFEMFVDSVIYSCGESGTASDYRVAVTDRPGEHYVRFDSSDLSVACSCKKIESMGIQCCHVLKVLDFRNIKELPQKYLMRRWTKDAKSADRGNQEFLSDGALQTPSSRLNVPVPIINQPQSHLNNDHDHAASVSSFGHQALQGNANGNQAESCQARRIPHVTAHETVKSQRKCTVADPGSGTWQ
CDS sequence:
>TRIDC4AG001000.1
ATGCGTCGTGGACCGATGGGACGGCGGACTTTATGCAATGCATGTGGAATAGCATGGGCAAAGGGAAAATTGAGAAAAGTTATTGATTCTGACACCCCCATAGATGATGTTCCTGTTGCAAAAATGGTGCCTGAAGTCGGCATGGAATTTGACAATGAAGACAAAGCATATGAATTTTATAACAGGTATGCTGGACACATCGGCTTTAGTGTTCGTAAGAGTTCATCGGACAAATCAGCTGACAACATCACAAGATCAAGGACCTTTGTATGCTCGAGGGAGGGTTTCCGTAAGGACAAAAAAGGAGCTAATAAAGTTAAGAGGCCAAGGCCAGAAACAAGAATAGGATGCCCCGCACGGATGATAATTAAGATTACATCCTATAATAAATATCGCATTGCTGAATTTGTAGCAGACCATAACCATCAGCCAGCACCCCCATCAACCATGCATATGCTGAGATCTCAGAGGGTGCTTACTGAGGTACAAACAACTGAATGCAATTCCTCAGAAGATTCCACAACACCGTCAAGGTTTTCTGGTGGCTCTTTAGGACAGCAGGCAGGAGCTTTTAGAAATGTTAATTTCCTCCCTGCAGATTACAGAAGTTCCCTTTGTTCAAAGCGTACGAAAAATATGCAACCTGGTGATGCAGGAGGTGTTGTGAAGTACCTGCAGAGCATGCAGCTAAACAATCCGTCTTTCTTTTATGCTGTCCAGCTTGATGAGGATGACAAACTGACCAACATTTTCTGGGCCGATTCCAAATCTAGAGTTGATTTCAGCTACTTCAGTGACGTGGTTTGTTTGGACACAACCTACAAGATAAATGCACATGGAAGGCCATTAACTCTCTTCCTTGGAGTGAATCATCACAAGCAAATCTCCATATTTGGTGCTGCTTTGCTTTATGATGAATCAGTGGAATCGTTCAAGTGGTTGTTTGACACGTTCAAGATTGCTACAGATGGAAAGCAGCCAAAAACAATCTTGACAGATCAATCAATTGCAGCAAGTGCTGCCATAAGCGCAGTATGGCCAAGTACAATTCACTGTCTTTGCCCATGGCAAGTGTACCAAAACACTGTCAAACACCTTAATCACATCTTCCAAGGCTCTAAAACATTTGCAAAGGATTTCAGCAGATGTGTTTATGATTATGAGGATGAAGAGGGTTTCTTGGTAGGATGGAGAACCATGCTAGAGAAGTATGATCTAAGAAACAATGAATGGCTTCATAAGTTATTCGAAGATCGAGATAAATGGGCATCAGCGTACAATCGACATGTATTCACCGCAGATATAAATGGTTCATTGCAGTTAGAGTGTGTTAGCAATGTCTTGAGAAAGTACTTGAGTCCACAGTTTGATTTTTTGTCTTTCTTCAAGCACTATGAAAGAGTGTTGGATGAGCATCGCTATGCAGAGCTACAAGCTGATTTTCATGCAAGCCAAAGCTTCCCGAGAATACCTCCCTCGAAAATGCTGAAACAAGCTGCCAACATATACACACCTGTGGTTTTTGAAAAATTTCGCAGAGAGTTTGAGATGTTTGTGGATTCAGTGATCTACAGTTGTGGGGAGTCGGGAACTGCATCGGACTATAGAGTAGCAGTAACAGATAGACCTGGGGAACACTATGTTAGGTTTGACTCCAGTGACTTATCTGTAGCTTGCAGTTGTAAAAAAATCGAATCAATGGGTATCCAGTGTTGCCATGTGCTGAAAGTCCTTGATTTCAGAAATATAAAGGAGTTGCCACAAAAATATTTGATGAGAAGATGGACGAAGGATGCAAAGTCTGCAGACAGAGGCAATCAAGAGTTCTTGAGTGATGGCGCTTTGCAAACTCCAAGCTCTCGTTTAAATGTCCCTGTGCCAATTATAAATCAACCACAATCACACTTAAATAATGACCATGACCATGCCGCTTCTGTTTCTAGCTTCGGCCACCAAGCCCTTCAGGGAAATGCCAATGGAAACCAGGCAGAATCATGTCAAGCACGGCGCATTCCTCACGTCACCGCACACGAAACAGTCAAATCGCAGCGCAAATGTACCGTTGCTGATCCTGGCTCAGGCACCTGGCAGTAG