In Python: Build a function that takes a record from your FASTA file as an argument, and returns a count of each amino acid coded for by the codons of the sequence. Keep in mind that because these records are not necessarily in the proper reading frame, so the user should be prompted to select a reading frame (0, +1, +2). You should put some thought into the presentation of your results by your main function, making sure they are clear and readable.
In Python:
Build a function that takes a record from your FASTA file as an argument, and returns a count of each amino acid coded for by the codons of the sequence. Keep in mind that because these records are not necessarily in the proper reading frame, so the user should be prompted to select a reading frame (0, +1, +2). You should put some thought into the presentation of your results by your main function, making sure they are clear and readable.
The fasta file contains the below:
>MD10G1276500 pacid=40089867 polypeptide=MD10G1276500 locus=MD10G1276500 ID=MD10G1276500.v1.1.491 annot-version=v1.1
ATGATGCAGTCCGTGGCTCCTGTGTGCAATGTCTGCGGCGAGCAGGTGGGGCTTGGTGCCAATGGGGAGGTTTTCGTGGC
ATGCCACGAGTGTAATTTCCCCATTTGCAAGGCTTGTTTCGATGAAGATGTCAAGGCTGGGCGTAAAGTTTGCTTGCAGT
GTGGTATTCCCTATGACGATAACCCGTTGGCGGAGTATGAAACAAAGGTGTCAGGCACTCGATCCACAATGGAAGCTCAC
CTGAATAATACACAGGATACAGGAATTCATGCTAGGCATATCAGCAGTGTGTCTACGTTGGATAGTGAATTAAACGATGA
ATCTGGCAATCCGATTTGGAAGAATAGAGTGGAAAGTTGGAAGGATAAGAAGGATAAGAAGGATAAAAAGATCAAGAAGA
AAAAGGATACACCTAATGGGGAAAAAGAGGCTCAAATTCCACCTGAGAAGCAGATGACAGAGGAATATTCATCAGAGGCT
GCGGAACCACTTTCAACTCTCGTCCCACTTCCATCTAACAGAATCACACCATACAGAACTGTTATAATTATGCGATTGAT
CATTCTCGCCCTTTTCTTCCATTATCGAGTAACAAATCCTGTTGATAGTGCTTACGGTCTATGGTTCACTTCGATCATAT
GTGAGATCTGGTTTGCTTTTTCTTGGGTGTTGGATCAGTTTCCTAAGTGGTCTCCAGTTAATCGGACTACATTTACTGAC
AGGTTATCTGCCAGGTTTGAAAGAGAGGGTGAACTCTCCGAGCTTGCTGCTGTGGATTTCTTCGTAAGTACAGTTGATCC
GTTGAAAGAACCGCCCTTGATTACTGCCAATACCGTGCTTTCTATCCTTGCTGTAGACTACCCTGTGGACAAAGTTTCCT
GCTATGTGTCTGATGATGGTGCTGCCATGCTTACATTTGAATCCCTTGCCGAAACATCTGAATTTGCAACAAAGTGGGTT
CCTTTCTGCAAGAAATTTTCAATTGAACCACGTGCACCTGAGTTTTACTTCTCACAAAAGATTGACTACTTGAAGGATAA
AGTGCAACCATCTTTTGTGAAGGAGCGCAGAGCGATGAAAAGAGATTATGAAGAGTTCAAAGTGCGAATGAATGCTTTAG
TAGCAAAGGCTCAAAAAACACCAGAAGAAGGATGGACTATGCAAGATGGAACTCCATGGCCAGGAAATAACTCGCGTGAC
CATCCTGGGATGATCCAGGTGTTCCTTGGACATAGCGGTGCCTATGACATCGAGGGAAATGAACTTCCTCGATTGGTTTA
TGTCTCGAGAGAGAAGAGACCCGGCTACCCACATCACAAGAAAGCTGGTGCTGAAAATGCTTTGGTAAGGGTGTCTGCAG
TTCTCACAAATGCCCCATACATCCTCAATCTTGACTGTGATCACTACGTTAACAACAGCCAGGCAATTCGTGAGGCAATG
TGTTTCTTGATGGACCCTCAAGTCGGTCGAGAAGTATGCTATGTGCAGTTTCCTCAGAGGTTTGATGGTATTGATCGCAG
TGATCGATATGCTAATCGCAACACAGTTTTCTTTGATGTTAACATGAAAGGACTGGATGGCATTCAAGGTCCAGTATATG
TGGGGACAGGATGTTGTTTCAACAGGCAAGCACTTTACGGCTACGGTCCTCCTTCTATGCCCGCCTTATCCAAGGCTGCT
TCCTCATCCTCCTGCTCTTGTTGCTGTCCCTCTAAGAAGCCCTCTAAAGATGTGTCAGAGGCTTATCGAGATGCAAAACA
GGAGGAGCTTGATGCTGCCATTTTTAACCTCCGTGATATTGAGAATTATGATGAGCTTGAGAGGTCAATGCTGATCTCGC
AGACAAGCTTTGAGAAAACTTTTGGATTATCGTCTGTATTCATCGAATCTACGCTAATGGAGAACGGAGGAGTGGCCGAA
TCTTCCAACCCTTCAACATTGATCAAGGAGGCGATTCACGTCATTAGCTGTGGTTATGAAGAGAAGACCGCGTGGGGAAA
AGAGATTGGTTGGATATATGGATCAATCACTGAGGATATCTTAACCGGTTTCAAGATGCATTGCCGTGGATGGAGGTCAA
TTTACTGCATGCCCTTGAGACCTGCATTCAAAGGGTCAGCTCCCATTAACCTTTCTGATCGACTGCACCAAGTTCTTCGG
TGGGCACTGGGATCGGTGGAAATTTTCCTCAGTAGACATTGTCCTCTCTGGTACGGGTTTGCAGGAGGCCGCCTCAAATT
GCTTCAGAGAATGGCATATATCAACACTATTGTTTACCCCTTCACATCCCTCCCTCTCGTCGCTTACTGCACACTCCCTG
CAATATGCCTTCTCACAGGAAAATTCATCATCCCAACACTTACAAACCTGGCAAGTGCCCTGTTTCTTGGCCTCTTCATC
TCCATCATTGCTACAAGTGTGCTTGAGTTGAGGTGGAGTGGAGTCCGCATTGAGGACTTATGGCGTAACGAGCAGTTCTG
GGTGATCGGAGGTGTTTCAGCCCATCTCTTTGCCGTCTTCCAAGGTTTCTTAAAGATGTTGGCCGGAATTGACACCAACT
TCACCGTCACAACCAAATCAGCCGAAGACACAGAATTCGGAGAGCTCTATCTGATCAAATGGACCACACTTTTGATTCCC
CCAACTACACTCCTCATCGTCAACATGGTTGGTGTTGTTGCAGGATTTTCGGACGCCCTCAACAAGGGATACGAAGCTTG
GGGGCCACTTTTCGGGAAGGTTTTCTTTGCCTTCTGGGTGATTCTTCATCTATATCCCTTCCTCAAAGGTCTCATGGGAC
GCCAAAACCGGACTCCAACCATCGTTGTTTTGTGGTCAGTGCTCTTGGCCTCTGTCTTCTCCCTTGTTTGGGTGAAGATA
AATCCATTTGTGAGCAAAGTGGACAGCTCAACGCTTGCTCAAAGCTGCATTTCCATAGACTGCTGA
>MD10G1110200 pacid=40089868 polypeptide=MD10G1110200 locus=MD10G1110200 ID=MD10G1110200.v1.1.491 annot-version=v1.1
ATGGCGTCTCTCTCCATCGCTTCACATTACTCCATTTCCTCATGGCCTCCCAAGAATAGCAACACCAGTACCACCCTCTC
ATCTTTCAACAAAACCCCATTTCGTAAAAACACCCAAAATCTCACCTTTTCGAAAAAACGCCATCATTTTCTCACGTTTT
CCATGAAAAATGACCAGGAAAACGAACCCACTTCTTCTTCGTCGCCGTCGTCGGCTGTTGCAGTTTCCGAGAAGCCGAGC
GACGGCAGTGAGACCCAGGAGAGTGAATTGCCAGCAGAAGGAGCTGACTCTGAGTCTGATGAGGAGAAAGAGAAGCAGCT
GGAGACGGATTGGAAGACGGACGAGGAGTTCAAGAAGTTCATGGGAAATCCTTCGATTGAAGCTGCAATAAAGTTGGAGA
AGAAAAGGGCAGATAGGAAACTCAAGGATCTTGATAGGGAAAGTAGTGGGAACCCATTTGCAGGGATCCTG
There are more sequences but for you to work on, I gave half of the sequences in the file!
Step by step
Solved in 5 steps