Write
Write a program (call it categories.py) that counts how many genes are in each category (1.1, 1.2, 2.1 etc.) based on data from the chr21_genes.txt file. The program should print the results so that categories are arranged in ascending orderto an output file (call the output output OUTPUT/categories.txt . Read the paper to see what the categories represent and have this part of your output (this will be input from chr21_genes_categories.txt). Use Dictionaries to solve this problem. HINT: Feel free to use as Dictionary of Dictionaries, but it is not required.
Note: you will notice that one gene has no category information. That's due to missing data in the file, JUST IGNORE THIS GENE!.
Remember to have these command line options:
$ python3 categories.py -h usage: categories.py [-h] -i1 INFILE1 -i2 INFILE2 Combine on gene name and count the category occurrence optional arguments: -h, --help show this help message and exit -i1 INFILE1, --infile1 INFILE1 Path to the gene description file to open -i2 INFILE2, --infile2 INFILE2 Path to the gene category to open
$ python3 categories.py -i1 chr21_genes.txt -i2 chr21_genes_categories.txt
Output to the file (OUTPUT/categories.txt) from this program: Category Occurrence Description 1.1 <Occurrence Here> Genes with 100% identity over a complete cDNA with defined functional association (for example, transcription factor, Note <Occurrence Here> is a number
Trending now
This is a popular solution!
Step by step
Solved in 3 steps with 1 images