Examples of the two input CSV files are attached below. One of the files is the crime database and the other is the suspect database.

Specification

Your task is to write a Python program that takes three CSV file names on the command line. The first CSV file contains STR counts for the DNA samples found at a list of crime scenes; the second CSV file contains a list of suspects' names and their DNA sequences; the third CSV file name is the output file, to which you write a CSV file that maps each suspect's name to the list of crimes whose DNA matches the suspect.

Your program will take three file names on the command line:

- The first command line argument is the file name of a crime database file in CSV format. The header row looks like this:
CrimeID,STR1,STR2,STR3,...
where each STRi is a short DNA sequence composed of the DNA bases A/C/G/T. Each row consists of a CrimeID of the form CIDXXXXX followed by an integer count for each of the STRs.
- The second command line argument is the file name of a suspect DNA database file in CSV format. The header row looks like this:
Suspect,Sequence
Each row has a suspect name and the suspect's DNA sequence.
- The third command line argument is the name of the file to which the program writes the matching result, also in CSV format. The header row looks like this:
Suspect,Crimes
Each row has a suspect name and all the crime IDs where the suspect's DNA matches. The crime IDs are stored as a ','-separated string that is treated as a single CSV cell value (meaning the cell is enclosed in double quotes if the suspect has more than one matching crime ID).
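The specification above can be sketched in Python roughly as follows. This is a minimal sketch, not a definitive solution, and it rests on assumptions the question does not state: that the "count" for an STR is the longest run of back-to-back repeats of that STR in the suspect's sequence, that a suspect matches a crime only when every STR count is equal, and that suspects with no matching crime still get a row with an empty Crimes cell. The names `longest_run`, `find_matches`, and `main` are hypothetical helpers introduced for illustration.

```python
import csv
import sys


def longest_run(sequence, str_pattern):
    """Longest number of consecutive repeats of str_pattern in sequence.

    Assumption: this is what an "STR count" means; the question does not say.
    """
    if not str_pattern:
        return 0
    best = 0
    step = len(str_pattern)
    for start in range(len(sequence)):
        count = 0
        pos = start
        # Count how many copies of the STR sit back to back from this position.
        while sequence.startswith(str_pattern, pos):
            count += 1
            pos += step
        best = max(best, count)
    return best


def find_matches(crime_rows, str_names, sequence):
    """Return the CrimeIDs whose counts equal the suspect's for every STR."""
    profile = {name: longest_run(sequence, name) for name in str_names}
    return [
        row["CrimeID"]
        for row in crime_rows
        if all(int(row[name]) == profile[name] for name in str_names)
    ]


def main(crime_path, suspect_path, output_path):
    with open(crime_path, newline="") as f:
        reader = csv.DictReader(f)
        str_names = reader.fieldnames[1:]  # every column after CrimeID
        crime_rows = list(reader)

    with open(suspect_path, newline="") as f:
        suspects = list(csv.DictReader(f))

    with open(output_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["Suspect", "Crimes"])
        for suspect in suspects:
            crimes = find_matches(crime_rows, str_names, suspect["Sequence"])
            # csv.writer quotes the cell on its own when it contains a comma,
            # i.e. when more than one crime ID matches.
            writer.writerow([suspect["Suspect"], ",".join(crimes)])


# Run only when exactly three file names are given on the command line.
if __name__ == "__main__" and len(sys.argv) == 4:
    main(*sys.argv[1:4])
```

Saved under a hypothetical name such as match.py, the sketch would be invoked as `python match.py crimes.csv suspects.csv results.csv`. Joining the crime IDs into one string and leaving the quoting to `csv.writer` keeps them in a single cell without any manual escaping.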