compute_text_file_numeric_summary(textFile: character(n)) -> numeric(6): takes a text file as its input and outputs a numeric vector of length 6 with the following characteristics of the text file:
1. the number of lines (from the signature we know this is just n)
2. the number of blank lines (i.e. lines that contain nothing or only whitespace)
3. the number of lines that are comments (i.e. lines that start with "#")
4. the total number of characters in the text file
5. the median line length (i.e. the median number of characters per line)
6. the max line length (i.e. the max number of characters in a line)

compute_text_file_word_counts(textFile: character(n)) -> data.frame (k x 2): takes a text file as its input and outputs a data frame with k rows and 2 columns, where k is the number of distinct "words". Here "words" include English words, variable names, function names, or any string that starts with a letter and contains only alphanumerics, periods, and underscores. The first column consists of the distinct words and the second column gives the frequency with which each word appears in the text file. The columns should be named Word and Count, and the rows should be sorted by frequency in descending order.
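The two specifications above can be sketched in code. The exercise uses R-style types (character, numeric, data.frame), so the Python below is only an illustration of the logic, not the expected submission. Several choices are assumptions rather than part of the problem statement: the input is taken to be a list of strings with one entry per line, the six results are returned as a list of floats, the Word/Count table is represented as a list of (word, count) pairs instead of a data frame, ties in frequency are broken alphabetically, and a comment is any line whose first character is "#" (no leading whitespace allowed).

```python
import re
from collections import Counter
from statistics import median

def compute_text_file_numeric_summary(text_file):
    """Return the six summary values for a list of lines (one string per line)."""
    lengths = [len(line) for line in text_file]
    return [
        float(len(text_file)),                                      # 1. number of lines
        float(sum(1 for line in text_file if line.strip() == "")),  # 2. blank lines
        float(sum(1 for line in text_file if line.startswith("#"))),  # 3. comment lines
        float(sum(lengths)),                                        # 4. total characters
        float(median(lengths)) if lengths else 0.0,                 # 5. median line length
        float(max(lengths, default=0)),                             # 6. max line length
    ]

# A "word" starts with a letter and continues with letters, digits, periods,
# or underscores. The lookbehind keeps a match from starting in the middle of
# a token such as "2abc". Following the rule literally, a trailing period
# (as in "file.") stays attached to the word.
WORD_PATTERN = re.compile(r"(?<![A-Za-z0-9._])[A-Za-z][A-Za-z0-9._]*")

def compute_text_file_word_counts(text_file):
    """Return (word, count) pairs sorted by count, highest first."""
    counts = Counter()
    for line in text_file:
        counts.update(WORD_PATTERN.findall(line))
    # Assumption: ties are ordered alphabetically so the result is deterministic.
    return sorted(counts.items(), key=lambda pair: (-pair[1], pair[0]))

if __name__ == "__main__":
    lines = ["# a comment", "", "x <- foo.bar(x_1)", "   ", "x"]
    print(compute_text_file_numeric_summary(lines))
    for word, count in compute_text_file_word_counts(lines):
        print(word, count)
```

In R, the same steps map onto built-in vectorized functions such as nchar, median, and table, with the final result assembled by data.frame.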