3) ChiMerge [Ker92] is a supervised, bottom-up (i.e., merge-based) data discretization method. It relies on _2 analysis: Adjacent intervals with the least _2 values are merged together until the chosen stopping criterion satisfies. (a) Briefly describe how ChiMerge works. (b) Take the IRIS data set, obtained from the University of California-Irvine Machine Learning Repository a data set to be Data (www.ics.uci.edu/_mlearn/MLRepository.html), as discretized. Perform data discretization for each of the four numeric attributes using the ChiMerge method. (Let the stopping criteria be: max- interval D 6). You need to write a small program to do this to avoid clumsy numerical computation. Submit your simple analysis and your test results: split-points, final intervals, and the documented source program.

Database System Concepts
7th Edition
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Chapter1: Introduction
Section: Chapter Questions
Problem 1PE
icon
Related questions
Question
3)
ChiMerge [Ker92] is a supervised, bottom-up (i.e., merge-based) data
discretization method. It relies on _2 analysis: Adjacent intervals with the
least _2 values are merged together until the chosen stopping criterion
satisfies.
(a) Briefly describe how ChiMerge works.
(b) Take the IRIS data set, obtained from the University of California-Irvine
Machine
Learning
Repository
set to be
Data
(www.ics.uci.edu/_mlearn/MLRepository.html), as
discretized. Perform data discretization for each of the four numeric
a
data
attributes using the ChiMerge method. (Let the stopping criteria be: max-
interval D 6). You need to write a small program to do this to avoid clumsy
numerical computation.
Submit your simple analysis and your test results: split-points, final intervals,
and the documented source program.
Transcribed Image Text:3) ChiMerge [Ker92] is a supervised, bottom-up (i.e., merge-based) data discretization method. It relies on _2 analysis: Adjacent intervals with the least _2 values are merged together until the chosen stopping criterion satisfies. (a) Briefly describe how ChiMerge works. (b) Take the IRIS data set, obtained from the University of California-Irvine Machine Learning Repository set to be Data (www.ics.uci.edu/_mlearn/MLRepository.html), as discretized. Perform data discretization for each of the four numeric a data attributes using the ChiMerge method. (Let the stopping criteria be: max- interval D 6). You need to write a small program to do this to avoid clumsy numerical computation. Submit your simple analysis and your test results: split-points, final intervals, and the documented source program.
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 4 steps with 4 images

Blurred answer
Knowledge Booster
Complex Datatypes
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Similar questions
Recommended textbooks for you
Database System Concepts
Database System Concepts
Computer Science
ISBN:
9780078022159
Author:
Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:
McGraw-Hill Education
Starting Out with Python (4th Edition)
Starting Out with Python (4th Edition)
Computer Science
ISBN:
9780134444321
Author:
Tony Gaddis
Publisher:
PEARSON
Digital Fundamentals (11th Edition)
Digital Fundamentals (11th Edition)
Computer Science
ISBN:
9780132737968
Author:
Thomas L. Floyd
Publisher:
PEARSON
C How to Program (8th Edition)
C How to Program (8th Edition)
Computer Science
ISBN:
9780133976892
Author:
Paul J. Deitel, Harvey Deitel
Publisher:
PEARSON
Database Systems: Design, Implementation, & Manag…
Database Systems: Design, Implementation, & Manag…
Computer Science
ISBN:
9781337627900
Author:
Carlos Coronel, Steven Morris
Publisher:
Cengage Learning
Programmable Logic Controllers
Programmable Logic Controllers
Computer Science
ISBN:
9780073373843
Author:
Frank D. Petruzella
Publisher:
McGraw-Hill Education