Using Designed Experiments for Cake Mix
docx
keyboard_arrow_up
School
Lawrence Technological University *
*We aren’t endorsed by this school
Course
6753
Subject
Industrial Engineering
Date
Dec 6, 2023
Type
docx
Pages
11
Uploaded by BrigadierClover826
Using Designed Experiments (DOE) to Minimize
Moisture Loss
Marilyn Wheatley
20 February, 2017
11
11 Comments
As a person who loves baking (and eating) cakes, I find it bothersome to go through all the effort of
baking a cake when the end result is too dry for my taste. For that reason, I decided to use a
designed experiment in Minitab to help me reduce the moisture loss in baked chocolate cakes, and
find the optimal settings of my input factors to produce a moist baked chocolate cake. I’ll share the
details of the design and the results in this post.
Choosing Input Factors for the Designed
Experiment
Because I like to use premixed chocolate cake mixes, I decided to use two of my favorite cake mix
brands for the experiment. For the purpose of this post, I’ll call the brands A and B. Thinking about
what could impact the loss of moisture, it is likely that the baking time and the oven temperature will
affect the results. Therefore, the factors or inputs that I decided to use for the experiment are:
1.
Cake mix brand: A or B (categorical data)
2.
Oven temperature: 350 or 380 degrees Fahrenheit (continuous data)
3.
Baking time: 38 or 46 minutes (continuous data)
Measuring the Response
Next, I needed a way to measure the moisture loss. For this experiment, I used an electronic food
scale to weigh each cake (in the same baking pan) before and after baking, and then used those
weights in conjunction with the formula below to calculate the percent of moisture lost for each cake:
% Moisture Loss = 100 x initial weight – final weight
initial weight
Designing the Experiment
For this experiment, I decided to construct a 2
3
full factorial design with
center points
to detect any
possible curvature in the response surface. Since the cake mix brand is categorical and therefore
has no center point between brand A and brand B, the number of center points will be doubled for
that factor. Because of this, I’d have to bake 10 cakes which, even for me, is too many in a single
day. Therefore, I decided to run the experiment over two days. Because differences between the
days on which the data was collected could potentially introduce additional variation, I decided to add
a block to the design to account for any potential variation due to the day.
To create my design in Minitab, I use
Stat
>
DOE
>
Factorial
>
Create Factorial Design
:
Minitab makes it easy to enter the details of the design. First, I selected 3 as the number of factors:
Next, I clicked on the
Designs
button above. In the Designs window, I can tell Minitab what type of
design I’d like to use with my 3 factors:
In the window above, I’ve selected a full 2
3
design, and also added 2 blocks (to account for variation
between days), and 1 center point per block. After making the selections and clicking
OK
in the
above window, I clicked on the
Factors
button in the main window to enter the details about each of
my factors:
Because center points are doubled for categorical factors, and because this design has two blocks,
the final design will have a total of 4 center points. After clicking
OK
in the window above, I ended
up with the design shown below with 12 runs:
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Performing the Experiment and Analyzing the
Data
After spending an entire weekend baking cakes and calculating the moisture loss for each one, I
entered the data into Minitab for the analysis. I also brought in a lot of cake to share with my
colleagues at Minitab!
With the moisture loss for each of my 12 cakes recorded in column C8 in the experiment worksheet,
I’m ready to analyze the results.
In Minitab, I used
Stat
>
DOE
>
Factorial
>
Analyze Factorial Design...
and then entered
the Moisture Loss column in the
Responses
field:
In the window above, I also clicked on
Terms
to make sure I’m only including the main effects and
two-way interactions. After clicking
OK
in each window, Minitab produced a Pareto chart of the
standardized effects that I could use to reduce my model:
I can see from the above graph that the main effects (A, B and C) all significantly impact the moisture
of the cake, since the bars that represent those terms on the graph extend beyond the red vertical
reference line. All of the two-way interactions (AB, AC and BC) are not significant.
I can also see the same information in the ANOVA table in Minitab’s session window:
In the above ANOVA table, we can see that the cake mix brand, oven temp, and baking time are all
significant since their p-values are lower than my alpha of 0.05.
We can also see that all of the 2-way interactions have p-values higher than 0.05, so I’ll conclude
that those interactions are not significant and should be removed from the model.
Interestingly, the p-value for the
blocks
is
significant (with a p-value of 0.01). This indicates that
there was indeed a difference between the two days in which the data was collected which impacted
the results. I'm glad I accounted for that additional variation by including a block in my design!
Analyzing the Reduced Model
To analyze my reduced model, I can go back to
Stat
>
DOE
>
Factorial
>
Analyze Factorial
Design
. This time when I click the
Terms
button I’ll keep only the main effects, and remove the
two-way interactions. Minitab displays the following ANOVA table for the reduced model:
The table shows that all the terms I’ve included (mix brand, oven temp, and baking time) are
significant since all the p-values for these terms are lower than 0.05. We can also see that the
test
for curvature based on the center points
is not significant (p-value = 0.587), so we can conclude that
the relationship between the three factors and moisture loss is linear.
The
r-squared
, r-squared adjusted, and r-squared predicted are all quite high, so this model seems to
be a very good fit to the data.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Checking the Residuals
Now I can take a look at the
residual plots to make sure all the model assumptions
for my model
have been met:
The residuals in the graph above appear to be normally distributed. The residuals versus fits graph
appears to show the points are randomly scattered above and below 0 (which indicates constant
variance), and the residuals versus order graph doesn’t suggest any patterns that could be due to
the order in which the data was collected.
Now that I'm confident the assumptions for the model have been met, I’ll use this model to determine
the optimal settings of my factors so that going forward
all
the cakes I make will be moist and
fabulous!
Optimizing the Response
I can use Minitab’s
Response Optimizer
and my model to tell me exactly what combination of
cake mix brand, oven temperature, and baking time I’ll want to use to get the moistest cake. I
select
Stat
>
DOE
>
Factorial
>
Response Optimizer
:
In the above window, I can tell Minitab what my goal is. In this case, I want to know what input
settings to use so that the moisture loss will be
minimized
. Therefore, I choose
Minimize
above
and then click
OK
:
In the above graph, the optimal settings for my factors are marked in red near the top. Using the
model that I’ve fit to my data, Minitab is telling me that I can use
Brand B
with an oven temperature
of
350
and a baking time of
38
minutes to minimize the moisture loss. Using those values for the
inputs, I can expect the moisture loss will be approximately
3.3034
, which is quite low compared to
the moisture loss for the cakes collected as part of the experiment.
Success! Now I can use these optimal settings, and I’ll never waste my time baking a dry cake
again.
Thanks for sharing and very intuitive case study.My question is since block is significant
(day to day variation),how should I quantify the effect of block to the total variation?What
happened when the R-Sq(adj) model is moderate,and block is significantly explained the
variation of Moisture Loss?Please advise.
Thanks for reading the blog! By definition, the block is a categorical variable
that explains variation in the response (the moisture loss in this case) that is
not caused by the factors in the study. By including the block, we are
accounting for any possible variation due to the day in the model, but the
day on which the data was collected is not a factor in the study. If we
wanted to learn more about the day on which the data was collected, we'd
need to create a design where the day is a factor in the study. In the
example above, the R-squared adjusted is quite high, and the block was left
in the final model since its effect is significant.
I hope this helps!
Marilyn
Many thanks, I also would like to know how to establish Regression Equation in Uncoded
Units include blocks and the center point
Thanks for reading the blog! As part of the output, Minitab provides the
regression equation in uncoded units in the session window. This equation
is used for predictions in the response optimizer. If you keep both the blocks
and the center point in the model, Minitab displays the following information
in the session window:
Regression Equation in Uncoded Units
Moisture Loss = -13.171 - 0.1287 Mix Brand + 0.03271 Temp + 0.13568
Baking Time
+ 0.0339 Ct Pt
Equation averaged over blocks.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
The 'Ct Pt' in the equation is the center point. As the message above
suggests, the equation is averaged over the blocks: Essentially we are
accounting for any variation due to the day on which the data was collected,
but that is not in the equation itself since the block was not one of the
factors in the study.
Thanks so much, I still did not understand, if I use the
Regression Equation in Uncoded Units but Mix Brand and Ct
Pt are not a numeric factors, will I do?
Hi Phuc,
Thanks for your reply. The equation is
averaged over the blocks for you, so there is
nothing else that needs to be done with the
block and the equation. The center point is for
and indicator/dummy variable. If you're
including the center point in the equation use a
1 as the value to multiply by the coefficient, and
the coefficient will be 0.0339 in the example
above. If you don't want to use the center point
use 0 instead to multiply by the coefficient.
Thanks,
Marilyn
I can't help to ask if you have actually tried to bake the "optimized" cake and how close did
you come to the predicted value?
Hi Miguel!
Thanks so much for reading! Yes... confirmation runs! Believe it or not, I
was pretty caked-out after baking so many cakes, so my new obsession
has shifted to donuts (for now). But you bring up a good point. At your
suggestion, I think I'll bake a few more cakes at the optimal factor settings
and then write another post about the capability of the process. Stay tuned,
and thank you again for reading!
Marilyn
Hi Hira- thanks for reading the blog! You can create your PB design in
Minitab by using this menu path: Stat > DOE > Factorial > Create Factorial
Design. Choose Plackett-Burman design and select 9 as the number of
factors from the drop-down list. Then, you can click on the 'Designs' button
to see the options for the number of runs, replicates, center points, etc. After
you collect the data, you can use Stat > DOE > Factorial > Analyze Factorial
Design for the analysis. If you have questions or get stuck, feel free to
contact our tech support team at 814-231-2682. Good luck!