ECE_260A_FALL_2023_LAB3
docx
keyboard_arrow_up
School
University of California, San Diego *
*We aren’t endorsed by this school
Course
260
Subject
Electrical Engineering
Date
Jan 9, 2024
Type
docx
Pages
8
Uploaded by MinisterScienceCapybara33
ECE 260A – Lab 3 4-tap 16-bit unsigned averaging FIR filter
Due: 2023/12/15 @ 11:59pm
Please make sure that you close existing terminals and open a fresh terminal and execute “ee260afa21”. This will set path to the Design Compiler tool
(Group Project – max of 3 per group)
Project Overview The objective is first to design a FIR filter, at register-transfer-level abstraction (in System Verilog) and then synthesize from RTL to a gate-level netlist (in 65nm CMOS). This filter is a
4-tap unsigned magnitude FIR filter with registered I/O, that needs to be
optimized for fast performance. Here, 4-tap indicates that the filter averages the input signal from four different time stamps. One way to implement this is to use 4 flops to capture the signal at different time steps. The following figure shows an illustration of N-tapped delay line with rectangular boxes representing flops. In our project, we discard the first input before the first delay element and start from the output of the first flop. So, we will have 4 flops and the output of flops is used for averaging (coefficient multiplication and addition operations). For simplicity, assume that all the coefficients h[k] to be 1. This makes it a much simpler problem, by not worrying about the intermediate multipliers, represented by X. With the above assumption of unit coefficients, the key component in this circuit is design “adder” combinational logic and synthesize this Verilog to a gate-level netlist. ECE 260A
Page 1
of 8
Lab 3
1. Design your added logic in System Verilog. Use the testbench provided, to validate the functionality in Modelsim, Questa or any other simulator that you are comfortable with. Free FPGA simulation (Mentor Questa) and synthesizer (Mentor Precision) tool access is also available at EDAPlayground.com. 2a. Synthesize from RTL to a lower-abstraction (gate-level) using “Design-Compiler (DC)” tool. This is the start of physical design flow. 2b. Alternatively, synthezie from RTL to an FPGA implementation. I suggest using Mentor Precision on the EDAPlaygorund, but Xilinx Vivado and Intel Quartus (freeware student version, if you like) are also good. We can implement the four-input adder logic in several ways. One possible way is to use a ripple carry adder. Once you design the adder logic in system verilog, you should synthesize the circuit to a gate-level
in 65nm CMOS process or an FPGA of your choosing. The output of synthesis is a gate-level netlist where the components are standard cells such as NAND, NOR, MUX, Full Adders etc.
This project is a great opportunity to design a complex circuit at RTL abstraction and synthesize to gate-
level netlist, that will solidify your knowledge of arithmetic architecture and circuit design and your ability to creatively design circuits that have better performance, (that could manifest as increase in power
and area) than the reference design. By using the state-of-the art compiler from Synopsys (“Design Compiler”, also referred as DC), it also helps in connecting your understanding of a schematic to actual gates that get placed and routed on the silicon. This is a good starting point for ECE 260B (physical design), that starts from a gate-level netlist, and takes you through the physical design flow.
Design Criteria
The adder shall have clk and reset inputs, along with two's comp. data input A[15:0] and output S[17:0]. The output is driven by the Q outputs of positive-edge triggered flip-flops, and the input drives the D inputs of positive-edge triggered flip-flops. We provide a Verilog testbench for those wishing to try a hand at logic simulation/verification, but the main objective will be to compare a few candidate architectures by synthesizing them using Design Compiler or an FPGA synthesizer. The critical timing paths are the flop-to-flop paths, between the flops in the input stream and flop generating sum. So, if the adder combinational logic involves heavy series circuitry, the tool tries to meet the given clock frequency by using low-vt and high drive-strength cells. This results in increase of power or area, to meet the given (fixed) frequency. If you try to parallelize the adder logic, by reducing the cascading nature of the adder, this increases the amount of combinational logic between these flops. Eventually, this leads to an increase in power and/or area, but a potentially significant reduction in the delay of the timing paths. A faster design that you propose should try to avoid making significant trade-offs with power consumption or layout area. Proposed designs that have low power consumption and a lesser area at a given clock period of 0.5ns will be awarded more points than designs that have high power consumption and large area. ECE 260A
Page 2
of 8
Lab 3
You are free to try CSAs, as in W&H Fig. 11.42(c), a three-adder "tree" architecture, or any other topology that gets the job done, with improvements from reference architecture in terms of performance, power and area. Project Report Requirements
The report should include your names and student ID numbers. Please submit this “report” to Canvas by the due date noted above. “Only one” partner should submit a copy – multiple submissions for the same group could lead up to -10 points. Also, mention contribution of each group member (one line per group member). The total project report should not exceed 5 pages.
Reporting Contents
The final project report can consist of figures with text annotation. Suggestions on topics to cover in your report are:
Briefly describe (max of 2 pages) your chosen architecture of “adder” or architectures that your tried and did not work, and why you chose the final single “proposed” architecture. o
If something did not work as you expected, explain why
o
Provide suggestions for future iterations of your design
o
Figure that might help explain your design and/or results.
Results:
o
Table of power, performance and area comparison (w.r.t reference)
o
Conclusions based on the results
o
Pointer on the server to your proposed System Verilog file and Design Compiler power, timing and qor reports.
A sample table of results is shown below. Feel free to expand if necessary. ECE 260A
Page 3
of 8
Lab 3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Metrics
Ripple Carry Adder
(Reference)
Proposed Architecture
Worst negative slack in ps (setup analysis)
Total power consumed Leakage power
Area of combinational logic
# of combinational cells
Total area
Evaluation
You will be evaluated on the quality of your final report and design choice reasoning (50%), and experimental results (40%). Quality of results; power-performance and area improvements from the vanilla ripple carry adder) (10%). Running the flow:
-
You are provided a reference System Verilog file (naive implementation using ripple carry adder)
o
/home/linux/ieng6/ee260afa23/public/lab3_setup/
run_dc_setup_reference/rtl/fir4rca.sv
-
We are using 65nm CMOS process w/ DC. -
Clock frequency of 2 GHz -
Please open the following input files (to Design Compiler) to understand the contents of these files. Do not change the content of these files. o
/home/linux/ieng6/ee260afa23/public/lab3_setup/run_dc_setup_reference/run_dc.tcl
is
the setup file for executing Design Compiler
o
/home/linux/ieng6/ee260afa23/public/lab3_setup/run_dc_setup_reference/*sdc
has the constraints for your design (including clock period and I/O constraints)
o
/home/linux/ieng6/ee260afa23/public/lab3_setup/run_dc_setup_reference/rtl/*sv
is the
system verilog file that is an input to Design Compiler tool.
ECE 260A
Page 4
of 8
Lab 3
-
Running Design Compiler on reference design
o
mkdir <your course specific home dir>/lab3
o
cd <your course specific home dir>/lab3
o
cp -r /home/linux/ieng6/ee260afa23/public/lab3_setup/run_dc_setup_
reference/ .
o
cd run_dc_setup_reference
o
dc_shell -f run_dc.tcl // Command to launch DESIGN COMPILER
report_power > power.rpt
report_qor > qor.rpt
report_timing > timing.rpt
Sample screenshot for timing report
o
Go through these reports and get a feel of your gate-level circuit, and make use of the shell to query for various attribute of your design
o
Modify the Path in “run_dc.tcl” File
Change the “ee260afa19” part to “ee260afa23” in rtlPath and target library before launching DESIGN COMPILER
ECE 260A
Page 5
of 8
Lab 3
-
Running Design Compiler on proposed design
o
cd <your course specific home dir>/lab3
o
mkdir dc_proposed_design
o
cp -r <>/run_dc_setup_reference/* . // Copying the contents including rtl/ directory
o
cd dc_proposed_design/rtl o
change the system Verilog file (specified as to what part of
the file needs to be modified)
o
Update run_dc.tcl in your new run dir, with the correct rtl path to your new location
set rtlPath "<PATH TO YOUR RTL FILES>" o
Change the “ee260afa19” part to “ee260afa23” in target library before launching DESIGN COMPILER
o
Do not change any other file (content or name).
o
cd ..
o
dc_shell -f run_dc.tcl // this needs to be run in the dir where run_dc.tcl is present
report_power > power_new.rpt
report_qor > qor_new.rpt
report_timing > timing_new.rpt
o
Open these reports and observe the changes from your reference implementation. For any command, you execute “-
help” or “man” on dc_shell, to know more about the command options. o
Eg:
dc_shell>report_timing -help ECE 260A
Page 6
of 8
Lab 3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
dc_shell>man report_power
-
Useful link for Linux operation
https://www.javatpoint.com/linux-edit-file
Appendix:
The following timing path indicates a setup violation (negative sign) of 100ps. This is generated by report_timing command executed on dc_shell. By default, only the worst timing path is reported, and unit of time is nanoseconds. Just visualize the number of flop-4flop timing paths exist in your design, by exploring “-slack_greater_than” and “max_paths” options of “report_timing” command. report_qor and report_power commands are self-explanatory. Visualizing generated gate-level netlist:
1.
Output gate-level netlist is generated in verilog format (<design name>. out.v)
2.
To view schematic, execute “start_gui” on dc_shell
Click on the rectangular module
ECE 260A
Page 7
of 8
Lab 3
Zoom-in and out using mouse and move scroll using arrow keys
-
Optional: You can verify functionality
of system Verilog, using the test bench that we provided using modelsim. Remember to set the ‘fclk’ term in your testbench. When doing this, note that to HSPICE, ‘100M’ means 100 mHz, not 100 MHz. To use mega, make sure to type ‘100MEGA’.
o
/home/linux/ieng6/ee260afa23/public/lab3_setup/test_bench/ fir4.tb
ECE 260A
Page 8
of 8
Lab 3
Related Documents
Related Questions
c. Sketch the zero-pole plot and the direct form II diagram of the completed design out
of part b.
d. Calculate and sketch the output sequence after feeding x [n] into this system.
arrow_forward
Develop the Q output waveforms for a 74HC190
up/down counter with the input waveforms
shown in figure below. A binary 0 is on the data
inputs. Start with a count of 0000.
CLK
CTEN
LOAD
arrow_forward
Please Follow Question
List the outputs of Encoder at BCD indicators
for each of the eight input pulses shown in
Figure; (((Remember the activates of HIGH (1)
that has a LOW (0) input.))):
BCD output indicators
8s
4s
2s 1s
D
Encoder
3
B
6.
48
d9
(74147)
1 Add file
arrow_forward
6.q4
arrow_forward
2a. Assume a digital to analog conversion system uses a 10-bit integerto represent an analog temperature over a range of -25oF to 125oIf the actual temperature being read was 65.325oF, what would bethe closest possible value that the system could represent?
2b. What is the minimum sampling rate needed in order to successfullycapture frequencies up to 155 KHz in an analog signal?
arrow_forward
1. Consider the CRC generator shown below. Determine the output of the CRC circuit (i.e. Q4 Q3
Q2 Q1 Q0, expressed as a decimal number) for the input sequence "1010" (input one bit at a
time, left to right). Assume the CRC circuit is initialized to state 11111.
D
Q0
Q2
Q4
Q1
Q3
Clock -
Data In
arrow_forward
(Analog to Digital signal conversion problem)
Continuous voltage is converted to discrete signal:
Range of signal after amplification= 0V to 5 V
A/D converter = 6 bits
# Quantization levels = 26 = 64
Question: Assume the input signal is 3.5 V. Use the successive approximation method to encode the signal for this 6-bit converter. Please write both the decimal value and binary value for the code
arrow_forward
How could a binary-weighted DAC operate with BCD input? Explain.
arrow_forward
Develop the steps of Analog Digital Conversion (ADC) of the following analog signal, considering the signal sampling analog (gray square values), the quantization of the signal itself, the encoding of the quantization result in binary code and the resulting pulse train.
arrow_forward
a) Determine the delay (in terms of d) for each of the following 128-bit CLA designs when
fan-in limit (f) is set to cach of the following:
i) f = 4
ii) f= 6
arrow_forward
2. Design a 6 bit digital to analog converter with binary weighted resistive ladder network so that the analog output for the bit sequence 100100 is -10V.
arrow_forward
A 9-bit asynchronous counter has a 128-kHz clock signal applied.
(1) What is the MOD number of this counter?
MOD number =
(ii) What will be the frequency at the MSB output?
fmsb =
(iii)
Assume that the counter starts at zero. What will be the count after 635 input pulses?
After 635 input pulse, Count =
arrow_forward
A 12-bit linear PCM code is digitally compressed into eight bits. The resolution is equal to 0.03 V. Determine the following for an analog input voltage of 1.449 V
a. The 12-bit linear PCM code is
b. The 8-bit compressed code is
c. The decoded 12-bit code is
d. The percentage error is
arrow_forward
Exampic
Bandwidth of the input to pulse code modulator is restricted to 4 kHz.
The input varies from -3.8V to 3.8 V and has the average power of 30 mW, the required
signal to quantization noise power ratio is 20 dB. The modulator produces binary output.
Assume uniform quantization Calculate the number of bits required per sample.
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you
data:image/s3,"s3://crabby-images/aa23b/aa23b915856c09ebc75d48cc8f33abe144abbf32" alt="Text book image"
Delmar's Standard Textbook Of Electricity
Electrical Engineering
ISBN:9781337900348
Author:Stephen L. Herman
Publisher:Cengage Learning
Related Questions
- c. Sketch the zero-pole plot and the direct form II diagram of the completed design out of part b. d. Calculate and sketch the output sequence after feeding x [n] into this system.arrow_forwardDevelop the Q output waveforms for a 74HC190 up/down counter with the input waveforms shown in figure below. A binary 0 is on the data inputs. Start with a count of 0000. CLK CTEN LOADarrow_forwardPlease Follow Question List the outputs of Encoder at BCD indicators for each of the eight input pulses shown in Figure; (((Remember the activates of HIGH (1) that has a LOW (0) input.))): BCD output indicators 8s 4s 2s 1s D Encoder 3 B 6. 48 d9 (74147) 1 Add filearrow_forward
- 6.q4arrow_forward2a. Assume a digital to analog conversion system uses a 10-bit integerto represent an analog temperature over a range of -25oF to 125oIf the actual temperature being read was 65.325oF, what would bethe closest possible value that the system could represent? 2b. What is the minimum sampling rate needed in order to successfullycapture frequencies up to 155 KHz in an analog signal?arrow_forward1. Consider the CRC generator shown below. Determine the output of the CRC circuit (i.e. Q4 Q3 Q2 Q1 Q0, expressed as a decimal number) for the input sequence "1010" (input one bit at a time, left to right). Assume the CRC circuit is initialized to state 11111. D Q0 Q2 Q4 Q1 Q3 Clock - Data Inarrow_forward
- (Analog to Digital signal conversion problem) Continuous voltage is converted to discrete signal: Range of signal after amplification= 0V to 5 V A/D converter = 6 bits # Quantization levels = 26 = 64 Question: Assume the input signal is 3.5 V. Use the successive approximation method to encode the signal for this 6-bit converter. Please write both the decimal value and binary value for the codearrow_forwardHow could a binary-weighted DAC operate with BCD input? Explain.arrow_forwardDevelop the steps of Analog Digital Conversion (ADC) of the following analog signal, considering the signal sampling analog (gray square values), the quantization of the signal itself, the encoding of the quantization result in binary code and the resulting pulse train.arrow_forward
- a) Determine the delay (in terms of d) for each of the following 128-bit CLA designs when fan-in limit (f) is set to cach of the following: i) f = 4 ii) f= 6arrow_forward2. Design a 6 bit digital to analog converter with binary weighted resistive ladder network so that the analog output for the bit sequence 100100 is -10V.arrow_forwardA 9-bit asynchronous counter has a 128-kHz clock signal applied. (1) What is the MOD number of this counter? MOD number = (ii) What will be the frequency at the MSB output? fmsb = (iii) Assume that the counter starts at zero. What will be the count after 635 input pulses? After 635 input pulse, Count =arrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- Delmar's Standard Textbook Of ElectricityElectrical EngineeringISBN:9781337900348Author:Stephen L. HermanPublisher:Cengage Learning
data:image/s3,"s3://crabby-images/aa23b/aa23b915856c09ebc75d48cc8f33abe144abbf32" alt="Text book image"
Delmar's Standard Textbook Of Electricity
Electrical Engineering
ISBN:9781337900348
Author:Stephen L. Herman
Publisher:Cengage Learning