lab9

zhs04001 · Apr 12, 2024 · 7864829 · 7864829
1 parent 04c4c19
commit 7864829
Show file tree

Hide file tree

Showing 2 changed files with 224 additions and 0 deletions.
diff --git a/lab9/lab9-cache.s b/lab9/lab9-cache.s
@@ -0,0 +1,65 @@
+# CSE 3666. Cache
+        .data
+        .align 2
+# 256 words
+warray: .space 1024
+
+        # .text starts code segments
+        .text
+        .globl  main
+
+main:   
+        # s2: the stride size, in number of words
+        # Try different values, 1, 2, 4, 8, and so on
+        addi    s2, zero, 1
+
+        # s3: the number of accesses to data memory
+        # note that the lowest 12 bits are 0
+        lui     s3, 0x00003
+
+        # s0: starting address of the word array. Hard coded
+        # s1: number of elements in the array
+        lui     s0, 0x10010
+        addi    s1, zero, 256
+
+        # call read_array
+        addi    a0, s0, 0
+        addi    a1, s1, 0
+        addi    a2, s2, 0
+        addi    a3, s3, 0
+        jal     ra, read_array
+
+        # exit from the program
+exit:   addi    a7, x0, 10      
+        ecall
+
+# read_array(int a[], unsigned int n_elements, unsigned int stride_size, unsigned int n_accesses)
+read_array:
+
+        beq     a1, x0, ra_exit # array length cannot be 0
+
+        add     t1, x0, x0      # t1 is the index
+
+        beq     x0, x0, ra_test
+
+ra_loop:
+        slli    t0, t1, 2
+        add     t0, a0, t0
+        lw      x0, 0(t0)       # do not need data
+
+        # Set a breakpoint after LW
+        add     t1, t1, a2      # index += stride_size
+        bltu    t1, a1, ra_skip # out of the range?
+
+        # reset the index
+        add     t1, x0, x0
+
+ra_skip:
+        addi    a3, a3, -1
+
+ra_test:
+        bne     a3, x0, ra_loop
+
+ra_exit:
+        jalr    x0, ra, 0
+
diff --git a/lab9/lab9.md b/lab9/lab9.md
@@ -0,0 +1,159 @@
+# Cache simulator in RARS
+
+**Lab 9 is mandatory**
+
+*Firm Deadline: Submit the lab report as a PDF file in HuskyCT by the end of Wed,
+4/24/2024.*
+
+*If you work on a lab computer or UConn AnyWare Desktop, save your files to
+cloud storage like OneDrive. Otherwise, you may lose your files.*
+
+## Learning Objectives
+
+* Use the data cache simiulator in RARS.
+
+* Collect and interpret experiment data.
+
+## Description
+
+In this lab, we experiment with data cache simulator in RARS. It helps us
+understand how cache works. 
+
+### The program
+
+The RISC-V program we will use in experiment is `lab9-cache.s`. The program
+reads words from a word array `warray` for 0x3000 times, starting from
+`warray[0]`. The index is incremented by the stride size after each read. The
+default stride size is 1.  If the index is out of range, it is reset to 0. The
+pseudocode is as follows.
+
+```
+index = 0
+for n in range(number_of_reads)
+    read warray[index]
+    index += stride
+    if index >= len(warray)
+        index = 0
+```
+
+### Data cache simulator
+
+RARS has a data cache simulator. The steps to collect data from programs are as follows. 
+
+*   Open data cache simulator, by selecting "Tools/Data cache simulator" from
+    the menu.
+*   Configure the cache. We will experiment with direct mapped cache only. The
+    options we need to change are "Number of blocks" and "Cache block size (words)".
+*   Press the "Connect to Program" button to connect the simulator to the
+    program in RARS.
+*   Enable "Runtime log".
+*   Run programs in RARS as usual. We should be able to see cache events and
+    statistics in the cache simulator. The numbers will stop changing when the
+    program stops at a breakpoint or terminates.
+*   If needed, reset the cache by pressing the "Reset" button at the bottom.
+    The cache is reset if the cache configuration is changed.
+
+### Task 1. Default cache configuration
+
+We first experiment with the default cache configuration. The Number of blocks
+is 8 and the cache block size is 4 words. The cache size is 128 bytes. 
+
+1.  Find out the number of bits in block offset, cache index, and tag for the
+    default cache configuration. 
+
+2.  Assemble the program and set a breakpoint at the instruciton right after LW.
+    Write down the address of the instruction as we will need it often.
+
+3.  Start the data simulator. Check the configuration is correct. Enable "Runtime log".
+
+4.  Connect data simulator to the program.
+
+5.  Run the program. It should stop at the breakpoint (set in Step 2).
+
+6.  Observe the cache event in data cache simulator. The "Runtime log" shows the address and
+    fields in the address. Check if all fields are correct. Record the information in a table.
+
+7.  Repeat Steps 4 and 5 at least 6 times, and until you see a pattern and can predict 
+    the cache hit rate.
+
+8.  Rmove the brekpoint. Run the program until it finishes. Did you predict the correct hit rate?
+
+The following is an example of the table for recording data. Note that all
+information, except for block offset, can be found in the runtime log. Write
+block offset in decimal.
+
+| Address    | Cache index | tag        | Block offset | Hit/Miss |
+| ------     | ------      | ------     | ------       | ------   |
+| 0x10010000 | 0           | 0x00200200 | 0            | Miss     |
+|            |             |            |              |          |
+
+#### Report 
+
+Include the following in the report.
+
+*   The number of bits in each field (block offset, cache index, and tag) and
+    your explanations. 
+
+*   The table.
+
+*   Explain the cache index, tag, and block offset for the first 2 accesses. 
+
+*   Explain the hit/miss outcomes for each memory access recorded in the table.
+    Note that the table should have at least 6 entries.  
+
+*   Explain the cache hit rates you observed.
+
+### Task 2. Block size = 4 words
+
+In this set of experiment, we keep the block size as 4 words. We find out the
+hit rates when running the program with three caches configurations, each
+having a different number of cache blocks. Since the block size is the same,
+the cache size is increased as we increase the number of cache blocks. 
+
+Based on the eariler experiment, predict the hit rate for each cache
+configuration. 
+
+Run the program and record the cache hit rates and miss counts from the
+simulator. For large numbers of blocks, you may need to make the data cache
+simulator window larger, by dragging the board of the window, to see all
+information.
+
+| Number of blocks | Cache size | Hit rate | Miss count |
+| ------           | ------     | ------   | ------     |
+| 8                | 128 bytes  |          |            |
+| 16               |            |          |            |
+| 32               |            |          |            |
+| 64               |            |          |            |
+| 128              |            |          |            |
+
+One way to collect the data is to go through the following steps for each cache
+configuration.
+
+1.  Set the correct number of cache blocks in the data cache simulator. 
+2.  Reset the cache, if it is not already reset.
+3.  Make sure the simulator is connected to the program.
+4.  Assemble/Reset the program if necessary. 
+5.  Run the program and wait until the program terminates.
+6.  Record the cache hit rates and miss counsts in the table.
+
+#### Report 
+
+Include the following in the report.
+
+*   The table.
+
+*   Explanations for the cache hit rates and miss counts you observed.
+    Did you predict all the hit rates correctly? If not, what did you
+    miss? 
+
+*   If the size of `warray` is doubled, which rows in the table will
+    have different data? Explain your answer (and you can confirm with
+    the cache simulator). 
+
+## Additional task. Block size = 8 words
+
+Now we change the block size to 8 words and repeat the experiment in Taks 2.
+Did you predict all hit rates and miss counts correctly?
+
+You do not need to include this additional task in the report. However, it
+helps to check if you really understand how a direct-mapped cache works.