This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

How can I do read vela compiler report ?

I have some problems about his report when using ethous-U-vela.

The vela version I use is 3.0.0 .

This is an introduction to Vela's options:
review.mlplatform.org/.../OPTIONS.md

There is an example of Configuration File in the above website

I would like to ask

1.
There are burst length and latency.

Does the latency in the example refer to memoey's CAS?

Then burst length refers to reading the address once, which can read several strokes continuously.

2.
The three (Sram Only, Shared Sram, Shared Sram) modes that I want to ask the most are whether to adjust the four settings (const_mem_area, arena, cache_mem_area, arena_cache_size)
) Instead of directly setting the model?


3.
The total DRAM MB/batch in the figure below has a value.
But there is no value in the last DRAM access cycle.
What could be the problem?


4.
Finally, I would like to ask about the value of those cycles. (Picture above)
Where can I know the cycle value used by an OP?
I think these values ​​should be calculated after understanding the process of op operation.

Thank you for your answers in advance

Parents
  • @Danter

    1. Efficient burst length - This is a parameter to make Vela estimates memory performance depending on burst length.
    Say when set to 64B, for instance, any burst shorter than 64B will be considered as taking the same time as a 64B burst.
    This can be used to model a memory where short bursts are inefficient. So, 
    this is depends on the interconnect, memory of the HW and you should set burst length to the hardware burst length.

     

    2. "Info: Changing cost_mem_area from Sram to OnChipFlash.
    When you see changing const_mem_area from Sram to OnChipFlash. This will use the same characteristics as Sram.

    In the code, this happens:

                    if self.const_mem_area == MemPort.Axi0:
                        self.const_mem_area = MemPort.Axi1
                        self.axi1_port = MemArea.OnChipFlash

    OnChipFlash is  internal to Vela and as shown in code, it’s the name used to specify the const_mem_area as being in Read-Only SRAM. in the vela.ini config file, user can specify either OnChipFlash or Sram. If Sram is specified then Vela will automatically change this OnChipFlash and print a message indicating this and will be 2 area of sram created. one called 'Sram' and other one called 'OnChipFlash' - 'Sram'​ area is r/w and 'OnChipFlash' is read only.

    3. Need to investigate this further. I will get back to you. 

Reply
  • @Danter

    1. Efficient burst length - This is a parameter to make Vela estimates memory performance depending on burst length.
    Say when set to 64B, for instance, any burst shorter than 64B will be considered as taking the same time as a 64B burst.
    This can be used to model a memory where short bursts are inefficient. So, 
    this is depends on the interconnect, memory of the HW and you should set burst length to the hardware burst length.

     

    2. "Info: Changing cost_mem_area from Sram to OnChipFlash.
    When you see changing const_mem_area from Sram to OnChipFlash. This will use the same characteristics as Sram.

    In the code, this happens:

                    if self.const_mem_area == MemPort.Axi0:
                        self.const_mem_area = MemPort.Axi1
                        self.axi1_port = MemArea.OnChipFlash

    OnChipFlash is  internal to Vela and as shown in code, it’s the name used to specify the const_mem_area as being in Read-Only SRAM. in the vela.ini config file, user can specify either OnChipFlash or Sram. If Sram is specified then Vela will automatically change this OnChipFlash and print a message indicating this and will be 2 area of sram created. one called 'Sram' and other one called 'OnChipFlash' - 'Sram'​ area is r/w and 'OnChipFlash' is read only.

    3. Need to investigate this further. I will get back to you. 

Children
  • for point 3: I checked the behaviour and it's align with the Vela code. Vela does DMA weight stored in Flash/DRAM (permanent storage) to SRAM, so that they are saved from reading more than once. That's why for the smaller you are getting 0 off chip flash cycles as weights are cached in SRAM. Hope this clears all your questions.