

## Measurement of Worst Case Execution Time on PolarFire® SoC FPGA

## Introduction

Microchip's PolarFire<sup>®</sup> SoC FPGAs include the industry's RISC-V based Microprocessor Subsystem (MSS), and FPGA fabric inherits all the features of the PolarFire family. The PolarFire SoC MSS includes 5x 64-bit RISC-V processor cores, AXI Switch, DDR Controller, Fabric Interface Controllers (FIC), and a rich set of peripherals.

Worst Case Execution Time (WCET) is the upper-bound time taken by a software task to execute on a specific hardware platform for any set of inputs. WCET is an important measurement for checking whether the execution time of a task complies with real-time requirements.

This white paper describes the WCET results obtained from running a bare metal task on the PolarFire SoC MPFS250T device. The following table lists the system configuration used for this WCET measurement.

| System Configuration          | Description                                    |
|-------------------------------|------------------------------------------------|
| Product and Architecture      | PolarFire <sup>®</sup> SoC FPGA, RISC-V 64-bit |
| Development Board             | PolarFire <sup>®</sup> SoC Icicle Kit          |
| Example WCET Task             | 32 x 32 Matrix Multiplication                  |
| CPU Core Frequency            | 600 MHz                                        |
| External Memory Configuration | LPDDR4, 800 MHz                                |
| Compiler                      | GCC                                            |
| Toolchain for Bare Metal      | riscv64-unknown-elf-gcc (v8.3.0)               |
| Design Suite                  | Libero <sup>®</sup> SoC v2021.1                |
| Software IDE                  | SoftConsole v2021.1                            |

## **Table of Contents**

| Intro | oduction1                                                                                             |
|-------|-------------------------------------------------------------------------------------------------------|
| 1.    | Latency Factors                                                                                       |
|       | WCET Measurement.       4         2.1. Toolchain Settings.       5         2.2. WCET Results.       5 |
| 3.    | Conclusion7                                                                                           |
| 4.    | Revision History                                                                                      |
| The   | Microchip Website9                                                                                    |
| Proc  | Juct Change Notification Service9                                                                     |
| Cus   | tomer Support9                                                                                        |
| Micr  | ochip Devices Code Protection Feature9                                                                |
| Lega  | al Notice                                                                                             |
| Trac  | lemarks                                                                                               |
| Qua   | lity Management System 11                                                                             |
| Wor   | Idwide Sales and Service12                                                                            |

## 1. Latency Factors

In PolarFire SoC devices, the following hardware and software factors affect the upper-bound of the software execution time.

- Memory region—placement of code, data, and stack segments affect the execution time of a task. Code placed in Instruction Tightly Integrated Memory (ITIM) executes faster than the code placed in DDR memory. Similarly, data placed in DDR memory (cached region) can be accessed faster than data in Loosely Integrated Memory (LIM).
- Cache configuration—LIM, scratchpad, and L2 cache.
- Interrupts—outside event can trigger an interrupt, which might not be in the user's control. In such events, the CPU core halts the execution of the current task and executes the interrupt handler. As a result, the execution time gets incremented.
- Total concurrent tasks in the system.
- Branch prediction and execution pipeline also increment the execution time.
- Toolchain settings (ISA, optimization settings) and Programming Style (Efficiency of the code).

For the WCET task used for this white paper, WCET is measured in terms of execution cycles using the hardware performance monitoring register. For more information about the hardware performance monitoring register, ITIM and LIM, and other MSS functional blocks, see PolarFire SoC MSS Technical Reference Manual.

The following section describes an example bare metal application used for white paper.

## 2. WCET Measurement

A bare metal application was developed for realizing how various latency factors increase the upper-bound of the software execution time. Using this bare metal application, the normal execution time and WCET are measured by disabling and enabling the latency factors. In the application, the device eNVM stores a custom bootloader. At device power-up, the E51 monitor core executes the bootloader for loading separate binary files in the DDR memory for each U54 application core. The bootloader executes from eNVM (code) and scratchpad (data).

The following figure shows the memory hierarchy and the interaction of application cores with different memory regions in the example application.



#### Figure 2-1. Interaction of Application Cores with Different Memory Regions

The following table lists the task executed by each U54 application core and the memory region used for fetching and executing the code. The bootloader loads all the binary files to different DDR regions.

| Application<br>Core | Task(s)                                                                                             | Memory<br>Region                                 | Description                                                                                                                                                                                                                                                                        |
|---------------------|-----------------------------------------------------------------------------------------------------|--------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| U54_1               | Executes the 32 x 32<br>matrix multiplication task<br>and prints the results on<br>serial terminal. | Executes from<br>ITIM and LIM.                   | The bootloader loads the 32 x 32 matrix multiplication task to DDR and U54_1 copies this task to ITIM and LIM for executing it. This scenario demonstrates the predictability of execution time. Because, the non-predictability caused by caching is eliminated in this scenario. |
| U54_2               | Continuously flushes L2<br>cache lines using the L2<br>flush register.                              | Executes from cached DDR.                        | This task acts as adversarial to tasks running on U54_1<br>and U54_4 cores. The effects of continuously evicting<br>or flushing the L2 cache lines on tasks executing from                                                                                                         |
| U54_3               | Continuously reads data from DDR, which results in cache evicts.                                    | Executes from cached DDR.                        | ITIM/LIM and DDR can be observed by executing this task. This scenario helps in analyzing the effects of L2 cache flush and refresh function on the execution time.                                                                                                                |
| U54_4               | Executes 32 x 32 matrix<br>multiplication task and<br>prints results on serial<br>terminal.         | Executes from<br>cached DDR<br>address<br>range. | DDR cached scenario demonstrates unpredictability<br>of execution times due to presence of caches. This<br>scenario is for comparison purpose only.                                                                                                                                |

The normal execution time is measured by excluding U54\_2 and U54\_3 application cores and by disabling branch prediction trashing in U54\_1 and U54\_4 cores. WCET is measured by including the U54\_2 and U54\_3 cores, and by enabling branch prediction trashing and a 1000 mcycle interrupt routine in U54\_1 and U54\_4 cores. At device power-up, bootloader provides an option to enable or disable U54\_2 and U54\_3 cores via the serial terminal. To calculate WCET, the U54\_1 and U54\_4 applications require a rebuild because the latency factors are enabled in the code. The normal execution time and WCET are measured based on scenarios highlighted in Figure 2-1.

- 1. To measure the normal execution time on U54\_1 (ITIM and LIM) and U54\_4 (cached DDR), the following sequence is followed:
  - a. U54\_2 and U54\_3 are excluded > U54\_1 and U54\_4 execute the task without branch prediction trashing and interrupt routine. U54\_4 executes the task from cached DDR region.
  - To measure WCET on U54\_1 (ITIM and LIM) and U54\_4 (cached DDR), the following sequence was followed:
  - a. U54\_2 and U54\_3 are included > U54\_1 and U54\_4 execute the task with branch prediction trashing and interrupt routine. The U54\_4 core executes the task from cached DDR region.

Bare metal user applications are built from the PolarFire SoC Bare Metal library, which includes hardware abstraction layer and drivers to access various MSS blocks such as L1 cache, L2 cache, timer, Interrupt registers, and peripherals. For more information about the Bare Metal library, see https://github.com/polarfire-soc/polarfire-soc-bare-metal-examples.

#### 2.1 Toolchain Settings

2.

The following toolchain settings were applied for all U54 application cores in the bare metal application used for this white paper.

#### Table 2-1. Toolchain Settings

| Attribute          | Setting                      |  |
|--------------------|------------------------------|--|
| Architecture       | rv64g                        |  |
| Integer ABI        | lp64                         |  |
| Floating point ABI | (d) Double precision         |  |
| Tuning             | Toolchain default            |  |
| Code model         | Medium Any (-mcmodel=medany) |  |
| small data limit   | 8                            |  |
| Align              | Strict (-mstrict-align)      |  |
| Optimization Level | None(O0)                     |  |
| Debug Level        | Maximum(-g3)                 |  |
| Language standard  | GNU ISO C11 (-std=gnu11)     |  |

#### 2.2 WCET Results

The following table lists the normal execution time and WCET of a 32 x 32 matrix multiplication task measured using the example application.

| Program Segment                       | U54_                      | 1 Core                    | U54_4 Core <sup>4</sup> |
|---------------------------------------|---------------------------|---------------------------|-------------------------|
| Code (bytes)                          | ITIM (12736)              | LIM (80496 <sup>1</sup> ) | DDR cached              |
| Data                                  | LIM (78000 <sup>2</sup> ) | LIM                       | DDR cached              |
| Stack                                 | LIM                       | LIM                       | DDR cached              |
| Normal Execution Time (±deviation) in | 8407148 (±21 cycles)      | 8407167 (±4 cycles)       | 1955036 (±136 cycles)   |
| mcycles                               | (14.0119 ms)              | (14.0119 ms)              | (3.258 ms)              |
| WCFT <sup>3</sup>                     | 8409198 (±61 cycles)      | 8409426 (±200 cycles)     | 1971234 (±1653 cycles)  |
|                                       | (14.0153 ms)              | (14.0157 ms)              | (3.285 ms)              |

- 1. All sections are combined.
- 2. Data and stack are combined.
- 3. WCET includes the following:
  - a. Branch prediction trashing and branch prediction scheme (static) are enabled in core U54\_4 and U54\_1.
  - b. Adversarial runs on U54\_3; cache flush runs on U54\_2.
  - c. Interrupt with execution time (1000 mcycle).
- 4. The DDR cached execution timings are given for comparison purpose only.

Note: WCET (seconds) = (1/(CPU Core Frequency in MHz × 1000 × 1000)) × (number of execution cycles), where

CPU Core Frequency = 600 MHz

#### 3. Conclusion

In this white paper, the WCET results for a 32 x 32 matrix multiplication task in bare metal were analyzed by executing the task from ITIM, LIM, and DDR cached regions. In all three cases, the task execution time increased due to the latency factors which resulted in WCET. The effects of L2 cache refresh and flush functions were negligible when the tasks were executing from ITIM or LIM.

On U54\_1, when the task was executing from ITIM with LIM as target memory for data and stack, the additional latency resulted mainly from the interrupt execution latency factor.

On U54\_1, when the task was executing from LIM (code, data, and stack), the additional latency resulted mainly from the interrupt execution latency factor. In this case, when the task was executed multiple times, there was a deviation in the execution time mainly due to branch prediction trashing.

On U54\_4, when the task was executing from DDR cached region (code, data, and stack), the additional latency and execution time deviation were higher when compared to the above two cases. This is due to the L2 cache refresh/flush and interrupt execution latency factors. However, the minimum execution time was achieved due to the presence of caches.

Based on the above results, the task executing from ITIM with LIM as target memory for data and stack gave more deterministic behavior.

# 4. Revision History

| Revision | Date    | Description                             |
|----------|---------|-----------------------------------------|
| A        | 10/2021 | The first publication of this document. |

## The Microchip Website

Microchip provides online support via our website at www.microchip.com/. This website is used to make files and information easily available to customers. Some of the content available includes:

- **Product Support** Data sheets and errata, application notes and sample programs, design resources, user's guides and hardware support documents, latest software releases and archived software
- **General Technical Support** Frequently Asked Questions (FAQs), technical support requests, online discussion groups, Microchip design partner program member listing
- **Business of Microchip** Product selector and ordering guides, latest Microchip press releases, listing of seminars and events, listings of Microchip sales offices, distributors and factory representatives

## **Product Change Notification Service**

Microchip's product change notification service helps keep customers current on Microchip products. Subscribers will receive email notification whenever there are changes, updates, revisions or errata related to a specified product family or development tool of interest.

To register, go to www.microchip.com/pcn and follow the registration instructions.

## Customer Support

Users of Microchip products can receive assistance through several channels:

- Distributor or Representative
- Local Sales Office
- Embedded Solutions Engineer (ESE)
- Technical Support

Customers should contact their distributor, representative or ESE for support. Local sales offices are also available to help customers. A listing of sales offices and locations is included in this document.

Technical support is available through the website at: www.microchip.com/support

## **Microchip Devices Code Protection Feature**

Note the following details of the code protection feature on Microchip devices:

- Microchip products meet the specifications contained in their particular Microchip Data Sheet.
- Microchip believes that its family of products is secure when used in the intended manner and under normal conditions.
- There are dishonest and possibly illegal methods being used in attempts to breach the code protection features of the Microchip devices. We believe that these methods require using the Microchip products in a manner outside the operating specifications contained in Microchip's Data Sheets. Attempts to breach these code protection features, most likely, cannot be accomplished without violating Microchip's intellectual property rights.
- · Microchip is willing to work with any customer who is concerned about the integrity of its code.
- Neither Microchip nor any other semiconductor manufacturer can guarantee the security of its code. Code
  protection does not mean that we are guaranteeing the product is "unbreakable." Code protection is constantly
  evolving. We at Microchip are committed to continuously improving the code protection features of our products.
  Attempts to break Microchip's code protection feature may be a violation of the Digital Millennium Copyright Act.
  If such acts allow unauthorized access to your software or other copyrighted work, you may have a right to sue
  for relief under that Act.

## Legal Notice

Information contained in this publication is provided for the sole purpose of designing with and using Microchip products. Information regarding device applications and the like is provided only for your convenience and may be superseded by updates. It is your responsibility to ensure that your application meets with your specifications.

THIS INFORMATION IS PROVIDED BY MICROCHIP "AS IS". MICROCHIP MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WHETHER EXPRESS OR IMPLIED, WRITTEN OR ORAL, STATUTORY OR OTHERWISE, RELATED TO THE INFORMATION INCLUDING BUT NOT LIMITED TO ANY IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY, AND FITNESS FOR A PARTICULAR PURPOSE OR WARRANTIES RELATED TO ITS CONDITION, QUALITY, OR PERFORMANCE.

IN NO EVENT WILL MICROCHIP BE LIABLE FOR ANY INDIRECT, SPECIAL, PUNITIVE, INCIDENTAL OR CONSEQUENTIAL LOSS, DAMAGE, COST OR EXPENSE OF ANY KIND WHATSOEVER RELATED TO THE INFORMATION OR ITS USE, HOWEVER CAUSED, EVEN IF MICROCHIP HAS BEEN ADVISED OF THE POSSIBILITY OR THE DAMAGES ARE FORESEEABLE. TO THE FULLEST EXTENT ALLOWED BY LAW, MICROCHIP'S TOTAL LIABILITY ON ALL CLAIMS IN ANY WAY RELATED TO THE INFORMATION OR ITS USE WILL NOT EXCEED THE AMOUNT OF FEES, IF ANY, THAT YOU HAVE PAID DIRECTLY TO MICROCHIP FOR THE INFORMATION. Use of Microchip devices in life support and/or safety applications is entirely at the buyer's risk, and the buyer agrees to defend, indemnify and hold harmless Microchip from any and all damages, claims, suits, or expenses resulting from such use. No licenses are conveyed, implicitly or otherwise, under any Microchip intellectual property rights unless otherwise stated.

## Trademarks

The Microchip name and logo, the Microchip logo, Adaptec, AnyRate, AVR, AVR logo, AVR Freaks, BesTime, BitCloud, chipKIT, chipKIT logo, CryptoMemory, CryptoRF, dsPIC, FlashFlex, flexPWR, HELDO, IGLOO, JukeBlox, KeeLoq, Kleer, LANCheck, LinkMD, maXStylus, maXTouch, MediaLB, megaAVR, Microsemi, Microsemi logo, MOST, MOST logo, MPLAB, OptoLyzer, PackeTime, PIC, picoPower, PICSTART, PIC32 logo, PolarFire, Prochip Designer, QTouch, SAM-BA, SenGenuity, SpyNIC, SST, SST Logo, SuperFlash, Symmetricom, SyncServer, Tachyon, TimeSource, tinyAVR, UNI/O, Vectron, and XMEGA are registered trademarks of Microchip Technology Incorporated in the U.S.A. and other countries.

AgileSwitch, APT, ClockWorks, The Embedded Control Solutions Company, EtherSynch, FlashTec, Hyper Speed Control, HyperLight Load, IntelliMOS, Libero, motorBench, mTouch, Powermite 3, Precision Edge, ProASIC, ProASIC Plus, ProASIC Plus logo, Quiet-Wire, SmartFusion, SyncWorld, Temux, TimeCesium, TimeHub, TimePictra, TimeProvider, WinPath, and ZL are registered trademarks of Microchip Technology Incorporated in the U.S.A.

Adjacent Key Suppression, AKS, Analog-for-the-Digital Age, Any Capacitor, AnyIn, AnyOut, Augmented Switching, BlueSky, BodyCom, CodeGuard, CryptoAuthentication, CryptoAutomotive, CryptoCompanion, CryptoController, dsPICDEM, dsPICDEM.net, Dynamic Average Matching, DAM, ECAN, Espresso T1S, EtherGREEN, IdealBridge, In-Circuit Serial Programming, ICSP, INICnet, Intelligent Paralleling, Inter-Chip Connectivity, JitterBlocker, maxCrypto, maxView, memBrain, Mindi, MiWi, MPASM, MPF, MPLAB Certified logo, MPLIB, MPLINK, MultiTRAK, NetDetach, Omniscient Code Generation, PICDEM, PICDEM.net, PICkit, PICtail, PowerSmart, PureSilicon, QMatrix, REAL ICE, Ripple Blocker, RTAX, RTG4, SAM-ICE, Serial Quad I/O, simpleMAP, SimpliPHY, SmartBuffer, SMART-I.S., storClad, SQI, SuperSwitcher, SuperSwitcher II, Switchtec, SynchroPHY, Total Endurance, TSHARC, USBCheck, VariSense, VectorBlox, VeriPHY, ViewSpan, WiperLock, XpressConnect, and ZENA are trademarks of Microchip Technology Incorporated in the U.S.A. and other countries.

SQTP is a service mark of Microchip Technology Incorporated in the U.S.A.

The Adaptec logo, Frequency on Demand, Silicon Storage Technology, and Symmcom are registered trademarks of Microchip Technology Inc. in other countries.

GestIC is a registered trademark of Microchip Technology Germany II GmbH & Co. KG, a subsidiary of Microchip Technology Inc., in other countries.

All other trademarks mentioned herein are property of their respective companies.

<sup>©</sup> 2021, Microchip Technology Incorporated, Printed in the U.S.A., All Rights Reserved.

ISBN: 978-1-5224-9011-1

## **Quality Management System**

For information regarding Microchip's Quality Management Systems, please visit www.microchip.com/quality.



# **Worldwide Sales and Service**

| MERICAS                               | ASIA/PACIFIC          | ASIA/PACIFIC            | EUROPE                |
|---------------------------------------|-----------------------|-------------------------|-----------------------|
| Corporate Office                      | Australia - Sydney    | India - Bangalore       | Austria - Wels        |
| 355 West Chandler Blvd.               | Tel: 61-2-9868-6733   | Tel: 91-80-3090-4444    | Tel: 43-7242-2244-39  |
| Chandler, AZ 85224-6199               | China - Beijing       | India - New Delhi       | Fax: 43-7242-2244-393 |
| el: 480-792-7200                      | Tel: 86-10-8569-7000  | Tel: 91-11-4160-8631    | Denmark - Copenhager  |
| ax: 480-792-7277                      | China - Chengdu       | India - Pune            | Tel: 45-4485-5910     |
| echnical Support:                     | Tel: 86-28-8665-5511  | Tel: 91-20-4121-0141    | Fax: 45-4485-2829     |
| ww.microchip.com/support              | China - Chongqing     | Japan - Osaka           | Finland - Espoo       |
| Veb Address:                          | Tel: 86-23-8980-9588  | Tel: 81-6-6152-7160     | Tel: 358-9-4520-820   |
| ww.microchip.com                      | China - Dongguan      | Japan - Tokyo           | France - Paris        |
| tlanta                                | Tel: 86-769-8702-9880 | Tel: 81-3-6880- 3770    | Tel: 33-1-69-53-63-20 |
| uluth, GA                             | China - Guangzhou     | Korea - Daegu           | Fax: 33-1-69-30-90-79 |
| el: 678-957-9614                      | Tel: 86-20-8755-8029  | Tel: 82-53-744-4301     | Germany - Garching    |
| ax: 678-957-1455                      | China - Hangzhou      | Korea - Seoul           | Tel: 49-8931-9700     |
| ustin, TX                             | Tel: 86-571-8792-8115 | Tel: 82-2-554-7200      | Germany - Haan        |
| el: 512-257-3370                      | China - Hong Kong SAR | Malaysia - Kuala Lumpur | Tel: 49-2129-3766400  |
| oston                                 | Tel: 852-2943-5100    | Tel: 60-3-7651-7906     | Germany - Heilbronn   |
| estborough, MA                        | China - Nanjing       | Malaysia - Penang       | Tel: 49-7131-72400    |
| el: 774-760-0087                      | Tel: 86-25-8473-2460  | Tel: 60-4-227-8870      | Germany - Karlsruhe   |
| ax: 774-760-0088                      | China - Qingdao       | Philippines - Manila    | Tel: 49-721-625370    |
| hicago                                | Tel: 86-532-8502-7355 | Tel: 63-2-634-9065      | Germany - Munich      |
| asca, IL                              | China - Shanghai      | Singapore               | Tel: 49-89-627-144-0  |
| el: 630-285-0071                      | Tel: 86-21-3326-8000  | Tel: 65-6334-8870       | Fax: 49-89-627-144-44 |
| ax: 630-285-0075                      | China - Shenyang      | Taiwan - Hsin Chu       | Germany - Rosenheim   |
| allas                                 | Tel: 86-24-2334-2829  | Tel: 886-3-577-8366     | Tel: 49-8031-354-560  |
| ldison, TX                            | China - Shenzhen      | Taiwan - Kaohsiung      | Israel - Ra'anana     |
| l: 972-818-7423                       | Tel: 86-755-8864-2200 | Tel: 886-7-213-7830     | Tel: 972-9-744-7705   |
| ax: 972-818-2924                      | China - Suzhou        | Taiwan - Taipei         | Italy - Milan         |
| etroit                                | Tel: 86-186-6233-1526 | Tel: 886-2-2508-8600    | Tel: 39-0331-742611   |
| ovi, MI                               | China - Wuhan         | Thailand - Bangkok      | Fax: 39-0331-466781   |
| l: 248-848-4000                       | Tel: 86-27-5980-5300  | Tel: 66-2-694-1351      | Italy - Padova        |
| ouston, TX                            | China - Xian          | Vietnam - Ho Chi Minh   | Tel: 39-049-7625286   |
| l: 281-894-5983                       | Tel: 86-29-8833-7252  | Tel: 84-28-5448-2100    | Netherlands - Drunen  |
| dianapolis                            | China - Xiamen        | 161. 04-20-3440-2100    | Tel: 31-416-690399    |
| oblesville, IN                        | Tel: 86-592-2388138   |                         | Fax: 31-416-690340    |
| el: 317-773-8323                      | China - Zhuhai        |                         | Norway - Trondheim    |
| ax: 317-773-5453                      | Tel: 86-756-3210040   |                         | Tel: 47-72884388      |
| el: 317-536-2380                      | Tel. 80-750-5210040   |                         | Poland - Warsaw       |
|                                       |                       |                         | Tel: 48-22-3325737    |
| o <b>s Angeles</b><br>ssion Viejo, CA |                       |                         | Romania - Bucharest   |
| l: 949-462-9523                       |                       |                         | Tel: 40-21-407-87-50  |
| nx: 949-462-9608                      |                       |                         | Spain - Madrid        |
| il: 951-273-7800                      |                       |                         | Tel: 34-91-708-08-90  |
|                                       |                       |                         |                       |
| aleigh, NC                            |                       |                         | Fax: 34-91-708-08-91  |
| l: 919-844-7510                       |                       |                         | Sweden - Gothenberg   |
| ew York, NY                           |                       |                         | Tel: 46-31-704-60-40  |
| l: 631-435-6000                       |                       |                         | Sweden - Stockholm    |
| an Jose, CA                           |                       |                         | Tel: 46-8-5090-4654   |
| el: 408-735-9110                      |                       |                         | UK - Wokingham        |
| el: 408-436-4270                      |                       |                         | Tel: 44-118-921-5800  |
| anada - Toronto                       |                       |                         | Fax: 44-118-921-5820  |
| l: 905-695-1980                       |                       |                         |                       |