#### EECS 507: Introduction to Embedded Systems Research Power, Energy, and Temperature

Robert Dick

University of Michigan

## Outline

#### 1. Action items

- 2. Power models for embedded systems
- 3. Power and temperature definitions and fundamentals
- 4. Thermal analysis

# Action items I

19 Sep: A. Sangiovanni-Vincentelli, W. Damm, and R. Passerone, "Taming Dr. Frankenstein: Contract-based design for cyber-physical systems," *European Journal of Control*, vol. 18, no. 3, pp. 217–238, 2012.

24 Sep: L. Zhang, B. Tiwana, Z. Qian, Z. Wang, R. P. Dick, Z. M. Mao, and L. Yang, "Accurate online power estimation and automatic battery behavior based power model generation for smartphones," in *Proc. Int. Conf. Hardware/Software Codesign and System Synthesis*, Oct. 2010, pp. 105–114.

26 Sep: J. Polastre, R. Szewczyk, A. Mainwaring, D. Culler, and J. Anderson, "Analysis of wireless sensor networks for habitat monitoring," in *Wireless Sensor Networks*, C. S. Raghavendra, K. M. Sivalingam, and T. Znati, Eds. Springer US, 2004, ch. 18, pp. 399–423.

26 Sep: Final project proposals.

# Action items II

1 Oct: E. Ronen, A. Shamir, A.-O. Weingarten, and C. O'Flynn, "IoT goes nuclear: Creating a ZigBee chain reaction," in *Proc. Symp. on Security and Privacy*, May 2017.

3 Oct: K. Mekki, E. Bajic, F. Chaxel, and F. Meyer, "A comparative study of LPWAN technologies for large-scale IoT deployment," *Elsevier ICT Express*, vol. 5, no. 1, pp. 1–7, Mar. 2019.

## Outline

#### 1. Action items

- 2. Power models for embedded systems
- 3. Power and temperature definitions and fundamentals
- 4. Thermal analysis

# General case

Many components.

Each may have many power management/activity states.

System-wide power consumption depends on the specific combination of component states.

How many samples?

- 10 components.
- 5 states, each.
- $5^{10} \simeq 10$ -million system-wide states.

How to get enough samples to characterize?

# Independence assumption for embedded system power modeling

What if a component's power consumption were mostly independent of the power management/activity states of other components?

#### How many samples?

- 10 components.
- 5 states, each.
- 50 samples of interest.

#### The assumption is often correct.

When it is not, can treat the two interdependent components as a single component.

#### Practical embedded system power estimation

#### For each component.

- Put all other components in lowest power state.
- Measure component power consumption in each state.
- Can manually use measurements to build expression for system-wide power consumption.
- Also works for incomplete sampling by using linear regression to find the relationship between each state variable and the system-wide power consumption.

# Applying the power model

Estimate/measure the proportion of time each component spends in each state.

Sum the products of time proportions and component-state power consumptions to get system-wide average power consumption.

This is often inaccurate for instantaneous power consumption.

Not good for power supply provisioning or thermal design.

Often very accurate over timescales of minutes.

Good for battery lifespan estimation.

# Outline

- 1. Action items
- 2. Power models for embedded systems
- 3. Power and temperature definitions and fundamentals
- 4. Thermal analysis

# Definitions

Temperature: Average kinetic energy of particle.

Heat flow: Transfer of this energy.

Heat always flows from regions of higher temperature to regions of lower temperature.

Particles move.

What happens to a moving particle in a lattice?

#### Acoustic phonons

Lattice structure.

Transverse and longitudinal waves.

Electron-phonon interactions.

Effect of carrier energy increasing beyond optic phonon energy?

# Optic phonons

Only occur in lattices with more than one atom per unit cell.

Optic phonons out of phase from primitive cell to primitive cell.

Positive and negative ions swing against each other.

Low group velocity.

Interact with electrons.

#### Nanostructure heat transfer

Boundary scattering.

Quantum effects when phonon spectra of materials do not match.

## Why do wires get hot?

Scattering of electrons due to destructive interference with waves in the lattice.

What are these waves?

What happens to the energy of these electrons?

What happens when wires start very, very cool?

What is electrical resistance?

What is thermal resistance?

# Why do transistors get hot?

Scattering of electrons due to destructive interference with waves in the lattice.

Where do these waves come from?

Where do the electrons come from?

- Intrinsic carriers.
- Dopants.

What happens as the semiconductor heats up?

- Carrier concentration increases.
- Carrier mobility decreases.
- Threshold voltage decreases.

#### Power consumption trends

Initial optimization at transistor level.

Further research-driven gains at this level difficult.

Research moved to higher levels, e.g., RTL.

Trade area for performance and performance for power.

Clock frequency gains linear.

Voltage scaling  $V_{DD}^2$  – important.

# Power consumption in synchronous CMOS

$$\begin{split} P &= P_{SWITCH} + P_{SHORT} + P_{LEAK} \\ P_{SWITCH} &= C \cdot V_{DD}^2 \cdot f \cdot A \\ \dagger P_{SHORT} &= \frac{b}{12} (V_{DD} - 2 \cdot V_T)^3 \cdot f \cdot A \cdot t \\ P_{LEAK} &= V_{DD} \cdot (I_{SUB} + I_{GATE} + I_{JUNCTION} + I_{GIDL}) \\ C : \text{ total switched capacitance} \qquad V_{DD} : \text{ high voltage} \\ f : \text{ switching frequency} \qquad A : \text{ switching activity} \\ b : \text{ MOS transistor gain} \qquad V_T : \text{ threshold voltage} \\ t : \text{ rise/fall time of inputs} \\ \dagger P_{SHORT} \text{ usually} &\leq 10\% \text{ of } P_{SWITCH} \\ \text{ Smaller as } V_{DD} \rightarrow V_T \end{split}$$

#### Adiabatic charging

Voltage step function implies  $E = C V_{CAP}^2/2$ .

Instead, vary voltage to hold current constant:  $E = C V_{CAP}^2 \cdot RC/t$ .

Lower energy if T > 2RC.

Impractical when leakage significant.

#### Wiring power consumption

In the past, transistor power  $\gg$  wiring power.

Process scaling  $\Rightarrow$  ratio changing.

Conventional CAD tools neglect wiring power.

### Leakage



## Subthreshold leakage current

$$H_{subthreshold} = A_s rac{W}{L} v_T^2 \left(1 - e^{rac{-v_{DS}}{v_T}}
ight) e^{rac{(v_{GS} - v_{th})}{nv_T}},$$

where  $A_s$  is a technology-dependent constant,

 $V_{th}$  is the threshold voltage,

L and W are the device effective channel length and width,

 $V_{GS}$  is the gate-to-source voltage,

n is the subthreshold swing coefficient for the transistor,

 $V_{DS}$  is the drain-to-source voltage, and

#### $v_T$ is the thermal voltage.

A. Chandrakasan, W. Bowhill, and F. Fox, *Design of High-Performance Microprocessor Circuits*. IEEE Press, 2001

#### Simplified subthreshold leakage current

 $V_{DS} \gg v_T$  and  $v_T = \frac{kT}{q}$ . q is the charge of an electron. Therefore, equation can be simplified to

$$I_{subthreshold} = A_s \frac{W}{L} \left(\frac{kT}{q}\right)^2 e^{\frac{q(V_{GS} - V_{th})}{nkT}}$$
(1)

Power and temperature definitions and fundamentals

#### **Exponential**?



#### Piece-wise linear error



Y. Liu, R. P. Dick, L. Shang, and H. Yang, "Accurate temperature-dependent integrated circuit leakage power estimation is easy," in *Proc. Design, Automation & Test in Europe Conf.*, Mar. 2007, pp. 1526–1531

#### Gate leakage

Caused by tunneling between gate and other terminals.

$$I_{gate} = WLA_J \left(\frac{T_{oxr}}{T_{ox}}\right)^{nt} \frac{V_g V_{aux}}{T_{ox}^2} e^{-BT_{ox}(a-b|V_{ox}|)(1+c|V_{ox}|)}$$

where  $A_J, B, a, b$ , and c are technology-dependent constants,

nt is a fitting parameter with a default value of one,

 $V_{ox}$  is the voltage across gate dielectric,

 $T_{ox}$  is gate dielectric thickness,

 $T_{oxr}$  is the reference oxide thickness,

 $V_{\it aux}$  is an auxiliary function that approximates the density of tunneling carriers and available states, and

 $V_g$  is the gate voltage.

K. M. Cao, W. C. Lee, W. Liu, X. Jin, P. Su, S. K. H. Fung, J. X. An, B. Yu, and C. Hu, "BSIM4 gate leakage model including source-drain partition," in *IEDM Technology Dig.*, Dec. 2000, pp. 815–818

#### Temperature-aware leakage estimation



Power consumption conclusions

Voltage scaling is currently the most promising low-level power-reduction method:  $V^2$  dependence.

As  $V_{DD}$  reduced,  $V_T$  must also be reduced.

Sub-threshold leakage becomes significant.

What happens if  $P_{LEAK} > P_{SWITCH}$ ?

## Outline

- 1. Action items
- 2. Power models for embedded systems
- 3. Power and temperature definitions and fundamentals
- 4. Thermal analysis

Thermal analysis

# R(C) model





Partition into 3-D elements (diagram 2-D for simplicity)

- Thermal resistance  $\leftrightarrow$  Resistance
  - Heat flow  $\leftrightarrow$  Current
- For dynamic: Heat capacity  $\leftrightarrow$  Capacitance

#### Problem definition

$$Crac{dT(t)}{dt} = AT(t) + PU(t)$$

- A is the thermal conductivity matrix
- $\bullet$  Steady-state: Initial temperature and  ${\bf C}$  unnecessary
- Dynamic: Transient temperature analysis, must also consider heat capacity







#### Thermal analysis infrastructure overview



profile (and hybrid oct-tree)



# Steady-state thermal analysis

#### Basis: Multigrid analysis

Fast, multi-resolution relaxation method for matrix solving.

- Iterative solver (relaxation) on fine grid.
- Coarsen and propagate residual upward.
- Iterative solver for error at coarser level.
- Orrect fine-grained solution based on coarse-grained error.
- Iterative solver for error at fine level.
- Main challenge: Too slow for repeated use on large structures, especially 3-D chip-package modeling.
- Observation: Steepness of thermal gradients vary across IC.

#### Neighbor temperature difference histogram



Spatial adaptation can improve performance w.o. loss of accuracy.

## Hybrid oct-tree

- Reduce element count by merging when  $\Delta_{\mathcal{T}} < \epsilon$
- Conventional oct-tree inefficient for chip-package model
- Anisotropic thermal gradients
- We generalize to hybrid oct-tree
- Arbitrary partitioning on each axis



## Hybrid oct-tree

