# Nonvolatile Processors: Why is it Trending?

Fang Su<sup>1</sup>, Kaisheng Ma<sup>2</sup>, Xueqing Li<sup>2</sup>, Tongda Wu<sup>1</sup>, Yongpan Liu<sup>1</sup>, and Vijaykrishnan Narayanan<sup>2</sup>

<sup>1</sup>Tsinghua National Laboratory for Information Science and Technology (TNList)

<sup>2</sup>The Pennsylvania State University

Email: ypliu@tsinghua.edu.cn, vijay@cse.psu.edu

# **Invited** Paper

Abstract—Energy harvesting has become a promising solution to power up Internet-of-Things (IoT) devices. In this scenario, the constrained power budget and frequent absence of ambient energy cause severe reliability issues and performance degradation on conventional CMOS computing circuits. Fortunately, the advent of nonvolatile processor (NVP) opens the possibility to compute continuously using an intermittent power supply. It is considered as a key component of the next generation IoT edge devices. In this work, we provide insights to the evolution of the NVP and its application in real world scenarios. Efforts on improving the performance of NVP and future research prospects are also discussed in this paper.

*Index Terms*—Internet of Things (IoT), energy harvesting, nonvolatile processor (NVP)

## I. INTRODUCTION

The notion of interconnected world has already seen a significant transition since the advent of the internet. The next major transition is happening currently with almost every object that we as humans interact with joining this interconnected world creating a boom in Internet-of-Things (IoT) [1]. More than 200 billion devices are projected to become part of the IoT by the year 2020 [2]. While they offer unprecedented opportunities to monitor, analyze and control the physical world with which we interact, powering up all these devices is a critical barrier for IoT deployment [3].

Batteries that have been adopted by most of today's mobile devices is a potential candidate for this large market. However, batteries pose dimension, maintenance and pollution issues in IoT application spaces that would be much more ubiquitous than current day mobile systems [4]. Imagine the challenges that we already face in recycling batteries from our old mobile phones. Energy harvesting [5]–[9] has been widely investigated as a promising substitute for batteries. Energy scavenged from ambient environment such as vibrations, thermal differences, RF energy, solar power and delivered directly to the device. In such a scenario, computing devices have to operate sporadically rather than continuously due to the frequent absence of ambient energy [10].

Conventional CMOS circuits "forget everything" if power supply disappears [11], while retaining computational data in a sleep mode still incurs leakage power, resulting in waste of the precious harvested energy. Other remedies such as



Fig. 1. NVP: the heart of energy harvesting IoT platform [14]-[16].

checkpointing system state into a remote nonvolatile memory (NVM) [12] suffers from low speed and large energy penalty. Fortunately, nonvolatile processor (NVP) [13] simultaneously meets the requirements of zero leakage and nonvolatility. By incorporating emerging nonvolatile technologies, NVP maintains temporary states within embedded nonvolatile flipflops (NVFFs) during a power failure, and resumes back to computational tasks once power supply is recovered. It shows  $1000 \times$  higher backup and recovery speeds when compared with conventional processors. Such features make NVP the key component of a battery-less energy harvesting IoT system, as shown in Fig. 1.

In this paper, we review the history and discuss the research trends and future prospects of NVP. The objective of this paper is to help researchers understand what has been done and what still remains to be addressed in the field of NVP.

The remainder of the paper is organized as follows. In Section II, we first revisit how a processor evolved into an NVP. Emerging applications of NVP, as well as their features and characteristics, are discussed in Section III. Ongoing research efforts from different aspects are presented in Section IV. Finally, Section V discusses the future research prospects and Section VI concludes the paper.

## II. EVOLUTION OF NVP

The capability to maintain data and system states in the absence of input power is essential for an energy harvesting powered computing device. Due to the volatile nature of CMOS circuits, conventional processor [17] has no choice but to adopt an off-chip memory (such as Flash) for data backup, as shown in Fig. 2(a). Upon a power failure (resumption), the write (read) operations of the remote memory are slow and energy-consuming. In addition, the low endurance of Flash

This work was supported in part by NSF ASSIST ERC 1160483, DARPA/SRC LEAST Center, NSFC Grant 61674094, 61271269, Beijing Innovation Center for Future Chip and High-Tech Research and Development (863) Program under Contract 2013AA013201.



Fig. 2. Evolution of NVP.

memory ( $< 10^6$  write/erase cycles) precludes the processor from long-term autonomous service.

Advances in VLSI have made it feasible to integrate CMOS and emerging NVM [18] onto one die, as shown in Fig. 2(b). These NVM technologies, including ferroelectric random access memory (FRAM) [19], magnetic random access memory (MRAM) [20], phase-change memory (PCM) [21], resistive random access memory (RRAM) [22], offer a full range of benefits such as high density, low read/write energy, long endurance and 3-D integration compatibility. For example, the processor in [23] integrates an FRAM macro to copy system states in the event of a power failure. Nevertheless, this is not a fundamental solution since the bits held in flip-flops still have to be transferred into and read out from the centralized NVM in a sequential manner, resulting in significant energy and timing overheads.

The heavy burden of data movement severely limits the processor's efficiency, especially given a tight energy budget in energy harvesting scenarios. To address this issue, the first NVP [13] is invented and fabricated through a CMOS/ferroelectric hybrid process, achieving 3  $\mu$ s recovery time from power off. As Fig. 2(c) shows, an NVM element is attached to the standard flip-flop - the component which holds data in a processor - to form an NVFF and realize inplace data backup and restore. The main idea behind NVP is to replace the time-consuming and energy-inefficient byte-bybyte global data migration with a localized full-parallel bit-tobit transfer. Compared with conventional (volatile) processors, NVP offers  $10^3 \times$  higher backup/restore speed and  $10^4 \times$ energy savings [13].

Along this line of thought, NVPs have been widely investigated with regard to various NVM technologies. In [24] and [25], ferroelectric NVPs are implemented at lower area cost and higher on/off switching speed than [13]. Nanoseconds backup and restore time is achieved by integrating MRAM in NVP [26]. Recently, RRAM based NVP [27] is reported, providing higher energy efficiency by adopting an adaptive retention scheme.

Aside from the emerging nonvolatile memory technologies, the advent of emerging beyond-CMOS logic transistors, such as tunneling field-effect transistors (TFETs) and negative ca-



Fig. 3. (a) TFET  $I_{DS} - V_{GS}$  in comparison with CMOS [28]–[30]. (b) Forward progress improvement with TFET logic of MiBench testbenches powered with ambient RF energy [31].



Fig. 4. Nonvolatile computing with NCFET. (a) NCFET in a fin structure [35]. (b) NCFET device drain current versus gate driving voltage [33]. (c) NCFET logic with logic-memory synergy.

pacitance field-effect transistors (NCFETs), has also brought great opportunities towards a new paradigm of future lowpower nonvolatile computing. Those emerging devices, either showing enhanced Boolean logic operation with higher energy efficiency, or could actually be harnessed to redesign existing computing methods by introducing features beyond CMOS Boolean logic.

One example has already been shown in the region of NVP design in [31], where TFET transistors are used to replace CMOS transistors to operate at a lower voltage for higher computing energy efficiency, as well as for higher power conversion efficiency from ambient energy sources. It is shown in Fig. 3, that the adoption of TFET provides an average of  $2.7 \times$  computation forward progress improvement over the baseline NVP design using LP CMOS. Similar advantage is also possible for other steep-slope devices [28], [33], [34].

One more exciting technology for NVP is the idea of embedding nonvolatility into the computing logic with NCFETs (Fig. 2(d)), in a means that logic gates could also store their states in a nonvolatile fashion: the output will be restored when supply is back after a power outage. Such a feature is a co-design of device and circuit, and have great potential of

| TABLE I                                     |                     |
|---------------------------------------------|---------------------|
| PERFORMANCE COMPARISON OF REPORTED NVPS AND | VOLATILE PROCESSORS |

| Туре            | Volatile        | Processor  | NVP             |             |                 |                 |                  |             |
|-----------------|-----------------|------------|-----------------|-------------|-----------------|-----------------|------------------|-------------|
| Publication     | [17]            | [23]       | [13]            | [25]        | [24]            | [26]            | [27]             | [32]        |
| CMOS Technology | N/A             | N/A        | 130 nm          | 130 nm      | 130 nm          | 90 nm           | 65 nm            | 350 nm      |
| NVM             | Flash           | FRAM       | FRAM            | FRAM        | FRAM            | MRAM            | RRAM             | CAAC-OS     |
| Active Power    | 450 $\mu$ W/MHz | 200 µW/MHz | 160 $\mu$ W/MHz | N/A         | 170 $\mu$ W/MHz | 145 $\mu$ W/MHz | $33 \ \mu W/MHz$ | N/A         |
| Backup Energy   | 0.28 µJ/bit     | N/A        | 14.4 pJ/bit     | 3.44 pI/bit | 2.2 pJ/bit      | 6 pJ/bit        | N/A              | 3.65 pI/bit |
| Restore Energy  | 0.37 nJ/bit     | N/A        | 5 pJ/bit        | 5.44 pj/bh  | 0.66 pJ/bit     | 0.3 pJ/bit      | 15 fJ/bit        | 3.05 pi/on  |
| Backup Time     | 6 ms            | 212 µs     | 7 μs            | 2.2 µs      | 320 ns          | 4 ns            | 4 us             | 100 ns      |
| Restore Time    | 3 ms            | 310 µs     | 3 µs            | 2 µs        | 384 ns          | 120 ns          | 20 ns            | 167 ns      |

further optimization at the architecture level. Fig. 4(a) shows an NCFET device structure. Fig. 4(b) shows a typical NCFET I-V curve with hysteresis around zero gate-source voltage. Fig. 4(c) shows the concept of a nonvolatile logic gate (an inverter here), in which the inverter is powered by GND and VDD. When the input D ranges from  $V_{LOW}$  to  $V_{HIGH}$ , i.e. the gate-source voltage at the rising and falling edges of the NCFET I-V hysteresis, respectively, the output Q will not be changed, just like a memory. Q will only be updated when the input D is beyond the range of  $[V_{LOW}, V_{HIGH}]$ . Such a feature enables a dynamic operation of logic-memory synergy. More importantly, this memory is nonvolatile and will be capable of restoring the output when the power supply recovers after a power outage. Meanwhile, as is shown, it is different from conventional DFF-based designs in that it does not need a clock signal. Further exploration of NCFET nonvolatile computing would be very promising.

Table I lists the performance of silicon-verified NVPs as well as volatile processors. Here we highlight the active power, backup/restore time and energy as the most important performance indicators.

# III. APPLICATIONS OF NVP

# A. Why NVP: A Case Study

Before we dive into the discussion about applications, one question should be clarified – why NVP outperforms volatile processor? The following case study will answer the question and reveal the scientific reason behind NVP.

Suppose a body-heat powered smart patch wants to measure the exposure to ultraviolet (UV) radiation of human skin. Since the power income is extremely low (20  $\mu$ W/cm<sup>2</sup>), the patch runs sporadically, not continuously, with the help of a capacitor  $C_{bulk}$  as energy buffer (Fig. 5(a)). The system stays OFF until  $C_{bulk}$  reserves enough energy for a sensing operation and a successful restore-backup pair.

For comparison, a volatile processor [17] and an NVP [13] are adopted, individually, in such system, and their parameters are listed in Fig. 5(b). Since NVP exhibits much lower backup/restore energy than volatile processor, it requires a much smaller  $C_{bulk}$  (106 nF vs. 47  $\mu$ F) and therefore a shorter charging time (52 ms vs. 22.3 s) for a single measurement, as shown in Fig. 2(c). As a consequence, NVP provides  $400 \times$  data throughput boost than its counterpart. In other



(b) Processor Volatile [17] NVP [13] Body-heat Energy Source  $20 \,\mu\text{W/cm}^2 \times 1 \,\text{cm}^2$ UV Sensing 1.0 µJ 445 µJ Data Backup 23.1 nJ 0.6 µJ Data Restore 8 1 nJ 46 µF 106 nF  $C_{bulk}$ Charging Time 22.3 s 52 ms 0.045 sample/s Measure Rate 18.9 sample/s



Fig. 5. Case study: a body-heat powered UV smart patch.

| Application                                                               | SHM                           | Surveillance           | Healthcare                |
|---------------------------------------------------------------------------|-------------------------------|------------------------|---------------------------|
| Energy source                                                             | solar/wind                    | RF                     | Thermal/Piezo             |
| Sensors                                                                   | temp./strain/<br>acceleration | image/sound/<br>motion | temp./ECG/<br>pressure/UV |
| Duty cycle                                                                | 10s - 100s                    | 1s                     | 1ms - 10ms                |
| Communication                                                             | Cellular                      | Wi-Fi                  | RFID                      |
| Computation complexity                                                    | *                             | ***                    | **                        |
| Energy<br>distribution<br>backup/recovery<br>computation<br>I/O operation | 21% E<br>25% 54%              | (B) 37%<br>56%         | (S)<br>38%<br>29%         |

Fig. 6. Application domains of NVP.

words, NVP converts the precious harvested energy into more valuable information, and that is the fundamental reason why we build NVP.

# B. Application Domains

Self-powered IoT applications, which tend to be hard to reach [36], or limited in size [37], or require autonomous longterm service [38], can take full advantage of NVP. Also, the fast power-on response makes NVP useful in timing-critical systems such as video surveillance [39]. It's noteworthy that



Fig. 7. Self-write-termination NVFF [27].

NVP is not restricted to low-power applications. The literallyzero standby power and fast on/off switching capability are also promising for fine-grained power management in high performance computers.

Fig. 6 presents the features and characteristics of three selected application domains of NVP: 1) structural health monitoring (SHM); 2) building security and 3) personal healthcare. One interesting observation is that the energy breakdown varies a lot among applications due to different task pattern and computation complexity. Therefore, energy consumption on 1) data backup/restore; 2) computation and 3) sensing and transceiving (or I/O operations) should be simultaneously optimized. In addition, the energy harvester and power management circuits should also be carefully designed to increase the amount of energy scavenged from ambient environment and minimize conversion loss. Section IV will present the ongoing efforts to improve NVP from these aspects.

## **IV. ONGOING EFFORTS**

## A. Data Backup/Restore

Efforts on reducing backup/restore overheads of NVP can be categorized into three ways.

1) Minimize the frequency of backup/restore operations: A time-domain adaptive NVP is proposed in [27], wherein data retention is preferred rather than backup/restore to survive "short" power interruptions. The failure times can also be reduced by scaling the supply voltage or operating frequency [40], [41] in accordance with input power, which will be elaborated in the next subsection.

2) Reduce the amount of data to be stored: Techniques exploiting data pattern have been reported to eliminate redundant backup/restore of unused bits [27], [42]. Compression based approaches [43], [44] also helps to save the read/write energy.

*3) Optimize bit-level backup/restore cost:* Per-bit backup and restore cost can be reduced with advanced nonvolatile materials and more efficient read/write circuits [45]–[49]. Among those approaches, self-write-termination (SWT) [27] is a promising one to save write energy as well as prolong the lifetime of RRAM based NVFFs.

The motivation for SWT is to handle the large switching time variation of RRAM devices and to eliminate unnecessary store operations (e.g. the original state of RRAM matches the data to be stored). As Fig. 7(a) shows, the fast and slow devices show more than  $100 \times$  difference in switching time.

| Energy<br>Harvester | ╞ | Bottleneck<br>Resource<br>Predictor | $\Rightarrow$ | Proper<br>Frequency<br>Predictor |  | NVP |
|---------------------|---|-------------------------------------|---------------|----------------------------------|--|-----|
|---------------------|---|-------------------------------------|---------------|----------------------------------|--|-----|

Fig. 8. Resource/frequency adjustable NVP.

A fixed SET/RESET pulse will induce 1) energy waste and 2) degraded lifetime (Fig. 7(b)). The SWT scheme pre-senses the device state through a feedback path and terminates the SET/RESET process if the RRAM device is already in the target state. Fig. 7(c) shows the schematic of a SWT-NVFF, which achieves up to  $172 \times$  reduction in write energy.

# B. Computation

We define forward progress [15] as instructions committed within NVP, which roughly equals to the energy used for computation divided by energy per instruction (EPI). In order to transform as much harvested energy as possible, we deploy dynamic computation in a NVP system as shown in Fig. 8. It consists of a bottleneck resource predictor, a proper frequency predictor, and a modified NVP with support to adjustable resources and frequency. In a traditional NVP [34], [50], [51], we have two observations. On one side, dissipating energy slowly may cause the storage capacitor to have a large potential to be full, reducing the space for future energy boost. On the other side, consuming the energy too fast results in frequent energy emergency and eventually frequent backup operations.

Frequency scaling is one of the possible alternatives to aggrandize the partition of energy transformed into computational energy by dynamically adjusting the energy dissipating speed. The frequency maintains at a very low level to make the NVP run over most of the power outages. While the frequency boosts significantly to convert more energy boost into forward progress when abundant energy is monitored. The concern may lay in the policy for proper frequency proposal to change along with the harvested power and stored energy.

Another possible solution for better forward progress is to reduce EPI by powering up some essential resources so as to boost the instruction per cycle in an Out-of-order processor. We observe that different testbenches has different sensitivity to certain resources. And providing more resources may significantly boost the instruction per cycle in an Out-of-order processor. How to identify the key resources for the specific testbench and when to power on these resources remain an interesting topic. The potential to merge the frequency scaling and resource allocation is challenging but remains possible.

# C. I/O Operations

I/O operations refer to the interaction between NVP and peripheral devices, which consist of three phases: 1) I/O bus initialization; 2) peripheral configuration and 3) data exchange. These processes have to be repeated if attacked by a power failure. Various approaches have been employed to maintain or restore I/O bus status [12], [52]–[54], but the recovery of the latter two still remains to be addressed.



Fig. 10. Maximum energy efficiency tracking for converter-less systems [58].

To recover the entire I/O operation at minimum cost, we design a strategy wherein checkpoints are set before each data exchange. If a power failure has occurred, the NVP will first reinitialize I/O interfaces and peripheral devices, and then roll back to the checkpoint to retransmit the data. Intuitively, a longer data packet indicates longer rollback time and therefore more energy penalty. The analysis in Fig. 9 shows that the overhead caused by I/O interruption exhibits a superlinear growth with respect to data packet length. Even worse, the system may fall into deadlock if the packet is too long, because one I/O operation cannot be completed within a single power-on period and will be infinitely repeated. Therefore, splitting a long data packet into small pieces will help to improve the system-level efficiency of NVP.

## D. Energy Harvesters and Power Management Circuits

This topic involves 1) boosting the efficiency for converting ambient energy into electricity and 2) minimizing the energy loss due to mismatch between supply-side and loadside. The former requires more advanced energy harvesting technique [55], [56], and the latter seeks for more efficient power converters [29], [30], [57].

Recently, converter-less supply systems have been reported [57] which achieve 20% energy efficiency improvement. However, the removal of power converter may actually harm system-level efficiency, because the maximum efficiency point of electronic load (NVP) lies away from the maximum power point (MPP) of the energy harvester, as shown in Fig. 10(a). Obviously, it is the overall efficiency that should be optimized, rather than the power extracted from harvester. To mitigate this problem, an adaptive voltage-controlled clock frequency controller (CFC) is proposed in [58]. As Fig. 10(b) shows, when sufficient energy is available and the supply voltage rises, the CFC increases the clock frequency of NVP. Consequently, a higher current is drawn from the harvester

until the system reaches equilibrium. 162% overall efficiency enhancement is observed at negligible area penalty.

# V. FUTURE WORKS

Continuous performance improvement on NVP can be expected with advances in beyond-CMOS devices and NVM technologies. Meanwhile, the energy consumption on sensors and analog processing circuits – as the bridge connecting physical and digital world – will become the bottleneck and necessitate cautious redesign towards the IoT era. High-level optimization like architectural exploration, software support and system-on-chip integration are also necessary to take full advantage of NVP. Last but not least, new computing paradigms enabled by emerging devices and NVMs, such as non-boolean logic [59], [60], in-memory processing [61] and neuro-inspired computing [62], may bring about a fundamental change on how NVP works and thus require a device-circuit-application co-design.

## VI. CONCLUSION

NVP is a promising technology for energy harvesting powered IoT edge devices, as it enables continuous and reliable operation in spite of the unstable and intermittent power supply. This paper provides an overview of the research in NVP and reveals the fundamental road map from conventional processor to NVP. In this paper, we also discuss emerging application domains of NVP, and present ongoing efforts to enhance its performance. We believe this paper will help researchers better understand this new born area, and motivate further researches towards optimization of NVPs and their applications.

#### REFERENCES

- L. Atzori *et al.*, "The Internet of Things: A survey," *Computer Networks*, vol. 54, no. 15, pp. 2787–2805, 2010.
- [2] Samsung. (2014) IFA. [Online]. Available: http://news.samsung.com
- [3] J. Gubbi *et al.*, "Internet of Things (IoT): A vision, architectural elements, and future directions," *Future Generation Computer Systems*, vol. 29, no. 7, pp. 1645–1660, 2013.
- [4] J. A. Paradiso and T. Starner, "Energy scavenging for mobile and wireless electronics," *IEEE Pervasive Computing*, vol. 4, no. 1, pp. 18– 27, 2005.
- [5] A. Harb, "Energy harvesting: State-of-the-art," *Renewable Energy*, vol. 36, no. 10, pp. 2641–2654, 2011.
- [6] V. Raghunathan et al., "Design considerations for solar energy harvesting wireless embedded systems," in *Proceedings of the International Sym*posium on Information Processing in Sensor Networks. IEEE, 2005, p. 64.
- [7] S. Beeby *et al.*, "Energy harvesting vibration sources for microsystems applications," *Measurement Science and Technology*, vol. 17, no. 12, p. R175, 2006.
- [8] X. Lu et al., "Thermal energy harvesting for WSNs," in Proceedings of the International Conference on Systems Man and Cybernetics (SMC). IEEE, 2010, pp. 3045–3052.
- [9] V. Talla *et al.*, "Powering the next billion devices with Wi-Fi," *arXiv:1505.06815*, 2015.
- [10] W. K. Seah et al., "Wireless sensor networks powered by ambient energy harvesting (WSN-HEAP)-survey and challenges," in *Proceedings of the Wireless VITAE*. IEEE, 2009, pp. 1–5.
- [11] F. Su et al., "A ferroelectric nonvolatile processor with 46 μs systemlevel wake-up time and 14 μs sleep time for energy harvesting applications," *IEEE Transactions on Circuits and Systems I: Regular Papers*, 2016.
- [12] H. Jayakumar et al., "Quickrecall: A low overhead HW/SW approach for enabling computations across power cycles in transiently powered computers," in Proceedings of the International Conference on VLSI Design and International Conference on Embedded Systems (VLSID). IEEE, 2014, pp. 330–335.

- [13] Y. Wang et al., "A 3us wake-up time nonvolatile processor based on ferroelectric flip-flops," in Proceedings of the European Solid-State Circuit Conference (ESSCIRC). IEEE, 2012, pp. 149-152
- [14] Y. Liu et al., "Ambient energy harvesting nonvolatile processors: from circuit to system," in Proceedings of the 51st Annual Design Automation Conference (DAC). ACM, 2015, p. 150.
- [15] K. Ma et al., "Architecture exploration for ambient energy harvesting nonvolatile processors," in Proceedings of the International Symposium on High Performance Computer Architecture (HPCA). IEEE, 2015, pp. 526-537.
- [16] F. Su et al., "Design of nonvolatile processors and applications," in Proceedings of the International Conference on Very Large Scale Integration (VLSI-SoC). IEEE, 2016, pp. 1–6.
- [17] Texas Instruments. (2015) Datasheet of MSP430F522X, MSP430F521X mixed-signal microcontrollers. [Online]. Available: http://www.ti.com
- [18] S. Yu and P.-Y. Chen, "Emerging memory technologies: Recent trends and prospects," IEEE Solid-State Circuits Magazine, vol. 8, no. 2, pp. 43-56, 2016.
- [19] J. F. Scott, Ferroelectric memories. Springer Science & Business Media, 2013, vol. 3.
- [20] J.-G. Zhu, "Magnetoresistive random access memory: the path to competitiveness and scalability," Proceedings of the IEEE, vol. 96, no. 11, pp. 1786–1798, 2008.
- [21] H.-S. P. Wong *et al.*, "Phase change memory," *Proceedings of the IEEE*, vol. 98, no. 12, pp. 2201–2227, 2010.
- [22] "Metal-oxide RRAM," Proceedings of the IEEE, vol. 100, no. 6, pp. 1951–1970, 2012.
- [23] Texas Instruments. (2014) Datasheet of MSP430FR573X mixed-signal microcontrollers. [Online]. Available: http://www.ti.com
- [24] S. C. Bartling et al., "An 8MHz 75µA/MHz zero-leakage non-volatile logic-based Cortex-M0 MCU SoC exhibiting 100% digital state retention at VDD= 0V with; 400ns wakeup and sleep transitions," in International Solid-State Circuits Conference Digest of Technical Papers (ISSCC). IEEE, 2013, pp. 432–433. [25] M. Qazi *et al.*, "A 3.4-pJ FeRAM-enabled D flip-flop in 0.13-CMOS for
- nonvolatile processing in digital systems," *IEEE Journal of Solid-State Circuits*, vol. 49, no. 1, pp. 202–211, 2014.
- [26] N. Sakimura et al., "A 90nm 20MHz fully nonvolatile microcontroller for standby-power-critical applications," in International Solid-State Circuits Conference Digest of Technical Papers (ISSCC). IEEE, 2014, pp. 184–185.
- [27] Y. Liu et al., "A 65nm ReRAM-enabled nonvolatile processor with 6x reduction in restore time and 4x higher clock frequency using adaptive data retention and self-write-termination nonvolatile logic," in International Solid-State Circuits Conference Digest of Technical Papers (ISSCC). IEEE, 2016, pp. 84-86.
- X. Li et al., "RF-powered systems using steep-slope devices," in New Circuits and Systems Conference (NEWCAS), 2014, pp. 73–76. [28]
- [29] U. Heo *et al.*, "A high-efficiency switched-capacitance HTFET charge pump for low-input-voltage applications," in *Proceedings of the Inter-national Conference on VLSI Design*. IEEE, 2015, pp. 304–309.
- [30] H. Liu et al., "Tunnel FET RF rectifier design for energy harvesting applications," *IEEE Journal on Emerging and Selected Topics in Circuits and Systems*, vol. 4, no. 4, pp. 400–411, 2014.
- [31] K. Ma et al., "Nonvolatile processor optimization for ambient energy harvesting scenarios," in Non-volatile Memory Technology Symposium (NVMTS), 2015.
- [32] A. Isobe et al., "A 32-bit CPU with zero standby power and 1.5-clock sleep/2.5-clock wake-up achieved by utilizing a 180-nm c-axis aligned crystalline In-Ga-Zn oxide transistor," in Symposium on VLSI Circuits Digest of Technical Papers. IEEE, 2014, pp. 1–2.
- [33] S. George et al., "Device circuit co design of fefet based logic for low voltage processors," in VLSI (ISVLSI), 2016 IEEE Computer Society Annual Symposium on. IEEE, 2016, pp. 649-654.
- [34] K. Ma et al., "Nonvolatile processor architectures: Efficient, reliable progress with unstable power," *IEEE Micro*, vol. 36, no. 3, pp. 72–83, 2016.
- [35] K.-S. Li et al., "Sub-60mv-swing negative-capacitance FinFET without hysteresis," in International Electron Devices Meeting (IEDM). IEEE, 2015, pp. 22–6.
- [36] G. Park et al., "Energy harvesting for structural health monitoring sensor networks," Journal of Infrastructure Systems, vol. 14, no. 1, pp. 64-79, 2008
- [37] P. D. Mitcheson, "Energy harvesting for human wearable and im-plantable bio-sensors," in *Proceedings of the Annual International* Conference of the IEEE Engineering in Medicine and Biology. IEEE, 2010, pp. 3432-3436.
- [38] M. T. Lazarescu, "Design of a WSN platform for long-term environ-mental monitoring for IoT applications," *IEEE Journal on Emerging* and Selected Topics in Circuits and Systems, vol. 3, no. 1, pp. 45-54, 2013.

- [39] S. Naderiparizi et al., "Wispcam: A battery-free RFID camera," in Proceedings of the International Conference on RFID (RFID). IEEE. 2015, pp. 166-173.
- [40] F. Zhang et al., "A batteryless 19μw MICS/ISM-band energy harvesting body area sensor node SoC," in *International Solid-State Circuits Conference Digest of Technical Papers (ISSCC)*. IEEE, 2012, pp. 298– 300
- [41] D. Balsamo *et al.*, "Graceful performance modulation for power-neutral transient computing systems," *IEEE Transactions on Computer-Aided* Design of Integrated Circuits and Systems, vol. 35, no. 5, pp. 738-749, 2016.
- [42] H. Li et al., "An energy efficient backup scheme with low inrush current for nonvolatile SRAM in energy harvesting sensor nodes," in Proceedings of the Design, Automation & Test in Europe Conference &
- *Exhibition* (*DATE*). IEEE, 2015, pp. 7–12. X. Sheng *et al.*, "SPaC: A segment-based parallel compression for backup acceleration in nonvolatile processors," in *Proceedings of the* [43] Design, Automation & Test in Europe Conference & Exhibition (DATE).
- IEEE, 2013, pp. 865–868.
  [44] Y. Wang *et al.*, "PaCC: A parallel compare and compress codec for area reduction in nonvolatile processors," *IEEE Transactions on Very Large*
- [45] W. Zhao *et al.*, "High speed, high stability and low power sensing amplifier for MTJ/CMOS hybrid logic circuits," *IEEE Transactions on* Magnetics, vol. 45, no. 10, pp. 3784–3787, 2009. [46] H. Koike *et al.*, "A power-gated MPU with 3-microsecond entry/exit
- delay using MTJ-based nonvolatile flip-flop," in *Proceedings of the Asian* Solid-State Circuits Conference (A-SSCC). IEEE, 2013, pp. 317–320.
- D. Chabi *et al.*, "Ultra low power magnetic flip-flop based on check-pointing/power gating and self-enable mechanisms," *IEEE Transactions* [47] on Circuits and Systems I: Regular Papers, vol. 61, no. 6, pp. 1755-1765, 2014.
- [48] M.-F. Chang et al., "Embedded 1Mb ReRAM in 28nm CMOS with 0.27to-1V read using swing-sample-and-couple sense amplifier and self-boost-write-termination scheme," in *International Solid-State Circuits Conference Digest of Technical Papers (ISSCC)*. IEEE, 2014, pp. 332– 333
- S. George *et al.*, "NCFET based logic for energy harvesting systems," *SRC TECHCON*, 2015. [49]
- K. Ma et al., "Nonvolatile processor architecture exploration for energy-harvesting applications," *IEEE Micro*, vol. 35, no. 5, pp. 32–40, 2015. [50]
- X. Li et al., "Enabling Internet-of-Things: Opportunities brought by emerging devices, circuits, and architectures," in *Proceedings of the* [51] International Conference on Very Large Scale Integration (VLSI-SoC). IEEE, 2016, pp. 1–6. Z. Li et al., "HW/SW co-design of nonvolatile IO system in energy
- [52] harvesting sensor nodes for optimal data acquisition," in Proceedings of the 52nd Annual Design Automation Conference (DAC). ACM, 2016, p 154
- [53] H. Jayakumar et al., "Energy-aware memory mapping for hybrid FRAM-SRAM MCUs in IoT edge devices," in *Proceedings of the International* Conference on VLSI Design and International Conference on Embedded Systems (VLSID). IEEE, 2016, pp. 264-269.
- —, "Sleep-mode voltage scaling: Enabling SRAM data retention at ultra-low power in embedded microcontrollers," ACM Transactions on [54]
- [55]
- B. B. Stations of the infection interconnection of the stations of the theorem of the station of the theorem of the station of the theorem of theorem of the theorem of the theorem of the theorem of the theor [56]
- (IGSC). IEEE, 2015, pp. 1–2. X. Sheng *et al.*, "A high-efficiency dual-channel photovoltaic power system for nonvolatile sensor nodes," in *Proceedings of the Non-Volatile Memory Systems and Applications Symposium (NVMSA)*. IEEE, 2014, [57] pp. 1–2.
- Y. Sun et al., "Maximum energy efficiency tracking circuits for converter-less energy harvesting sensor nodes," *IEEE Transactions on* [58] Circuits and Systems II: Express Briefs, 2016.
- L. Liu et al., "A reconfigurable low-power BDD logic architecture using [59]
- L. LIU et al., "A reconfigurable low-power BDD logic architecture using ferroelectric single-electron transistors," *IEEE Transactions on Electron Devices*, vol. 62, no. 3, pp. 1052–1057, 2015. W.-Y. Tsai et al., "Enabling new computation paradigms with HyperFET-an emerging device," *IEEE Transactions on Multi-Scale Computing Systems*, vol. 2, no. 1, pp. 30–48, 2016. J. Borghetti et al., "memristive switches enable stateful logic operations via metrical implication". [60]
- [61] via material implication," *Nature*, vol. 464, no. 7290, pp. 873–876, 2010. [62] S. H. Jo *et al.*, "Nanoscale memristor device as synapse in neuromorphic
- systems," Nano letters, vol. 10, no. 4, pp. 1297-1301, 2010.