NBTI Mitigation Method by Inputting Random Scan-In Vectors in Standby Time

Hiroaki KONOURA††(a), Toshihiro KAMEDA††(b), Student Members, Yukio MITSUYAMA†††(b), Masanori HASHIMOTO††(c), and Takao ONOYE††(d), Members

SUMMARY Negative Bias Temperature Instability (NBTI) is one of the serious concerns for long-term circuit performance degradation. NBTI degrades PMOS transistors under negative bias, whereas they recover once negative bias is removed. In this paper, we propose a mitigation method for NBTI-induced performance degradation that exploits the recovery property by shifting random input sequence through scan paths. With this method, we prevent consecutive stress that causes large degradation. Experimental results reveal that random scan-in vectors successfully mitigate NBTI and the path delay degradation is reduced by 71% in a test case when standby mode occupies 10% of total time. We also confirmed that 8-bit LFSR is capable of random number generation for this purpose with low area and power overhead.

key words: NBTI, NBTI mitigation, performance degradation, scan path, aging, reliability

1. Introduction

In nanoscale integrated circuit design, performance degradation due to aging effects is becoming a major concern. Conventionally, to cope with the performance degradation due to aging, a design margin is given so that even an aged circuit will satisfy the performance requirement. However, as the necessary margin increases with technology scaling, the performance loss due to the margin is becoming less acceptable. Therefore, mitigating aging effects is highly demanded.

Among various aging effects, negative bias temperature instability (NBTI) is nowadays one of the most influential effects. NBTI causes a gradual increase in |ΔVth| while negative bias is applied to PMOS, i.e., PMOS is ON. Here, this condition is defined as stress phase of NBTI. The |ΔVth| increase reduces drain current and consequently lengthens path delay, which leads to timing failures. On the other hand, while PMOS is OFF, |ΔVth| gradually decreases to its initial value before the stress enforcement. This condition is defined as recovery phase of NBTI. The PMOS degradation by NBTI gradually progresses through circuit operations that repeat the stress and recovery phases. Besides, reference [1], [2] pointed out that the ratio of stress and recovery phases, which is hereafter called stress probability, is a key parameter that determines the amount of degradation in a long-term perspective. As the stress probability gets close to 1, the degradation increases and approaches to the worst case that DC stress is given. This NBTI property of recovery can be exploited for NBTI mitigation at circuit level by controlling the stress probability.

A possible way to control the stress probability is input vector control (IVC) [3]–[5]. IVC is well known as a leakage reduction technique that exploits the dependency of leakage current on the input vector [6]. For NBTI mitigation, a pre-determined input vector is given to a combinational circuit to minimize performance degradation during standby mode. In [3], Y. Wang et al. evaluated IVC technique for NBTI mitigation taking into account the temperature that affects NBTI-induced performance degradation. The same authors also derived Pareto sets that simultaneously optimize NBTI mitigation and leakage currents in [4]. D. R. Bild et al. introduced internal node control (INC) to IVC in order to improve the controllability of internal nodes [5]. INC adds some devices to set the internal node values to 0 or 1, which involves an increase both in area and path delay.

However, only using a single input vector, a considerable number of PMOSs are in stress phase and their performance degrades in standby mode, because CMOS gates are basically inverting logic and all the logic values cannot be 1 simultaneously. This problem becomes significant as the portion of standby mode increases, because the degradation during the standby mode becomes comparable, or rather larger than that during active mode. To overcome this problem, S. Jin et al. presented a multiple input vector approach [7]. They derived a set of input vectors and allocated standby time to each of them to maximize the NBTI mitigation effect. On the other hand, the number of input vectors is thought to increase as the target circuit becomes larger, which means larger memory is necessary to store input vectors.

We then propose a random input vector approach that shifts random values through scan paths. This approach gives different input vectors for each scan-shift operation, which means the proposed random input vector approach can be regarded as one of multiple input vector approaches, though it does not prepare input vectors beforehand. In addi-
tion, the proposed approach does not need memory to store input vectors. First, we experimentally show the difference in performance degradation with different scan-in vectors, and reveal that random scan-in vectors attain a good mitigation effect. We then evaluate how the quality of randomness affects the mitigation effect with various linear feedback shift register (LFSR) configurations.

It should be noted that another circuit level approach is power gating for NBTI mitigation [8]. As for circuits that implement power gating for leakage current reduction during standby mode, NBTI can be mitigated as a byproduct. On the other hand, it is difficult to apply power gating to high-performance design whose permissible speed degradation is highly limited, because the sleep transistors degrade the performance and increase uncertainty in performance. In addition, aging of sleep transistors could be a serious problem in some cases. The proposed NBTI mitigation is applicable to even such high-performance designs.

The rest of this paper is organized as follows. Section 2 refers the NBTI model used for our evaluation. Section 3 introduces the concept of NBTI mitigation by giving scan-in vectors. In Sect. 4, we evaluate the effect of NBTI mitigation assuming various operations of target circuits, and also clarify that giving random scan-in vectors generated in linear feedback shift register (LFSR) is effective to mitigate NBTI. Finally, we conclude this paper in Sect. 5.

2. NBTI Model

To characterize NBTI, a number of measurement results and closed-form models have been proposed [1], [9], [10]. Among them, long-term degradation and its dependency on stress probability are well modeled in [1]. We therefore assume the model below in this work, though other models can be used as long as the dependency on stress probability is well characterized,

$$|\Delta V_{th}(t)| = \left( \frac{\sqrt{K_e^2 \cdot T_{clk} \cdot \alpha}}{1 - \beta(t)^{1/2n}} \right)^{2n},$$

where $K_e$ is a parameter dependent on supply voltage and temperature. $T_{clk}$ is a clock period, and $\alpha$ is the stress probability of PMOS. $T_{clk} \cdot \alpha$ represents a period of stress time. $\beta(t)$ is a function of time $t$ that has a dependence on temperature and $T_{clk}$, and it includes a term of $(1 - \alpha)$. Both $\alpha$ and $\beta(t)$ are less than 1. Moreover, $n$ is equal to 1/6 in a hydrogen molecule diffusion based model [11].

Reference [12] reported that $|\Delta V_{th}|$ is hardly dependent on $T_{clk}$ in case of $t \gg 1,000 \text{ s}$. In this case, Eq. (2) is more useful for $|\Delta V_{th}|$ estimation instead of Eq. (1) [12].

$$|\Delta V_{th}| \approx \left( \frac{0.001n^2 K_e^2 \alpha C_v}{0.81 \sigma_e^2 (1 - \alpha)} \right)^{n}.$$  

Here, in Eq. (2), when $\alpha$ approaches to 1, $|\Delta V_{th}(t)|$ reaches an infinite value and is not appropriate. In such a situation, as its upper limit, we use Eq. (3) which models only stress phase of NBTI [13].

![Fig. 1 Threshold voltage degradation vs. stress probability (10 years later).](image)

$$|\Delta V_{th}(t)| = \left( K_e^2 t \right)^n$$

Figure 1 exemplifies the amount of threshold voltage degradation derived by Eqs. (2) and (3) after a lapse of ten years in the condition that supply voltage is 1.1 V and temperature is 125°C. The initial threshold voltage is $-180 \text{ mV}$. Here, other parameters in the NBTI model except the initial threshold voltage were referred to [1], where [1] extracted and reported the parameters for transistors in a 65 nm process. In this graph, the degradation is quite large when the stress probability is near 100%. This result suggests that for PMOSs with near 100% stress probability, even a small reduction in the stress probability is helpful to mitigate $|\Delta V_{th}|$ increase. The proposed method aims to decrease stress probability of such PMOSs.

3. NBTI Mitigation by Giving Scan-in Vectors

For mitigating NBTI, it is effective to make PMOSs in consecutive stress experience recovery phase. For this purpose, we give a set of input vectors to combinational circuits through scan paths and switch stressed PMOSs. Note that this work assumes that the target circuit has two operation modes; active and standby modes. In standby mode, FF values are flushed and can be overwritten through scan paths. In some cases, an operation to store and restore FF values might be necessary before and after inputting vectors for NBTI mitigation. By using a set of input vectors, the multiple input vector approach solves the problem of single IVC that some of PMOSs are fixed in stress phase during standby mode. Figure 2 illustrates a conceptual diagram of the NBTI mitigation. To give a set of input vectors to combinational circuits, we use a scan path which is often embedded for design for testability (DFT). In active mode, there are PMOSs which hardly transit in recovery phase, such as a PMOS in the upper NAND gate in Fig. 2(a). The multiple input vector approach gives a chance to such PMOSs to transit in recovery phase during standby mode (Fig. 2(b)).

To apply the multiple input vector approach to a circuit, we need to determine what set of input vectors should be used as scan-in vectors. A possible way is to gather the input vectors derived by IVC techniques, which is proposed in [7]. On the other hand, because an additional memory that stores input vectors is necessary, the number of input
vectors should be small, while targeted PMOSs should have a chance to be in recovery phase. In addition, during scan-in and scan-out operations, the input vectors given to combinational circuits are different from those derived by the technique of [7]. This means that even if a good set of input vectors are available, they cannot be efficiently enforced to the combinational circuit in time when the scan paths are long and/or the standby time is short, because it takes a long time to change the input pattern.

Another approach is to give random scan-in vectors supposing that almost all PMOSs have a chance to be in recovery phase, and this is the approach proposed in this paper. In this approach, every set of input values given to a combinational circuit, which is generated each time the scan path is one-bit shifted with a random scan-in value, is regarded as an input vector in multiple input vector approach. An advantage of this approach is that an additional memory to store the input patterns is unnecessary and the number of input vectors can be easily increased by lengthening the cycle of random numbers. As weak points, this cannot guarantee that targeted PMOSs have a chance of recovery phase and additional mechanism to generate random patterns is necessary.

In the following, we focus on the approach of random scan-in vectors, and experimentally evaluate how much mitigation of performance degradation can be obtained through a case study.

4. Evaluation of NBTI Mitigation

In this section, we show a case study of NBTI mitigation and reveal that inputting random values in scan path is sufficiently effective. We also discuss the generation of random number and its cost of area and power.

4.1 Experimental Setup

As target circuits for evaluation, we selected two circuit blocks in MIPS R3000 processor: 32-bit MDU (Multiply Divide Unit) which operates multiplication and division, and 32-bit ALU (Arithmetic Logic Unit) which operates addition, subtraction and logical operations. In addition, we also target a circuit of discrete cosine transform (DCT) for image processing in MPEG-4 video standard.

These circuits were synthesized with Synopsys DFT Compiler using an industrial 65 nm standard cell library. 32-bit MDU, 32-bit ALU and DCT have 198, 68 and 677 FFs, for which the Synopsys DFT Compiler construct 20, 7 and 5 scan paths with an ordinary option, respectively. Their circuit sizes are 4,670, 3,149 and 11,591 NAND2-equivalent gates, and their increase rates from the non-scan circuits are 5.8%, 3.3% and 9.5%, respectively. In this work, we focus on long-term degradation rather than short-term one, and hence operating frequency $T_{clk}$ both for active mode operation and scan-in operation is not explicitly considered.

Besides, the performance degradation depends on circuit operations during active mode because stress probabilities of PMOS transistors vary depending on input vectors. For taking into account different situations of circuit operation, the following five situations are considered for 32-bit MDU.

**General:** Both operations and operands are randomly selected and executed.

**32-bit_mult:** Operands are randomly selected. Only multiplication is executed.

**32-bit_div:** Operands are randomly selected. Only division is executed.

**16-bit_mult:** Higher 16-bit of operands are fixed to 0. Only multiplication is executed.

**16-bit_div:** Higher 16-bit of operands are fixed to 0. Only division is executed.
division is executed.

Operations for 32-bit-ALU are listed below.

**General:** Both operations and operands are randomly selected and executed.

**16-bit add:** Higher 16-bit of operands are fixed to 0. Only addition is executed.

**Const add:** One operand is fixed to 00000001_{16}, and the other is randomly selected. Only addition is executed.

For DCT, we change input images as follows.

**Black:** Pixel data of inputs is always 0.

**White:** Pixel data of inputs is always 11111111_{2}.

**Noise:** Pixel data of inputs is randomly selected.

**Bus:** A photo image (‘Bus’) is given.

We gave four types of scan-in vectors (“all 1”, “all 0”, “0101” and “random”) during standby mode. “all 0” applies 0’s to scan inputs, and “all 1” applies 1’s. In these two inputs, which are special cases of single-IVC, DC stress is given during standby mode. On the other hand, “0101” and “random” change input vectors to combinational circuits during standby mode. “0101” injects 1 and 0 alternately to scan inputs, in other words, the sequence stored in scan-FFs alternates “010101...” and “101010...” after the scan inputs reach the end of scan path. Moreover, “random” inputs 1 and 0 randomly, and hence the sequence stored in scan-FFs changes variously for every scan-shift. In any cases, primary inputs are supposed to remain the last input values of the previous active mode. Besides, this study does not assume reloading logic values after scan-in. Some circuits need to restore logic values of FFs after standby time, but others need not. The necessity of restoration depends on applications and circuits. In the case that the restoration is necessary, a possible approach for low latency restoration is to embed a shadow latch into each FF to store the original logic value. The area and power overhead for the shadow latches will be briefly investigated in Sect. 4.4.

We evaluated the initial circuit delay and the delay after 10 years assuming supply voltage is 1.1 V, temperature is 125°C and initial threshold voltage is −180 mV. We adopted the procedure of NBTI-aware timing analysis [14] in this paper. We first calculated stress probabilities of all PMOSs with logic simulation taking into account both active and standby modes, next annotated |ΔVth| for each PMOS according to the stress probability, and carried out transistor-level static timing analysis with Synopsys Nanotime. To estimate the upper bound of NBTI mitigation effect, we also computed the circuit delay supposing that all PMOSs were always in recovery phase during the standby mode, which is called as “all recovery”. Note that in complementary digital circuits, such an ideal situation is practically infeasible due to Boolean algebra, but the upper bound of NBTI mitigation effect is useful to evaluate how effectively NBTI mitigation methods work. For example, if the NBTI mitigation effect of a method is very close to this upper bound, this method is working well and there is little room for further improvement.

### 4.2 Evaluation Results

#### 4.2.1 Evaluation on Standby Time Ratio

When implementing the proposed NBTI mitigation method, standby time ratio can be a key parameter that determines the amount of NBTI mitigation. Here, the ratio is defined as (time of standby mode)/(time of active mode + time of standby mode). If the ratio necessary for mitigating NBTI sufficiently is quite large, the designs that can use the proposed method become limited. Therefore, we evaluate the relationship between standby time ratio and mitigated critical path delay.

Figures 3−5 show the dependency of critical path delay on the ratio of standby mode in each case of 16-bit mult of 32-bit MDU, 16-bit add of 32-bit ALU, and black of DCT. Here, the ratio of 0% means that the circuit is in active mode all the time, and 100% corresponds to the case that the circuit is always in standby mode. The initial critical path delays of 32-bit MDU, 32-bit ALU and DCT are 0.978, 3.424 and 2.441 ns, respectively.

First, we can see from Figs. 3(a) and (b) that the critical path delays of “all 0” and “all 1” are larger than those of “0101” and “random”. A single input vector during standby mode significantly degrades the circuit delay. While other single-IVC input vectors might give better results, it should be noted that even such single-IVC input vectors increase the circuit delay as the ratio of standby time approaches to 100%, because almost DC stress is given to the circuit. On the other hand, “random” attained the minimum delay increase, which indicates that giving a set of input vectors to combinational circuits is helpful to mitigate NBTI. Another important observation is that even a short ratio of standby time, such as 2%, greatly contributed to NBTI mitigation (Fig. 3(b)), which suggests that our approach is applicable for busy circuits operating with little idle time.

Figures 4(a) and (b) show the result of 16-bit add. Similarly to Fig. 3, a small standby ratio reduced the delay increase. Below 90% of the ratio, the critical path delays are almost the same for “all 1”, “0101” and “random”, but above 90%, “all 0” and “all 1” induced a significant delay increase, whereas “random” mitigated NBTI further. This is because the degradation of PMOS under static stress during standby mode becomes quite large along the increase of the standby time ratio. Besides, under the DC stress (standby time ratio is 100%), there is 1 ns difference between critical path delay of “all 0” and that of “all 1”. This large difference is explained as follows. In Fig. 4, the critical paths consist of carry propagation logics in full adders. Figure 6 illustrates a part of full adder including the path from carry-in (CIN) to carry-out (COUT). When inputs [A, B, CIN] are “all 0”, then COUT becomes ‘0’. In this case, PMOSs (P1, P2, P3) are under stress. On the other hand, when inputs [A, B, CIN] are “all 1”, then COUT becomes ‘1’. In this case, a PMOS P4 is under stress. Here, the path delays from CIN to COUT of “all 0” and “all 1” after 10 years are evaluated at 0.108 ns.
and 0.056 ns. Note that for both cases, inputs and outputs of n-bit full adders are kept the same, and thus stressed PMOSs are also the same. Therefore, the difference of path delays between “all 0” and “all 1” of 16-bit full adders is estimated as $(0.108 - 0.056) \times 16 = 0.832$ ns, which is mostly correlated with the results of Fig. 4. This result indicates that even under DC stress, path delay degradation can vary significantly depending on scan-in vectors.

Also, Figs. 5(a) and (b) show the result of black of DCT. Similarly to Figs. 3 and 4, a small standby ratio suc-
cessfully mitigated the delay increase. The critical path delay of “all 0” is larger than those of other scan-in vectors. Especially, there is a large difference between “all 0” and “all 1”. We confirmed that this can be explained by the carry propagation logics in full adders similarly to Fig. 4.

Here, to investigate the impact of the number of scan-chains on the mitigation effect of NBTI-induced delay degradation, we evaluated an additional circuit of DCT having only one scan path as an example. This evaluation result is shown in Fig. 7. Compared with Fig. 5 where the number of scan-chains is 5, we could not see a visible difference and the delay differences were within a few pico-seconds in this test case.

4.2.2 Evaluation on Various Operations

As NBTI model of Eq. (1) indicates, degradation amount due to NBTI is expressed as a function of PMOS usage, i.e. circuit operation. Therefore, we here evaluate NBTI mitigation assuming various circuit operations. Table 1 lists the delay increases of 32-bit MDU in the case of 10% standby ratio. In the situations of 16-bit\textsubscript{mult} and 16-bit\textsubscript{div}, the amounts of mitigation with “random” are large, and especially in 16-bit\textsubscript{mult} situation, “random” reduced the delay increase by 71% ((30.0%−8.79%) / 30.0%).

Table 2 lists the evaluation results of 32-bit ALU. Similarly to 32-bit MDU, except for general, “random” attained large NBTI mitigation effects, while “0101” and “all 1” also mitigated NBTI significantly. It is due to that the degradation amount of “all 1” is smaller than “all 0”, as explained in the previous section. It is also surmised that “all 1” achieves large mitigation because operands in which higher bits are

![Fig. 6 A schematic of carry logic in full adder.](image)

![Fig. 7 Critical path delay vs. standby time ratio in using 1 scan-chain (black of DCT).](image)

<table>
<thead>
<tr>
<th>Operation</th>
<th>Critical path delay [ns] (degradation)</th>
</tr>
</thead>
<tbody>
<tr>
<td>No standby time</td>
<td>All recovery</td>
</tr>
<tr>
<td>All 0</td>
<td>All 1</td>
</tr>
<tr>
<td>16-bit\textsubscript{mult}</td>
<td>1.271 (30.0%)</td>
</tr>
<tr>
<td>32-bit\textsubscript{mult}</td>
<td>1.131 (15.6%)</td>
</tr>
<tr>
<td>16-bit\textsubscript{div}</td>
<td>1.282 (29.0%)</td>
</tr>
<tr>
<td>32-bit\textsubscript{div}</td>
<td>1.077 (10.1%)</td>
</tr>
<tr>
<td>General</td>
<td>1.049 (7.3%)</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Operation</th>
<th>Critical path delay [ns] (degradation)</th>
</tr>
</thead>
<tbody>
<tr>
<td>No standby time</td>
<td>All recovery</td>
</tr>
<tr>
<td>All 0</td>
<td>All 1</td>
</tr>
<tr>
<td>16-bit\textsubscript{add}</td>
<td>4.206 (22.8%)</td>
</tr>
<tr>
<td>Const\textsubscript{add}</td>
<td>4.565 (33.3%)</td>
</tr>
<tr>
<td>General</td>
<td>3.599 (5.1%)</td>
</tr>
</tbody>
</table>
Table 3 Critical path delay with and without NBMI mitigation (DCT). The initial delay is 2.441 ns. Numbers in parentheses mean the ratios of delay increase from the initial circuit.

<table>
<thead>
<tr>
<th>Operation</th>
<th>Critical path delay [ns] (degradation)</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>No standby time</td>
<td>All recovery</td>
</tr>
<tr>
<td>Black</td>
<td>3.167 (22.9%)</td>
<td>2.643 (7.6%)</td>
</tr>
<tr>
<td>White</td>
<td>3.170 (23.0%)</td>
<td>2.639 (7.5%)</td>
</tr>
<tr>
<td>Noise</td>
<td>2.594 (5.9%)</td>
<td>2.581 (5.4%)</td>
</tr>
<tr>
<td>Bus</td>
<td>2.594 (5.9%)</td>
<td>2.580 (5.4%)</td>
</tr>
</tbody>
</table>

Table 4 Critical path delay with different LFSR configurations (10 years later).

<table>
<thead>
<tr>
<th>Configuration</th>
<th>Critical path delay [ns]</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>32-bit MDU</td>
</tr>
<tr>
<td>Random</td>
<td>1.064</td>
</tr>
<tr>
<td>4-bit LFSR</td>
<td>1.104</td>
</tr>
<tr>
<td>8-bit LFSR</td>
<td>1.064</td>
</tr>
<tr>
<td>12-bit LFSR</td>
<td>1.064</td>
</tr>
<tr>
<td>16-bit LFSR</td>
<td>1.064</td>
</tr>
</tbody>
</table>

PMOSs commonly follow this tendency. Here, $V_{th}$ mitigation is normalized by that of “all recovery”. This figure indicates that if we could allocate 3% of standby time to recovery phase, 60% of mitigation effect can be obtained. When 15% of standby time can be given to recovery phase, we can have 80% of mitigation effect. We think this is the reason why “random” attained a good NBMI mitigation effect.

4.3 Random Number Generation

The previous section showed that random scan-in vectors are useful to mitigate NBMI. On the other hand, when applying this scheme to a circuit on a chip, we need an on-chip random number generator. We here focus on LFSR which is a popular circuit to generate pseudo random numbers, and evaluate the relation between the quality of random number and NBMI mitigation effect. As the number of registers in LFSR increases, the cycle of random number becomes longer and the quality of random number improves, while the area needed for implementation becomes large.

We here evaluate critical path delays when LFSRs with different bits are used. Here, $V_{th}$ mitigation is normalized as that for active mode. Note that even LFSR and scan-chain are operating fast, the stress probability changes depending on the scan-in vectors, and hence the NBMI mitigation effect can be obtained. The results of 4-bit, 8-bit, 12-bit, and 16-bit LFSRs are shown in Table 4. When 8-bit or larger-bit LFSRs are used, there is little difference from “random” in mitigation efficiency. For 32-bit MDU, 32-bit ALU, and DCT used for experiments in this work, 8-bit LFSR can generate a sufficiently random scan-in vectors for NBMI mitigation. As the number of LFSR bits increases, the cycle of random number becomes
longer, i.e. the number of input vectors for a combinational circuit increases. To give all the input vectors in a limited time, scan-shift must be performed faster when the number of LFSR is large. On the other hand, in the case of smaller LFSR, such as 8-bit, slow shifting is sufficient, which is desirable in terms of power dissipation because fast shifting involves large power dissipation.

4.4 Area and Power Estimation

Thus far, we revealed that NBTI mitigation can be performed by inputting the random scan-in vectors generated in LFSR during a short standby time. We finally discuss the cost of the proposed method, i.e. area and power overhead.

Table 5 lists the area and static power consumption of 32-bit MDU, 32-bit ALU and DCT. Original means non-scan design. Here, we consider that scan design is a common practice for DFT and then is regarded as a baseline for comparison. In this table, the area increase from original circuits to scan designs is in the range of 3.2–8.7%. On the other hand, the differences between scan designs and scan designs with 8-bit LFSR are less than 2.0%. Furthermore, when shadow latches are embedded in scan design for logic value restoration with 8-bit LFSR, the area increase from scan designs is in the range of 8.7–19.4%. As for static power consumption, the differences between original and scan designs are in the range of 1.0–8.3%. Meanwhile, the differences between scan designs with and without 8-bit LFSR are below 2.0%. Besides, as for scan design with shadow latch and 8-bit LFSR, static power consumption increases by 10.5% to 26.2% compared with scan design. These results lead to a deduction that adding LFSR only has a quite limited impact on area and static power consumption.

In addition, Fig. 9 shows the relationship between clock frequency for scan-shift and dynamic power consumption. Here, we assumed that “0101” was given to scan paths. This figure shows that when the clock frequency is kept below 1 MHz, the dynamic power consumption is at most comparable to static power consumption. Through these evaluations, we confirmed that giving random scan-in vectors generated in 8-bit LFSR with up to 1 MHz clock helps the mitigation of NBTI degradation with low area and power overhead.

5. Conclusion

In this paper, we proposed a NBTI mitigation method that injects random patterns to scan paths, and evaluated the NBTI mitigation effect through case studies. The experimental results with 32-bit MDU and ALU in MIPS R3000 processor and DCT circuit for MPEG-4 showed that the proposed method well mitigated NBTI. The reduction of delay increase reached 71% in a test case and was comparable to the upper limit assuming all PMOSs were in recovery phase. We also confirmed that 8-bit LFSR is capable of random number generation for NBTI mitigation with low area and power overhead.

References

KONOURA et al.: NBTI MITIGATION METHOD BY INPUTTING RANDOM SCAN-IN VECTORS IN STANDBY TIME


Hiroaki Konoura received B.E. and M.E. degrees in Information Systems Engineering from Osaka University, Japan, in 2009 and 2011, respectively. He is currently a doctoral student in the Department of Information Systems Engineering at Osaka University. His research interest is development of high reliable reconfigurable architecture. He is a student member of IEEE.

Toshihiro Kameda received the B.E. and M.E. degrees in Information Systems Engineering from Osaka University, Osaka, Japan, in 2010 and 2012, respectively. He is currently with NTT communications. His research interests include testing and fault avoidance for reconfigurable devices.

Hiroaki Konoura received B.E., M.E., and Ph.D. degrees in Information Systems Engineering from Osaka University, Japan, in 1998, 2000, and 2010, respectively. He is currently an Associate Professor in School of Engineering, Kochi University of Technology. His research interests include reconfigurable architecture and its VLSI design. He is a member of IEEE and IPSJ.

Masanori Hashimoto received the B.E., M.E. and Ph.D. degrees in Communications and Computer Engineering from Kyoto University, Kyoto, Japan, in 1997, 1999, and 2001, respectively. Since 2004, he has been an Associate Professor in Department of Information Systems Engineering, Graduate School of Information Science and Technology, Osaka University. His research interest includes computer-aided design for digital integrated circuits, and high-speed circuit design. Dr. Hashimoto served on the technical program committees for international conferences including DAC, ICCAD, ITC, ASP-DAC, DATE, ISPD, and Symposium on VLSI Circuits. He is a member of IEEE, ACM, and IPSJ.

Takao Onoye received the B.E. and M.E. degrees in Electronic Engineering, and Dr.Eng. degree in Information Systems Engineering all from Osaka University, Japan, in 1991, 1993, and 1997, respectively. He is currently a professor in the Department of Information Systems Engineering, Osaka University. His research interests include media-centric low-power architecture and its SoC implementation. He is a member of IEEE, IPSJ, and ITE-J.