ECC error correction IP design based on BCH code

R. X. Wang; H. Y. Qin; X. N. Zheng; J. Y. Tao

doi:10.1117/12.2660783

2 February 2023 ECC error correction IP design based on BCH code

R. X. Wang, H. Y. Qin, X. N. Zheng, J. Y. Tao

Author Affiliations +

Proceedings Volume 12462, Third International Symposium on Computer Engineering and Intelligent Communications (ISCEIC 2022); 124621A (2023) https://doi.org/10.1117/12.2660783
Event: International Symposium on Computer Engineering and Intelligent Communications (ISCEIC 2022), 2022, Xi'an, China

Abstract

This paper describes a scheme for the ECC Error Correction IP. The common circuit is extracted based on the traditional BCH (Bose, Ray-Chaudhuri, Hocquenghem) codes innovatively, which make the logic gate requirement reduced by 33%; at the same time, the interleaving idea is used for multi-channel ECC error correction, which can correct 4-bit adjacent errors at most; the p-channel parallel chien search circuit is used to replace the serial search circuit to reduce the error correction delay; the time-sharing error correction makes the reading and writing transparent to ECC error correction process. The simulation and tape show that the design can effectively ensure the correctness of DRAM internal data without affecting its access speed.

1. INTRODUCTION

Internet of things (IOT) technology puts forward more stringent requirements on the area and power consumption of SOC (System On Chip). Memory units, such as SRAM (Static Random Access Memory), usually occupy 1/2~2/3 of the chip area, which is an important direction of chip area and power consumption optimization. The 6T SRAM provided by foundry is designed for general use, and its area and power consumption are not ideal. The basic memory cell of DRAM is 2T structure, which has smaller size and leakage current¹.

Replacing SRAM with DRAM can significantly reduce the area and power consumption of the chip. However, its retention time is short, so it needs to refresh the stored charge periodically, which may lead to a small number of bit errors. In order to ensure the correctness of the data, the wrong data bits must be corrected, and the correction circuit must be simple, otherwise the ratio of the internal storage unit of DRAM to the total area of DRAM will be too small, which is contrary to the optimization goal^2-3. Spatial radiation often leads to multi bit data adjacent errors, so the interleaving circuit is adopted in the design to correct them. Access time is an important indicator of memory, and the error search circuit takes up most of it. The parallel search circuit is much faster than the serial search circuit, so the p-channel parallel chien search is adopted to achieve this target. BCH codes are a good linear error correction codes with strong error correction ability, convenient construction and simple coding for correcting multiple random errors.

Based on this background, a BCH coded ECC error correction circuit is designed and implemented, which can effectively correct the random errors in DRAM and ensure the accuracy of data and anti-interference ability⁴.

2. DRAM DATA ERROR ANALYSIS

DRAM data content errors are mainly caused by data attenuation, refresh error and particle radiation.

2.1

2T storage data attenuation characteristics and α Particles and space radiation

The 2T storage structure and read-write circuit are simple, but there is no latch structure. The data content is mainly maintained by the grid parasitic capacitance. Therefore, the retention time is short, resulting in data errors over time. The data attenuation characteristics with time are shown in Figure 1.Without optimization, the data retention time is only about 100μs under 55nm process.

Figure 1.

Data attenuation characteristics

In large scale integrated circuits, packaging materials contain trace amounts of radioactive substances, α Particles will be generated with the decay of these substances⁵. 2T memory cell uses grid parasitic capacitance to realize storage, so it is more vulnerable to α Effects of particles and high-energy particle.The error rate is shown in Figure 2 under 90nm and 55nm fabrication.

Figure 2.

Continuous bit error rate at 90nm and 55nm

2.2

Gamete manipulation interleaved implicit refresh

The 2T storage unit adopts a folding structure to form a storage array. And the write operation is line destructive, which will cause other data in the line to be overwritten except for the target storage unit to be written^6-8.

The design adopts sub-operation interleaved with implicit refresh, that is, the refresh operation is hidden between two external accesses in parallel. The refresh operation occurs in the second half of the previous access and the first half of the next access in parallel, as shown in Figure 3. Due to the situation of refreshing and accessing the same row, the gate circuit will be in an uncertain switching state for a short time, resulting in the possibility of data error.

Figure 3.

Gamete manipulation staggered refresh

3. BCH CODING ERROR CORRECTION

Based on the above error characteristics of memory cells, considering the area and frequency index of DRAM, interleaved BCH coding can be used to correct the error data content.

3.1

BCH coding principle and code interleaving

BCH code is a kind of cyclic linear coding. The polynomial generated by cyclic coding is shown in Formula 1.

LCM represents the minimum common multiple, and its minimum code distance is d > 2t + 1, and it can correct t errors. In the encoding process, the polynomial of the message to be sent is m(x), the encoding polynomial is g(x) and the sent information polynomial is r(x) = m(x)g(x).

In the decoding process, the received information polynomial is r(x), the encoding polynomial is g(x), and the remainder mode is mod(r(x), g(x)), if the remainder is not zero, there is a bit error, and then the error bit can be found through the error positioning polynomial; If the remainder is zero, there is no bit error.

The encoding process can be realized by a shift register with a feedback loop. The connection relationship of the feedback loop is controlled by the coefficients of the generated polynomial. The feedback loop shift register is shown in Figure 4.

Figure 4.

Feedback loop shift register

The bit errors caused by space radiation are generally continuous multi bit errors. If the data is continuously non-interleaved, because a0, a1, a2 and a3 share a set of verification circuit, the verification circuit cannot verify the data with more than 2-bit errors. The data non interleaved arrangement is shown in Figure 5(a).

Figure 5.

(a) Data non-interleaved arrangement. (b) Data interleaving arrangement

Four groups of interleaved arrangement are adopted for the data. In this way, a0, b0, c0 and d0 each adopt independent verification circuits, that is, four sets of calibration circuits, which can calibrate continuous 8-bit errors. The data interleaving arrangement is shown in Figure 5(b).

3.2

BCH coding optimization

BCH code is finally converted into a series of XOR operations to facilitate DRAM layout and function tuning. Some intermediate items are as follows.

s1=bit1^bit2^bit4^bit5^bit7^bit9^bit11^bit12^bit14^bit16^bit18

s2=bit1^bit3^bit4^bit6^bit7^bit10^bit11^bit13^bit14^bit17^bit18

s3=bit2^bit3^bit4^bit8^bit9^bit10^bit11^bit15^bit16^bit17^bit18

May order: t1=bit5^bit12, t2=bit6^bit13, t3=bit8^bit15,

t12=bit1^bit7^bit14, t13=bit2^bit9^bit16, t23=bit3^bit10^bit17, t123=bit4^bit11^bit18,

Then, s1=t1^t12^t13^t123, s2=t2^t12^t23^t123, s3=t3^t13^t23^t123

This design originally required 30 XOR gates, but after optimization, only 20 XOR gates were required, which can save 33% of the area.

3.3

Implementation and optimization of parallel chien search circuit

The p-channel parallel chien search is adopted instead of the serial chien search to find out the error bit position and correct it. Only 1/p of the original search time is required, which can greatly shorten the error bit decoding time and improve the working frequency of DRAM. The p-channel parallel chien search circuit is shown in Figure 6⁹, in which a ~ a^t is the multiplier coefficient. Finally, it is converted into an exclusive OR gate, and the area connection is small.

Figure 6.

Parallel chien search circuit

4. SIMULATION AND VERIFICATION

The corresponding Python model of ECC algorithm is built for simulation, as shown in Figure 7. Among them, the black circle represents the original data before writing to the DRAM, and its value is the equal difference sequence data with step size of 1, the blue square represents the data after adding random errors (the maximum any 2-bit errors) to the DRAM, and the red triangle represents the correct data after calibration. The abscissa is the address to be accessed, and the ordinate is the data value. It can be seen from the figure that after ECC correction, the error data is exactly the same as the original data, that is, the dots and triangles on the figure completely coincide.

Figure 7.

ECC verification algorithm simulation

5. CONCLUSION

A digital IP is designed and implemented that can accurately verify and correct DRAM using BCH coding. The data interleaving method is used in ECC circuit to improve the error correction ability, extract the circuit common factor to simplify the required gate circuit, and the parallel chien search circuit is used to reduce the time of error location. It occupies a small area and is suitable for large-scale common logic process integration. The design has been verified in the 55nm SOC chip project. When the main frequency is 200 MHz, the DRAM read / write refresh is normal, and the data is correct.

REFERENCES

[1]

Ansari, M., and Singh, J., “Capacitorless 2T-DRAM for Higher Retention Time and Sense Margin,” IEEE T. Electron Dev, 67 (3), 902 –906 (2020). https://doi.org/10.1109/TED.16 Google Scholar

[2]

Bang, S., Han, K., Kahng, A. B. and Luo, M., “Delay uncertainty and signal criticality driven routing channel optimization for advanced DRAM products,” ASP-DAC, 697 –704 (20162016). Google Scholar

[3]

Shin, W., Choi, J., Jang, J., Suh, J., Moon, Y. and Kwon, Y., “DRAM-Latency Optimization Inspired by Relationship between Row-Access Time and Refresh Timing,” IEEE T. Comput, 65 (10), 3027 –3040 (2016). https://doi.org/10.1109/TC.2015.2512863 Google Scholar

[4]

Micheloni, R., Ravasio, R., Marelli, A., Alice, E., Altieri, V. and Bovino, A., “A 4Gb 2b/cell NAND Flash Memory with Embedded 5b BCH ECC for 36MB/s System Read Throughput,” ISSCC, 497 –506 (20062006). Google Scholar

[5]

Sasada, T., Ichikawa, S., Kanai, T., “In-flight measurement of space radiation effects on commercial DRAM,” ICM, 480 –483 (20042004). Google Scholar

[6]

Pattabiraman, K., Zorn, B. G., Liu, S., Moscibroda, T. and Zorn, B. G., “Flikker: Saving DRAM Refresh-power through Critical Data Partitionin,” ser. ASPLOS, (20112011). Google Scholar

[7]

Tillinghast, C. W., Cohen, M. S., Voshell, T. W., “Temperature-dependent DRAM refresh circuit,” US, (1994). Google Scholar

[8]

Ohsawa,T., Kai, K., Murakami, K., “Optimizing the DRAM refresh count for merged DRAM/logic LSIs,” IEEE Cat. No. 98TH8379, 82 –87 (1998). Google Scholar

[9]

Chen, Y., Parhi, K. K., “Small area parallel Chien search architectures for long BCH codes,” IEEE T. VLSI. Syst, 12 (5), 545 –549 (2015). https://doi.org/10.1109/TVLSI.2004.826203 Google Scholar

Citation Download Citation

R. X. Wang, H. Y. Qin, X. N. Zheng, and J. Y. Tao "ECC error correction IP design based on BCH code", Proc. SPIE 12462, Third International Symposium on Computer Engineering and Intelligent Communications (ISCEIC 2022), 124621A (2 February 2023); https://doi.org/10.1117/12.2660783

Access the abstract

PROCEEDINGS
5 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Error analysis

Data storage

Computer programming

Feedback loops

Particles

Signal attenuation

Calibration

1.

INTRODUCTION

2.

DRAM DATA ERROR ANALYSIS

2.1

2T storage data attenuation characteristics and α Particles and space radiation

Figure 1.

Figure 2.

2.2

Gamete manipulation interleaved implicit refresh

Figure 3.

3.

BCH CODING ERROR CORRECTION

3.1

BCH coding principle and code interleaving

Figure 4.

Figure 5.

3.2

BCH coding optimization

3.3

Implementation and optimization of parallel chien search circuit

Figure 6.

4.

SIMULATION AND VERIFICATION

Figure 7.

5.

CONCLUSION

REFERENCES

Show All Keywords

Keywords/Phrases

Search In:

Publication Years