Compact eternal diffractive neural network chip for extreme environments

Dong, Yibo; Lin, Dajun; Chen, Long; Li, Baoli; Chen, Xi; Zhang, Qiming; Luan, Haitao; Fang, Xinyuan; Gu, Min

doi:10.1038/s44172-024-00211-6

Download PDF

Article
Open access
Published: 01 May 2024

Compact eternal diffractive neural network chip for extreme environments

Communications Engineering volume 3, Article number: 64 (2024) Cite this article

492 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

Artificial intelligence applications in extreme environments place high demands on hardware robustness, power consumption, and speed. Recently, diffractive neural networks have demonstrated superb advantages in high-throughput light-speed reasoning. However, the robustness and lifetime of existing diffractive neural networks cannot be guaranteed, severely limiting their compactness and long-term inference accuracy. Here, we have developed a millimeter-scale and robust bilayer-integrated diffractive neural network chip with virtually unlimited lifetime for optical inference. The two diffractive layers with binary phase modulation were engraved on both sides of a quartz wafer. Optical inference of handwritten digital recognition was demonstrated. The results showed that the chip achieved 82% recognition accuracy for ten types of digits. Moreover, the chip demonstrated high-performance stability at high temperatures. The room-temperature lifetime was estimated to be 1.84×10²³ trillion years. Our chip satisfies the requirements for diffractive neural network hardware with high robustness, making it suitable for use in extreme environments.

Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit

Article 12 April 2021

Design of task-specific optical systems using broadband diffractive neural networks

Article Open access 02 December 2019

Metasurface-enabled on-chip multiplexed diffractive neural networks in the visible

Article Open access 27 May 2022

Introduction

Over the years, artificial intelligence (AI) methods have been widely used in recognition¹, autonomous driving², scientific research^3,4,5, human-computer interaction⁶, and robotics⁷. With the gradual development of AI capabilities, AI methods are expected to replace manual approaches in extreme environments, such as inclement weather, the deep sea, and space. Neuromorphic hardware is important for the development of AI approaches. For applications in extreme environments, low energy consumption, high robustness, and high speed are important evaluation criteria for neuromorphic hardware. Light is an ideal information carrier because of light-speed wireless passive propagation characteristics, which can help greatly increase the calculation speed while reducing the energy consumption to the level of femtojoules-per-bit⁸, and can be used in various calculations, like complex-valued calculations⁹, matrix multiplication^10,11 and convolution calculations^12,13. Compared with electronic parameters (mobility or carrier number), optical parameters (refractive index or transmittance) of materials are generally less sensitive to changes in temperature and humidity. Therefore, photonic components show great potential for use in extreme environments.

Recently, a wave-based optical neural network, the diffractive neural network (DNN)¹⁴, was reported. DNNs have demonstrated superior performance in various AI tasks, such as image recognition^{14,15,16,17,18,19,20}, optical computing²¹, phase retrieval²², adaptive focusing²³, and terahertz pulse shaping²⁴. In contrast to waveguide-based optical neural networks^25,26,27,28, DNNs mimic the human nervous system in three-dimensional (3D) domains. This feature is realized through the diffraction of waves, thus enabling direct parallel processing of optical image data without converting the data to sequential inputs²⁹. This feature enables DNNs to more quickly recognize target objects in extreme environments.

However, because of structural and material limitations, existing DNNs cannot be used in extreme environments. DNNs are composed of cascaded diffractive layers, and currently, 3D printing is widely used to construct DNNs^{17,21,22,24,30,31,32,33}. However, high robustness and long lifetimes cannot be guaranteed for DNNs made of organic materials. More importantly, the diffractive layers in DNNs are usually spatially separated and operate in the terahertz band^{21,24,30,31,32,33}. As a result, DNNs are typically on the centimeter scale and cannot be integrated on-chip. However, DNNs integrated on Si wafers have been reported^34,35,36,37. With this method, the three-dimensional networks are designed in two dimensions, thus losing the unique parallel processing advantage for 2D optical images. Therefore, the implementation of a 3D-integrated DNN made with stable materials is highly desirable for applications in extreme environments.

Here, we report an on-chip bilayer DNN for optical inference with virtually unlimited lifetime. Based on double-sided lithography, the two binary phase-modulated diffractive layers in the DNN chip were surface engraved on both sides of a single-crystal quartz substrate. Therefore, the input wave travels through the quartz during the computing. More than one million neurons were achieved in each layer. Handwritten digit recognition experiments show that the recognition accuracy for ten types of handwritten digits (0~9) is 82%. The consistency and robustness of this chip in fabrication and test sessions are analyzed. The adaptability of this chip to other tasks, including fashion product recognition and phase imaging, has also been verified through simulations. Moreover, the lifetime of the DNN chip was measured. After accelerated aging at high temperatures, the DNN still demonstrates high performance, and the recognition accuracy for two types of digits can be maintained at 100%. The lifetime at room temperature was estimated to be 1.84 × 10³⁵ years. This DNN chip strategy satisfies the mass-fabrication requirements for DNN hardware with high robustness and can be used for various AI tasks in extreme environments.

Results

Figure 1a shows the schematic of the handwritten digit recognition task with the bilayer DNN chip working at a wavelength of 532 nm. The information carried by the neurons was encoded by the phase values and reflected via the pixel heights, which determined the interference of the secondary waves (Fig. 1b). Thus, through the propagation and diffraction of the coherent waves from the input layer to the DNN and finally to the output layer, a feedforward optical neural network was constructed (Fig. 1c). The DNN inference results are displayed by the output layer through the light intensity distribution. Notably, different from the DNNs with separated layers, the signal transmission between the two diffractive layers occurs within the quartz substrate. Therefore, this integration method can ensure the long-term stability of the layer spacing and the diffractive medium, which guarantees the calculation accuracy. Figure 1d shows a digital image of the DNN chip. The diffractive layers are approximately 8.2 mm × 8.2 mm in size.

**Fig. 1: Bilayer diffractive neural network (DNN) integrated on a quartz substrate.**

TensorFlow-based DNN training

Matrix multiplication is the main mathematical operation used in artificial neural networks. The multiplication of the input and weight matrices of each artificial neural layer reflects the biological signal transmission between neurons through synapses. In DNNs, matrix multiplication operations are implemented optically through the transmission and coherent superposition of the incident coherent waves between the diffractive layers. Therefore, when designing a DNN, a light propagation model between the diffractive layers must be constructed.

Here, we used angular spectrum diffraction to simulate the propagation of the incident light (Supplementary Note 1)³⁸. The Fourier transform used in angular spectrum diffraction is suitable for training DNNs with many neurons. Figure 2a illustrates the forward propagation model and error backpropagation model used in the training process. The training was implemented with the TensorFlow 2.0 framework (Google Inc.). We used the Modified National Institute of Standards and Technology (MNIST)³⁹ handwritten digit database for training. To fabricate the proposed chips, the phase values were trained and binarized (Supplementary Note 2). A training set of 1000 images was used for the DNN training. Based on the principles of DNN, larger training sets can also be used, but accordingly, the training time will increase dramatically. It takes more than 50 h to train a DNN with the same parameters using the full MNIST dataset. More importantly, as shown in Supplementary Fig. 1, the phase distribution resulting from using the full MNIST dataset has a higher frequency compared to a smaller dataset. This will lead to a significant increase in the difficulty of alignment in the optical setup. Therefore, considering the above two factors, we chose a training set of 1000 images.

**Fig. 2: Simulated results of the bilayer diffractive neural network (DNN) chip.**

In addition, the amplitude field of the ten regions was optimized to follow a Gaussian distribution. The area detecting a Gaussian distribution tends to exhibit a more concentrated intensity compared to a typical uniform distribution area, thereby increasing the maximum light intensity density in these regions. Consequently, the camera can capture effective signals more easily, allowing it to operate with shorter exposure time and/or under a lower-laser-power configuration. Thus, the noise in the recorded images can be reduced.

Analysis of the training results

Supplementary Fig. 2a shows the binary phase distributions obtained in each diffractive layer. Supplementary Fig. 2b illustrates the confusion matrix of the recognition results based on the training set. The results show that the recognition accuracy is 96.1%, with a loss of approximately 0.198. Compared with multilevel phase modulation, we found that binary phase modulation does not appear to affect the accuracy of the DNN. The simulation results presented in Supplementary Table 1 show that the accuracy of the bilayer DNN with binary or 256-level phase modulation differs by only 0.2%.

Figure 2b shows the simulated inference results for 10 typical handwritten digit images. The light spots corresponding to the input digits have the strongest intensity (Fig. 2c), indicating that the DNN successfully recognized the input digit. In addition, it was observed that the digital images are present in the output layer, indicating that the incident light is not fully modulated by the DNN. This could be due to the binary phase modulation exhibited by the diffractive layers, leading to a decrease in diffraction efficiency. As a result, the zero-order diffraction, which is represented by the digital image, is visible on the output layer. To resolve this issue, the loss function can be modified (refer to Supplementary Fig. 3). However, as constraints increase, the accuracy of DNNs decreases as well, which is contrary to the expected results. Therefore, in our case, the diffraction efficiency and recognition accuracy appear to be two trade-off parameters. Thus, we chose to guarantee a high accuracy and did not change the loss function.

Importantly, we used a commercial single-crystal quartz wafer as the substrate. Because the thickness of the quartz wafer is 500 μm, the layer spacing is fixed at 500 μm. Therefore, the neurons in the DNN are not fully connected. Supplementary Fig. 4 shows that one neuron in the 1st layer is connected to approximately 7 × 7 = 49 neurons in the 2nd layer through zero-order diffraction. Although the DNN is not fully connected, the bilayer DNN still exhibits significantly better performance than the monolayer DNN. We also trained a monolayer DNN, and its recognition accuracy was only 91.2%, with a loss of approximately 0.437 (Supplementary Table 2). The neural network training process can be simply regarded as the optimization of the weight matrices. Therefore, we analyzed the influence of the layer number on the DNN performance based on the degrees of freedom of the weight values (Supplementary Note 3 and Supplementary Fig. 5). The results show that increasing the layer number increases the degrees of freedom of the weight values, thereby increasing the accuracy of the DNN. For a DNN that is not fully connected, improving the neural connections between layers can increase accuracy. This can be achieved by increasing layer spacing or reducing neuron size (Supplementary Fig. 6). The underlying reason can also be explained by the increase in degrees of freedom of the weight values (Supplementary Note 4).

Performance characterization of the bilayer DNN

We used double-sided photolithography followed by dry etching to engrave the DNN on the surfaces of the quartz plate (“Methods” section). Due to the difference in the refractive indices of quartz and air, pixels with different heights distinctly modulate the phase of the incident light. Supplementary Figs. 7 and 8 show the scanning electron microscopy (SEM) and optical images of the obtained diffractive layer. The diffractive layer pattern is well fabricated on the quartz surface. Plasma dry etching can achieve high-precision engraving with an error of tens of nanometers. Supplementary Fig. 9 shows the height profile of the diffractive layer. The etching depth is approximately 284 nm, which is close to the phase modulation of π/2 for the 532 nm laser.

Figure 3a shows the optical setup. The laser power was adjusted by a half-wave plate and a polarized beam splitter. We used lenses L₁ and L₂ to form a 4f system for beam expansion. We adopted a double Fourier transform to generate optical digit images via phase-only spatial light modulation⁴⁰. Then, we filtered out the 0th-order and -1st-order diffraction patterns through spatial filtering, preserving only the 1st-order diffraction pattern for the DNN test. The positions of the input layer, the DNNs chip, and the output layer are shown in Fig. 3b. The dashed orange line following Lens-4 (L₄) depicts the conjugate plane of the spatial light modulation. An initial input, represented by an amplitude-only digit pattern ‘7’, propagates across a 5 cm free space and serves as the input for the bilayer DNN chip. Subsequently, after traversing approximately 16.4 cm, the output intensity field with the ‘7’ area brightest is captured by a Complementary Metal Oxide Semiconductor (CMOS) camera. Figure 3c shows the experimental results of the 10 typical handwritten digit images. When a digital image is input into the DNN, the corresponding light spot has the largest intensity (Fig. 3d). We also noted that the unmodulated digit image is obtained, which is consistent with the simulation results. Since the images in Fig. 3c are small, a few enlarged images of the results are shown in Supplementary Fig. 10. Except for the unmodulated digit image, the noise outside the target detection regions is very low, which demonstrates the effects of optimizing the output amplitude field and using many neurons in the design.

**Fig. 3: Experimental inference results of the bilayer diffractive neural network (DNN) chip.**

Figure 4 shows the confusion matrices of the recognition results based on the test set. Figure 4a shows the simulation results for 1000 images, with the DNN achieving an accuracy of 85.4%. Figure 4b presents the experimental results for 50 images, with the DNN achieving an accuracy of 82%. The accuracy difference between the experimental and simulation results may be due to chip fabrication and measurement errors. The accuracy is not high, mainly because the number of diffractive layers is only two, and the neurons are not fully connected. Simulations show that under the existing bilayer integration configuration of the chip, subsequently reducing the neuron size or increasing the layer spacing can improve the performance (Supplementary Fig. 6). Correspondingly, these optimizations require improvements in fabrication processes. We compare our DNN chip with other reported works in terms of fabrication, integration, robustness, and performance. As shown in Supplementary Table 3, despite some performance differences, our chip shows high robustness and advances in 3D integration compared to other methods.

**Fig. 4: Confusion matrices for the simulation and experimental results based on the test set. Pct. percentage.**

To verify the chip’s capability to perform other AI tasks, we trained the DNN using the Fashion-MNIST dataset⁴¹ with the same chip parameters. The Fashion-MNIST dataset comprises 10 categories of fashion products, which presents greater complexity compared to the MNIST dataset. Supplementary Fig. 11 shows that the DNN chip can achieve approximately 92.2% and 80.1% accuracy for the training set and the test set, respectively. Meanwhile, we also tried a non-recognition AI task with potential practical applications: phase imaging. Phase is invisible, and extracting phase information from light has important applications in wavefront shaping, biological detection, and other fields. In our simulation, the input digital image is phase information (Supplementary Fig. 12a). From the result (Supplementary Figs. 12b, c), we can see that the DNN can directly convert the input phase to intensity information to realize phase imaging. These results indicate that this DNN chip strategy can be used for various AI tasks.

Robustness studies are crucial for chips. We analyze the impact of errors that may occur during fabrication and testing on the performance of DNNs. Alignment of the diffractive layers in DNN is often technically challenging. In our chip, we achieve this through double-sided photolithography, which typically has an overlay accuracy of about 1~2 μm. Based on the simulation results presented in Fig. 5a, b, it can be concluded that alignment errors may cause a decrease in the accuracy of DNN. However, this decrease is slow within the range of 1~2 pixels (8~16 μm). Therefore, it can be inferred that the overlay error of double-sided photolithography has minimal impact on the performance of the DNN chip, and the fabrication process is capable of ensuring high consistency. Then, the thickness of the substrate affects the layer spacing of DNN. We simulated this impact on the accuracy of DNN. As illustrated in Fig. 5c, the accuracy gradually decreases with the thickness deviation, but the rate is slow. Even with a thickness error of ±50 μm (about 10%), the accuracy only decreases by about 2%. Next, we simulated the effects of wavelength shift in the chip’s test session. The DNN chip was designed using binary phase modulation of 0 and π/2, which represents height distributions of 0 and 289 μm for a quartz substrate. Although variations in the incident wavelength only lead to slight changes in phase modulation (Fig. 5d), they can cause a significant decrease in accuracy (Fig. 5e). This is because that the detecting area undergoes magnification or demagnification owing to changes in the numerical aperture. Figure 5f displays intensity patterns at three distinct operating wavelengths, and it is noticeable that the location of the intensity peak shifts. This will lead to a change of the total light intensity in the detection area (white circle area), resulting in recognition errors. However, in the experiment, we can calibrate the position of the output layer to accommodate this change, as well as the positions of the detection areas. In this way, the chip can still demonstrate adaptability to function effectively at other wavelengths. Finally, the test’s error also encompasses the misalignment between the input layer and the DNN. As depicted in Fig. 5g, the accuracy of DNNs decreases as the input layer’s position shifts. Thus, it is crucial to guarantee a minimal alignment error between the input layer and DNNs. For our chip, the design of DNNs with binary phase modulation helps reduce the difficulty of alignment (“Methods” section).

**Fig. 5: Robustness analysis of the diffractive neural network (DNN) chip in fabrication and test sessions.**

Lifetime analysis of the DNN integrated on the quartz substrate

Because of the high melting point and high stability of quartz, the quartz-based optical element has an extremely long and even unlimited lifetime⁴². The reduction in the accuracy of the DNN during accelerated aging was studied. We designed and fabricated several bilayer DNN chips that recognize only two digits (0 and 1), and the accuracy reached 100% (Supplementary Fig. 13). Then, we placed the DNN chip in a box furnace for 2 h at 1400 °C in an air atmosphere. As shown in Fig. 6a, after annealing, the roughness of the surface of the DNN sample increased. The surface roughness of the DNN was R_a = 0.97 nm before annealing and R_a = 20.1 nm after annealing, thereby increasing the noise in the output layer (Fig. 6b). However, since the prediction results were obtained by comparing the total intensity in the target regions, we found that the increased noise did not affect the recognition results (Fig. 6c), and the DNN accuracy was still 100% for 50 handwritten digit images. After annealing, the mean profile spacing R_sm is only hundreds of nanometers, which is much smaller than the neuron size in the DNN (8 × 8 μm²). Therefore, the phase modulation shift in a neuron caused by the increased roughness after annealing is approximately 0. In our experiment, the degradation of the DNN samples at high temperatures is mainly due to the reaction between the quartz and the Al₂O₃ boat at high temperatures, resulting in the formation of Al₂O₃·SiO₂⁴³. Increasing the annealing time to 3 h completely damaged the samples; the sample shattered and could not be used for handwritten digital recognition.

**Fig. 6: Lifetime analysis of the diffractive neural network (DNN) integrated on the quartz substrate.**

The estimated lifetime of a device is typically determined based on the Arrhenius equation⁴²:

$$\frac{1}{\tau }=k=A\exp (-{E}_{{{{{{\rm{a}}}}}}}/{k}_{{{{{{\rm{B}}}}}}}T)$$

(1)

where k is the decay rate, A is the frequency factor, k_B is the Boltzmann constant, E_a is the activation energy and T is the absolute temperature. The room-temperature lifetime of a sample can be estimated by extrapolating the lifetime at high temperature to room temperature. We define the time for the DNN sample to be completely damaged as its lifetime. Therefore, as shown in Fig. 6d, the lifetime of the DNN chip at room temperature (300 K) is estimated to be 5.81 × 10⁴² s (1.84 × 10²³ trillion years). Even at a temperature of 500 K, the lifetime is still 7.71 × 10²³ s (2.44 × 10⁴ trillion years). The ultralong lifetime of the quartz-based DNN ensures that the chip can operate stably and reliably for a very long time. In addition to high temperatures, we also analyze the impact of other extreme environments on the chip (see Supplementary Note 5). Moreover, unexpected damage, beyond conventional aging, may occur during long-term chip operation. We simulated the loss of neuron information resulting from severe damage or wear. Supplementary Fig. 14 shows that the DNN chip can maintain its highest performance when the damaged area is below 20%. This is also a guarantee for the long-term reliable operation of our chip.

Discussion

In this work, a bilayer DNN chip was integrated on a quartz plate. Our approach based on semiconductor manufacturing technology establishes a more commercial and mature integration solution for DNNs with 3D structures. We present an in-depth analysis of the effect of increasing the number of layers and layer spacing on the DNN performance. The robustness of the chip in fabrication and testing is analyzed by simulations. The quartz-based DNN is verified to have an ultralong lifetime and high-performance stability. Thus, it is suitable for long-term operation in various extreme environments, such as strong radiation environments (outer space) and high-pressure environments (deep sea). Other tasks demonstrated based on DNN, including but not limited to beam shaping²⁴, logical computing²¹, and data downscaling⁴⁴, could also be theoretically performed using our chip design. This expands the range of potential applications.

We note that non-fully connection constrains the performance of DNN. Therefore, reducing the neuron size or designing wafers with better thickness are important directions for improvement. Additionally, the number of layers in a DNN chip can restrict accuracy improvement. To increase the number of layers, existing bonding techniques can be utilized. Bonding technology allows for the joining of multiple substrates. Laser bonding techniques may be a viable solution for quartz substrates^45,46. The process can fuse specific areas of quartz, allowing for the selective bonding of two quartz plates. Combined with the double-sided photolithography to realize the alignment of diffractive layers, it is feasible to achieve more diffractive layers. Currently, the lack of nonlinear activation functions between the diffractive layers is a common bottleneck for DNNs, resulting in lower performance compared to deep neural networks. Therefore, it is urgent to solve the problem of inserting nonlinear optical layers between diffractive layers. Our strategy offers a solution by providing the possibility to insert a nonlinear active layer between the diffractive layers. Due to the high stability of quartz, it is possible to deposit nonlinear absorbing materials on its surface. This allows the absorption coefficient of the material to be incorporated into the design of the DNN, resulting in a truly deep DNN. For this purpose, some advanced nonlinear materials, such as two-dimensional materials⁴⁷ and perovskite materials⁴⁸, can be considered.

Finally, to further achieve the integration of the DNN system, there are pioneering endeavors to learn from. The DNNs chips can be integrated with camera chips^22,49. In addition, researchers have recently reported the integration of a DNN chip and an electrical neural network chip, demonstrating an analog programmable optoelectronic chip⁴⁴. Besides, we can also consider building a 3D-integrated DNN system based on the vertical-cavity surface-emitting lasers (VCSELs)²⁹. VCSELs are micron-sized on-chip light sources that emit light perpendicular to the substrate⁵⁰, so VCSEL can be used to generate optical images by constructing a two-dimensional addressable array. The DNNs can be directly integrated on the surface of VCSELs through heterogeneous bonding⁵¹. The detector array can also be integrated on DNNs with a similar method. In this case, it is technically possible to implement a fully system-integrated DNN chip.

Methods

Fabrication of the bilayer DNN

Phase modulation in the diffractive layer is achieved by etching pixels with different depths on the quartz wafer based on the equation φ(λ) = 2π(n-1)h/λ⁵², where φ(λ) is the target phase modulation of light with a wavelength of λ, n is the refractive index (1.46) of quartz and h is the etching depth. The fabrication process of the diffractive layer is shown in Supplementary Fig. 15. The etching process is based on the conventional semiconductor manufacturing process. The photolithography equipment is a SUSS MA6 UV lithography machine. Dry etching was carried out with a SENTECH inductively coupled plasma etching system with SF₆ gas flow. When fabricating the diffractive layer on the back side of the quartz wafer, double-sided photolithography technology was used to align the two-sided pattern.

Characterization and annealing of the DNN

The SEM images were acquired with ZEISS Gemini SEM 300 equipment. The height profile of the DNN sample surface was obtained with a Bruker step meter. The AFM images were acquired with a Bruker Dimension Icon microscope. The sample was annealed in an HF-Kejing KSL-1700X box furnace in an air atmosphere.

Alignment of the input layer and DNNs

In our experiment, we were able to observe the unmodulated optical digital image and the ten light spots generated by the diffraction of DNNs simultaneously on the CCD camera due to binary phase modulation. The digital image indicates the position of the input layer, while the ten light spots indicate the position of the DNNs. Therefore, alignment was achieved by observing their relative positions on the CMOS camera.

Data availability

The data that support the findings of this study are available on request from the corresponding authors.

Code availability

The code that supports the findings of this study is available on request from the corresponding authors.

References

Chen, H. et al. Pre-trained image processing transformer. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (IEEE, 2021).
Grigorescu, S., Trasnea, B., Cocias, T. & Macesanu, G. A survey of deep learning techniques for autonomous driving. J. Field Robot. 37, 362–386 (2020).
Article Google Scholar
Jia, Y. et al. A knowledge-inherited learning for intelligent metasurface design and assembly. Light Sci. Appl. 12, 82 (2023).
Article Google Scholar
Shou, Y., Feng, Y., Zhang, Y., Chen, H. & Qian, H. Deep learning approach based optical edge detection using ENZ layers. Prog. Electromagn. Res. 175, 81–89 (2022).
Article Google Scholar
Lin, D. et al. Inverse-designed multi-level diffractive doublet for wide field-of-view imaging. ACS Photonics 10, 2661–2669 (2023).
Article Google Scholar
OpenAI. GPT-4 technical report. Preprint at https://arxiv.org/abs/2303.08774v3 (2023).
Torresen, J. A review of future and ethical perspectives of robotics and AI. Front. Robot. AI 4, 75 (2018).
Guo, X., Xiang, J., Zhang, Y. & Su, Y. Integrated neuromorphic photonics: synapses, neurons, and neural networks. Adv. Photonics Res. 2, 2000212 (2021).
Article Google Scholar
Tan, Q., Qian, C., Cai, T., Zheng, B. & Chen, H. Solving multivariable equations with tandem metamaterial kernels. Prog. Electromagn. Res. 175, 139–147 (2022).
Article Google Scholar
Huang, C. et al. Prospects and applications of photonic neural networks. Adv. Phys. X 7, 1981155 (2022).
Google Scholar
Zhou, H. et al. Photonic matrix multiplication lights up photonic accelerator and beyond. Light: Sci. Appl. 11, 30 (2022).
Article Google Scholar
Luo, M. et al. Ultra-compact optical convolutional accelerators based on polarization-independent metasurfaces. In CLEO 2023 (Optica Publ. Group, 2023).
Fu, W. et al. Ultracompact meta-imagers for arbitrary all-optical convolution. Light Sci. Appl. 11, 62 (2022).
Article Google Scholar
Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004 (2018).
Article MathSciNet Google Scholar
Chen, H. et al. Diffractive deep neural networks at visible wavelengths. Engineering 7, 1483–1491 (2021).
Article Google Scholar
Zhou, T. et al. Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit. Nat. Photonics 15, 367–373 (2021).
Article Google Scholar
Goi, E. et al. Nanoprinted high-neuron-density optical linear perceptrons performing near-infrared inference on a CMOS chip. Light Sci. Appl. 10, 40 (2021).
Article Google Scholar
Liu, C. et al. A programmable diffractive deep neural network based on a digital-coding metasurface array. Nat. Electron. 5, 113–122 (2022).
Article Google Scholar
Qian, C. et al. Dynamic recognition and mirage using neuro-metamaterials. Nat. Commun. 13, 2694 (2022).
Article Google Scholar
Li, Z. et al. Event-based diffractive neural network chip for dynamic action recognition. Opt. Laser Technol. 169, 110136 (2024).
Article Google Scholar
Qian, C. et al. Performing optical logic operations by a diffractive neural network. Light Sci. Appl. 9, 59 (2020).
Article Google Scholar
Goi, E., Schoenhardt, S. & Gu, M. Direct retrieval of Zernike-based pupil functions using integrated diffractive deep neural networks. Nat. Commun. 13, 7531 (2022).
Article Google Scholar
Lu, H. et al. Eye accommodation-inspired neuro-metasurface focusing. Nat. Commun. 14, 3301 (2023).
Article Google Scholar
Veli, M. et al. Terahertz pulse shaping using diffractive surfaces. Nat. Commun. 12, 37 (2021).
Article Google Scholar
Ashtiani, F., Geers, A. J. & Aflatouni, F. An on-chip photonic deep neural network for image classification. Nature 606, 501–506 (2022).
Article Google Scholar
Feldmann, J. et al. Parallel convolutional processing using an integrated photonic tensor core. Nature 589, 52–58 (2021).
Article Google Scholar
Feldmann, J., Youngblood, N., Wright, C. D., Bhaskaran, H. & Pernice, W. H. P. All-optical spiking neurosynaptic networks with self-learning capabilities. Nature 569, 208–214 (2019).
Article Google Scholar
Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441–446 (2017).
Article Google Scholar
Gu, M., Dong, Y., Yu, H., Luan, H. & Zhang, Q. Perspective on 3D vertically-integrated photonic neural networks based on VCSEL arrays. Nanophotonics 12, 827–832 (2023).
Article Google Scholar
Luo, Y. et al. Design of task-specific optical systems using broadband diffractive neural networks. Light Sci. Appl. 8, 112 (2019).
Article Google Scholar
Rahman, M. S. S., Li, J., Mengu, D., Rivenson, Y. & Ozcan, A. Ensemble learning of diffractive optical networks. Light Sci. Appl. 10, 14 (2021).
Article Google Scholar
Jia, W., Lin, D. & Sensale-Rodriguez, B. Machine learning enables multi-degree-of-freedom reconfigurable terahertz holograms with cascaded diffractive optical elements. Adv. Opt. Mater. 11, 2202538 (2023).
Article Google Scholar
Luo, Y. et al. Computational imaging without a computer: seeing through random diffusers at the speed of light. eLight 2, 4 (2022).
Article Google Scholar
Fu, T. et al. On-chip photonic diffractive optical neural network based on a spatial domain electromagnetic propagation model. Opt. Express 29, 31924–31940 (2021).
Article Google Scholar
Yan, T. et al. All-optical graph representation learning using integrated diffractive photonic computing units. Sci. Adv. 8, eabn7630 (2022).
Article Google Scholar
Zhu, H. H. et al. Space-efficient optical computing with an integrated chip diffractive neural network. Nat. Commun. 13, 1044 (2022).
Article Google Scholar
Fu, T. et al. Photonic machine learning with on-chip diffractive optics. Nat. Commun. 14, 70 (2023).
Article Google Scholar
Goodman, J. W. Introduction to Fourier Optics (Roberts and Company Publ., 2005).
Lecun, Y., Bottou, L., Bengio, Y. & Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998).
Article Google Scholar
Luan, H. et al. 768-ary Laguerre-Gaussian-mode shift keying free-space optical communication based on convolutional neural networks. Opt. Express 29, 19807–19818 (2021).
Article Google Scholar
Xiao, H., Rasul, K. & Vollgraf, R. J. Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms. Preprint at https://arxiv.org/abs/1708.07747 (2017).
Zhang, J., Gecevičius, M., Beresna, M. & Kazansky, P. G. Seemingly unlimited lifetime data storage in nanostructured glass. Phys. Rev. Lett. 112, 033901 (2014).
Article Google Scholar
Zhongping, Z., Tao, J., Guanghui, L., Yufeng, G. & Yongbin, Y. Thermodynamics of reactions among Al₂O₃, CaO, SiO₂ and Fe₂O₃ during roasting processes. In Thermodynamics (ed. Juan Carlos M.-P.) (IntechOpen, 2011).
Chen, Y. et al. All-analog photoelectronic chip for high-speed vision tasks. Nature 623, 48–57 (2023).
Article Google Scholar
Huang, H., Yang, L.-M. & Liu, J. Ultrashort pulsed fiber laser welding and sealing of transparent materials. Appl. Opt. 51, 2979–2986 (2012).
Article Google Scholar
Zimmermann, F., Richter, S., Döring, S., Tünnermann, A. & Nolte, S. Ultrastable bonding of glass with femtosecond laser bursts. Appl. Opt. 52, 1149–1154 (2013).
Article Google Scholar
Zhang, X.-L. et al. Transient thermal effect, nonlinear refraction and nonlinear absorption properties of graphene oxide sheets in dispersion. Opt. Express 21, 7511–7520 (2013).
Article Google Scholar
Wei, T.-C. et al. Nonlinear absorption applications of CH₃NH₃PbBr₃ perovskite crystals. Adv. Funct. Mater. 28, 1707175 (2018).
Article Google Scholar
Luo, X. et al. Metasurface-enabled on-chip multiplexed diffractive neural networks in the visible. Light Sci. Appl. 11, 158 (2022).
Article Google Scholar
Dong, Y. et al. Nanoprinted diffractive-layer-integrated vertical-cavity surface-emitting vortex lasers with scalable topological charge. Nano Lett. 23, 9096–9104 (2023).
Article Google Scholar
Okumura, K., Higurashi, E., Suga, T. & Hagiwara, K. Low-temperature GaAs/SiC wafer bonding with Au thin film for high-power semiconductor lasers. In 2014 International Conference on Electronics Packaging (ICEP) (IEEE, 2014).
Lim, K. T. P., Liu, H. L., Liu, Y. J. & Yang, J. K. W. Holographic colour prints for enhanced optical security by combined phase and amplitude control. Nat. Commun. 10, 25 (2019).
Article Google Scholar

Download references

Acknowledgements

We would like to acknowledge support from the National Key Research and Development program of China (2022YFB2804301), the Science and Technology Commission of Shanghai Municipality (Grant No. 21DZ1100500), the Shanghai Municipal Science and Technology Major Project, the Shanghai Frontiers Science Center Program (2021–2025 No. 20), the Shanghai Rising-Star Program (20QA1404100), the Shanghai Sailing Program (23YF1429500) and the National Natural Science Foundation of China (Nos. 11974247, 62005164 and 62005166). This work was also sponsored by the Shuguang Program (23SG41) and Chenguang Program (No. 20CG54) supported by the Shanghai Education Development Foundation and Shanghai Municipal Education Commission.

Author information

These authors contributed equally: Yibo Dong, Dajun Lin.

Authors and Affiliations

Institute of Photonic Chips, University of Shanghai for Science and Technology, Shanghai, 200093, China
Yibo Dong, Dajun Lin, Long Chen, Baoli Li, Xi Chen, Qiming Zhang, Haitao Luan, Xinyuan Fang & Min Gu
Centre for Artificial-Intelligence Nanophotonics, School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai, 200093, China
Dajun Lin & Long Chen

Authors

Yibo Dong
View author publications
You can also search for this author in PubMed Google Scholar
Dajun Lin
View author publications
You can also search for this author in PubMed Google Scholar
Long Chen
View author publications
You can also search for this author in PubMed Google Scholar
Baoli Li
View author publications
You can also search for this author in PubMed Google Scholar
Xi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Qiming Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Haitao Luan
View author publications
You can also search for this author in PubMed Google Scholar
Xinyuan Fang
View author publications
You can also search for this author in PubMed Google Scholar
Min Gu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.B.D and D.J.L contributed equally to this work. Y.B.D. conceived the idea. H.T.L., X.Y.F., and M.G. supervised the project. D.J.L. designed the DNN and organized the experimental results. Y.B.D. fabricated the DNN chip and carried out material characterizations. D.J.L., L.C. Y.B.D, and B.L.L. tested the DNN chip. X.C. and Q.M.Z. helped analyze the results. Y.B.D. and D.J.L. wrote the first draft of the paper. H.T.L., X.Y.F., and M.G. revised the manuscript.

Corresponding authors

Correspondence to Haitao Luan, Xinyuan Fang or Min Gu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Engineering thanks Fei Xia, Peter Kazansky, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Chaoran Huang and Anastasiia Vasylchenkova. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dong, Y., Lin, D., Chen, L. et al. Compact eternal diffractive neural network chip for extreme environments. Commun Eng 3, 64 (2024). https://doi.org/10.1038/s44172-024-00211-6

Download citation

Received: 24 October 2023
Accepted: 19 April 2024
Published: 01 May 2024
DOI: https://doi.org/10.1038/s44172-024-00211-6