Reusability report: Predicting spatiotemporal nonlinear dynamics in multimode fibre optics with a recurrent neural network

Teğin, Uğur; Dinç, Niyazi Ulaş; Moser, Christophe; Psaltis, Demetri

doi:10.1038/s42256-021-00347-6

Download PDF

Matters Arising
Published: 13 May 2021

Reusability report: Predicting spatiotemporal nonlinear dynamics in multimode fibre optics with a recurrent neural network

Nature Machine Intelligence volume 3, pages 387–391 (2021)Cite this article

4925 Accesses
22 Citations
6 Altmetric
Metrics details

Subjects

The Original Article was published on 18 February 2021

arising from L. Salmela et al. Nature Machine Intelligence https://www.nature.com/articles/s42256-021-00297-z (2021)

With their internal memory, recurrent neural networks can be used to learn and predict time-dependent behaviours. In their recent work, Salmela et al.¹ present a recurrent neural network architecture to learn and predict complex nonlinear propagation in an optical fibre based on the input pulse intensity profile in the time domain. Here, we use their model by extending it to the case of spatiotemporal nonlinear propagation for an arbitrary number of modes in graded-index multimode fibres. In addition to the original work’s focus on predicting the temporal evolution of pulses, we show that the method is applicable for modelling and predicting spatial beam propagation incorporating nonlinear mode coupling.

The demonstrated method of Salmela et al.¹ can be an alternative solution to time-consuming and computationally heavy nonlinear pulse propagation simulations. In essence, the method can accurately reproduce the complex nonlinear evolution governed by the nonlinear Schrödinger equation (NLSE) via employing long short-term memory (LSTM) nodes in an artificial neural network. Such a network architecture is capable of modelling sequential dependencies. Salmela et al.¹ tested their model for pulse compression and ultra-broadband supercontinuum generation. They were able to accurately predict temporal and spectral evolutions of ultrashort pulses in a highly nonlinear fibre. Using the same neural network architecture, we trained the network to predict the spatiotemporal evolution of ultrashort pulses. In this study, we hypothesized that their recurrent neural network might be suitable to predict the spatiotemporal field evolution, given that it successfully predicted the temporal physical dynamics that is governed by the same NLSE equation that describes the spatial domain as well. Given that the NLSE is also applicable to other physical systems, it may be possible to use a generic, normalized form of the NLSE, for example, in Bose−Einstein condensation, hydrodynamics and plasma physics².

Spatiotemporal nonlinearities and simulations

The study in ref. ¹ focuses on a single-mode fibre with spectral or temporal nonlinear evolution of pulses in propagation axis by computing 2-dimensional (2D, with 1 spatial coordinate +1 time coordinate) simulations. Further, the authors apply their method to a step-index multimode fibre by computing the propagation of five modes of the investigated fibre by following a similar (1 + 1)D simulation and incorporating mode coupling by a matrix product, calculated as the overlap integrals of the modes of interest. In this study, we change the medium from step-index to graded-index multimode fibre and compute 4D (3 spatial coordinates +1 time coordinate)D simulations where the interaction of all the available modes of the fibre fuses naturally since all the contributing spatiotemporal degrees of freedom in the NLSE are included.

With relatively low modal dispersion and periodic self-imaging, graded-index multimode fibres are of important interest for nonlinear optics, imaging and telecommunications studies. In recent years, various interesting nonlinear dynamics such as spatiotemporal instability^3,4, dispersive wave generation⁵, graded-index solitons^6,7, self-beam cleaning⁸, nonlinear pulse compression⁹, and supercontinuum generation^10,11 have been reported. In addition to the aforementioned single-pass dynamics, spatiotemporal mode-locked lasers^12,13,14 have been realized, thanks to the low-modal-dispersion pulse propagation in graded-index multimode fibres. Using a spatial light modulator, learning and controlling nonlinear optical dynamics in graded-index multimode fibres was demonstrated by modifying the spatial properties of the intense pump pulse^15,16. Recently, spatiotemporal nonlinear interactions in a graded-index multimode fibre were introduced as an optical computing engine, which performed well on a range of machine learning tasks, from classifying COVID-19 X-ray lung images and speech recognition to predicting age from images of faces¹⁷.

Numerical analysis is required to understand the underlying complex spatiotemporal dynamics of pulse propagation in a multimode fibre. The most important challenge in multimode fibre simulations is the addition of spatial degrees of freedom. In a single-mode fibre simulation, there is only the time domain grid to establish and then propagation can be implemented, for instance, by using split-step Fourier simulations, which has a low computational cost because 1D Fourier transforms are computed in every step. In multimode fibres, there exist multiple propagating modes having different spatial distributions. Hence, transverse dimensions must be included to describe a pulse that requires a sampling grid in two dimensions X and Y in addition to time. Hence, 3D Fourier transforms (two spatial dimensions in the transverse plane and one dimension in time) must be computed at every step taken along the propagation direction Z to provide a (3 + 1)D simulation. This is computationally costly and time-consuming. To overcome the computational load of (3 + 1)D beam propagation simulations, mode-resolved simulation methods based on pre-calculated nonlinear mode coupling have been proposed^18,19. However, mode-resolved simulations are time-efficient for fewer than 10 modes and a low number of modes may not give an accurate picture of the spatiotemporal nonlinear propagation in a fibre of more than 200 modes. In this regard, the work by Salmela et al.¹ enables a faster computation scheme when the neural network is trained²⁰.

In our study, we first tested the neural network presented in ref. ¹ by generating a dataset using a numerically computed fibre output using the (3 + 1)D split-step Fourier method that considers the interaction of all available fibre modes. We call this the time-dependent beam-propagation method (TD-BPM). We implemented a graphics processing unit (GPU) parallelized TD-BPM in Python to generate the dataset. To remain loyal to the original approach, we integrated the intensity of the TD-BPM outputs in the spatial domain to obtain only the time-domain evolution. The performance of the network in the time domain (but with spatial integrated modes) is illustrated in the ‘Temporal results’ section. The ‘Spatial results’ section shows how well the network predicts the intensity profile along the propagation from the time-integrated data. Owing to the network architecture, the data is fed after a dimension reduction by time-averaging or space-averaging. Nevertheless, the spatiotemporal effects are still inherited in the reduced data where each RNN model is able to capture it.

Results

Temporal results

The datasets generated by the aforementioned TD-BPM contain 1,000 examples of spatiotemporal nonlinear propagation of femtosecond pulses (see Supplementary Discussion 1 for details). Following the original work and using the sample code, with a small modification to increase the number of nodes in each layer from 250 to 500, we trained and tested spectral and temporal nonlinear propagation in a graded-index multimode fibre. Each dataset is split into 950 propagation samples for training and 50 propagation samples for testing. During the training, at each epoch, training data is split randomly with 9 to 1 ratio to generate the validation set, which is repeated for every training process in this study. The TD-BPM-generated data is first converted to logarithmic scale and normalized. The evolutions of the mean absolute error metric for training the networks are presented in Supplementary Discussion 4.

Similar to the work by Salmela et al.¹, we tested the recurrent neural network for stepwise and complete propagation predictions in the frequency and time domain but in a multimode fibre. The best performance of the neural network is observed for stepwise predictions. The stepwise performances of the network for spectral and temporal data are presented in Supplementary Fig. 1 and Supplementary Fig. 2. For the complete propagation predictions, using only the injected pulse profile leads to accumulated errors; however, as shown in Figs. 1 and 2, the difference between the TD-BPM (ground truth) and the predictions are small and in an acceptable range.

**Fig. 1: An example of the spectral intensity evolution of a high-power femtosecond pulse in a graded-index multimode fibre.**

**Fig. 2: An example of temporal intensity evolution of a high-power femtosecond pulse in a graded-index multimode fibre.**

Spatial results

The dataset is generated by integrating the outputs of TD-BPM in the time domain to generate spatial-domain-only intensity distributions. A graded-index fibre with a 50-µm core diameter, supporting 240 modes at 1,030 nm wavelength, is digitally created. 1,000 different propagation cases are generated by having different spatial excitations at the fibre input. LP₀₁, LP₀₂, LP₀₃, LP₁₁, LP₁₂, and LP₂₁ modes are superposed with random coefficients while keeping the peak power fixed at 1 GW to encourage nonlinear inter-modal coupling within a short fibre length that is chosen to be ten times the self-imaging period of the graded-index multimode fibre (see Supplementary Discussion 1 for further details on data generation). We note that the field launched is limited to 6 modes. However, the modes can couple into the higher-order modes of the GRIN fibre (240 available modes) upon propagation due to mode coupling. The dataset is divided into training data (950 samples) and testing data (50 samples). The data is down-sampled to 32 by 32 pixels on the transverse x and y axes and 120 steps on the z axis. The 2D spatial information is converted to a 1D array of 1,024 elements to employ the original network architecture that accepts 1D intensity profiles. The LSTM and dense layer node number is set to 1,000 and window size, which is the number of previous steps introduced in LSTM, is set to 15 (instead of 10 used by Salmela et al.¹). In Fig. 3, the prediction of the trained network on a test data is shown, with the X−Z propagation profile and X−Y transverse profiles of the first, middle and last steps, along with the corresponding TD-BPM results that serve as the ground truth.

**Fig. 3: An example spatial intensity evolution of a 1-GW femtosecond pulse in a graded-index multimode fibre.**

Discussion

During our study, we compared the simulation runtimes between the TD-BPM and that of the recurrent neural network architecture for training and inference. The required training time for the recurrent neural network is comparable to the data-generation time of the TD-BPM, which is around 50 min for 1,000 samples (the technical details are provided in Supplementary Discussion 1). On the other hand, as anticipated, the inference time of the recurrent neural network is more than 40 times faster than TD-BPM for single-pass pulse propagation with graphics-card-based parallel processing on an Nvidia Tesla V100 GPU.

Temporal results show that the network successfully infers the time evolution of a pulse. Since a different simulation method (TD-BPM instead of mode-resolved, as used in the original work) and medium (graded-index multimode fibre instead of single mode and step-index multimode fibre, as used in the original work) are chosen in this study, we can state that the proposed architecture is capable of grasping the NLSE-governed dynamics without relying on a particular method to generate training data. Owing to the selected pulse parameters (duration and central wavelength), the dataset contains supercontinuum generation from self-phase modulation and spatiotemporal instability^3,4. As shown in the ‘Results’ section, the neural network can predict the separate and combined spatiotemporal instability peaks, which occur at around 632 nm and 768 nm, respectively, remarkably well.

The proposed architecture is able to predict spatial propagation decently as demonstrated in Fig. 3. However, a important amount of error is also present, which is higher than the error obtained in temporal-only predictions. In Supplementary Discussion 3, we investigated a simpler spatial scenario where the input distribution is fixed as a doughnut shape and the pulse power is varied on the order of MW to have relatively mild nonlinear interactions. This scenario yielded less mean absolute error compared to the results provided in Fig. 3, where the spatial distribution of the input field is varied and pulse power is set to GW to enable more nonlinear interactions. This comparison hints that a degradation in the performance of prediction occurs as the variations within the dataset and the strength of nonlinear interaction increase. The main cause of this performance issue may arise from the intensity-only nature of the implemented neural network architecture. Physically, the field evolution is a product of the intensity and phase changes in time and space. However, in this architecture, the network is forced to learn the nonlinear propagation of a complex field without having the phase information. Even so, the network mimics the overall propagation trend, which is quite an achievement given the fact that half of the information required is not provided. This is because in the dataset the complex field evolution is generated by including the effects of all the dimensional degrees of freedom. The dimensional reduction by time-averaging and having the squared norm to convert the complex field into intensity does not completely erase the trace of the complex higher-dimensional field evolution because these traces manifest themselves in the intensity evolution. Verification of this point can be found in Supplementary Discussion 5, where the recurrent neural network is trained by phase-only varying input fields.

Another important factor is the resilience to under-sampling. Considering the low discretization in the inference, it is straightforward to say that the recurrent neural network is more flexible in terms of sampling constraints. However, this advantage is a result of the training phase where the data is generated with appropriately sampled simulation frames. The accurate data is then down-sampled and provided to the network. In the case of under-sampled training data that contains sudden pixel-to-pixel jumps, the trained network fails to model NLSE and yields unrelated predictions for the propagation.

Future directions

There are two main directions in which to expand the scope of the proposed recurrent neural network architecture: introducing spatiotemporal characteristics together in the network instead of decoupling the space and time information of the pulse as well as a network capable of handling complex fields. Neural network architectures that deal with complex fields have already been presented, such as a neural network that decomposes the output field into LP modes^21,22. With a similar scheme, the network could accept transverse complex fields and the time domain information in a (2 + 1)D fashion to perform the nonlinear evolution step by step in the propagation direction. With such augmented dimensionality, 2D and 3D convolutional layers could replace the fully connected layers before and after the LSTM, because light propagation is governed by convolution with a diffraction kernel.

Data availability

The data used in this paper is available at the following GitHub repository https://github.com/ugurtegin/MMF_RNN_Reuse.

Code availability

The code used in this paper is available at the following GitHub repository https://github.com/ugurtegin/MMF_RNN_Reuse.

References

Salmela, L. et al. Predicting ultrafast nonlinear dynamics in fibre optics with a recurrent neural network. Nat. Mach. Intell. 3, 1–11 (2021).
Article Google Scholar
Rogel-Salazar, J. The Gross–Pitaevskii equation and Bose–Einstein condensates. Eur. J. Phys. 34, 247 (2013).
Article Google Scholar
Krupa, K. et al. Observation of geometric parametric instability induced by the periodic spatial self-imaging of multimode waves. Phys. Rev. Lett. 116, 183901 (2016).
Article Google Scholar
Teğin, U. & Ortaç, B. Spatiotemporal instability of femtosecond pulses in graded-index multimode fibers. IEEE Photon. Technol. Lett. 29, 2195–2198 (2017).
Article Google Scholar
Wright, L. G., Christodoulides, D. N. & Wise, F. W. Controllable spatiotemporal nonlinear effects in multimode fibres. Nat. Photon. 9, 306–310 (2015).
Article Google Scholar
Renninger, W. H. et al. Optical solitons in graded-index multimode fibres. Nat. Commun. 4, 1719 (2013).
Article Google Scholar
Ahsan, A. S. & Agrawal, G. P. Graded-index solitons in multimode fibers. Opt. Lett. 43, 3345–3348 (2018).
Article Google Scholar
Krupa, K. et al. Spatial beam self-cleaning in multimode fibres. Nat. Photon. 11, 237–241 (2017).
Article Google Scholar
Krupa, K. et al. Spatiotemporal light-beam compression from nonlinear mode coupling. Phys. Rev. A 97, 043836 (2018).
Article Google Scholar
Lopez-Galmiche, G. et al. Visible supercontinuum generation in a graded index multimode fiber pumped at 1064 nm. Opt. Lett. 41, 2553–2556 (2016).
Article Google Scholar
Teğin, U. & Ortaç, B. Cascaded Raman scattering based high power octave-spanning supercontinuum generation in graded-index multimode fibers. Sci. Rep. 8, 1–7 (2018).
Article Google Scholar
Wright, L. G., Christodoulides, D. N. & Wise, F. W. Spatiotemporal mode-locking in multimode fiber lasers. Science 358, 94–97 (2017).
Article Google Scholar
Teğin, U., Kakkava, E., Rahmani, B., Psaltis, D. & Moser, C. Spatiotemporal self-similar fiber laser. Optica 6, 1412–1415 (2019).
Article Google Scholar
Teğin, U., Rahmani, B., Kakkava, E., Psaltis, D. & Moser, C. Single-mode output by controlling the spatiotemporal nonlinearities in mode-locked femtosecond multimode fiber lasers. Adv. Photon. 2, 1–8 (2020).
Article Google Scholar
Tzang, O., Caravaca-Aguirre, A. M., Wagner, K. & Piestun, R. Adaptive wavefront shaping for controlling nonlinear multimode interactions in optical fibres. Nat. Photon. 12, 368–374 (2018).
Article Google Scholar
Teǧin, U. et al. Controlling spatiotemporal nonlinearities in multimode fibers with deep neural networks. APL Photon. 5, 030804 (2020).
Article Google Scholar
Teğin, U., Yıldırım, M., Oğuz, İ., Moser, C. & Psaltis, D. Scalable optical learning operator. Preprint at https://arxiv.org/abs/2012.12404 (2020).
Poletti, F. & Horak, P. Dynamics of femtosecond supercontinuum generation in multimode fibers. Opt. Express 17, 6134–6147 (2009).
Article Google Scholar
Mafi, A. Pulse propagation in a short nonlinear graded-index multimode optical fiber. J. Lightwave Technol. 30, 2803–2811 (2012).
Article Google Scholar
Genty, G. et al. Machine learning and applications in ultrafast photonics. Nat. Photon. 15, 1–11 (2020).
Google Scholar
An, Y. et al. Learning to decompose the modes in few-mode fibers with deep convolutional neural network. Opt. Express 27, 10127–10137 (2019).
Article Google Scholar
Rothe, S., Zhang, Q., Koukourakis, N. & Czarske, J. W. Deep learning for computational mode decomposition in optical fibers. Appl. Sci. 10, 1367 (2020).
Article Google Scholar

Download references

Acknowledgements

We thank M. Yıldırım for discussions.

Author information

Authors and Affiliations

Optics Laboratory, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
Uğur Teğin, Niyazi Ulaş Dinç & Demetri Psaltis
Laboratory of Applied Photonics Devices, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
Uğur Teğin, Niyazi Ulaş Dinç & Christophe Moser

Authors

Uğur Teğin
View author publications
You can also search for this author in PubMed Google Scholar
Niyazi Ulaş Dinç
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Moser
View author publications
You can also search for this author in PubMed Google Scholar
Demetri Psaltis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

U.T. and N.U.D. performed simulations; C.M and D.P. supervised and directed the project. All the authors participated in the analysis of the data and the writing process of the manuscript.

Corresponding authors

Correspondence to Uğur Teğin or Niyazi Ulaş Dinç.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Machine Intelligence thanks Yichen Wu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figures 1–6 and Supplementary Discussion

Rights and permissions

Reprints and permissions

About this article

Cite this article

Teğin, U., Dinç, N.U., Moser, C. et al. Reusability report: Predicting spatiotemporal nonlinear dynamics in multimode fibre optics with a recurrent neural network. Nat Mach Intell 3, 387–391 (2021). https://doi.org/10.1038/s42256-021-00347-6

Download citation

Received: 27 February 2021
Accepted: 15 April 2021
Published: 13 May 2021
Issue Date: May 2021
DOI: https://doi.org/10.1038/s42256-021-00347-6

This article is cited by

Real-time observation of optical rogue waves in spatiotemporally mode-locked fiber lasers
- Uğur Teğin
- Peng Wang
- Lihong V. Wang
Communications Physics (2023)
Artificial Intelligence-Enabled Mode-Locked Fiber Laser: A Review
- Qiuying Ma
- Haoyang Yu
Nanomanufacturing and Metrology (2023)
Fiber laser development enabled by machine learning: review and prospect
- Min Jiang
- Hanshuo Wu
- Pu Zhou
PhotoniX (2022)