Introduction

Perception of the surrounding environment is a crucial task for animals, enabling them to detect external stimuli and react to them. Light, sound, gravity, touch, and chemicals are converted into encoded spiking signals by dedicated apparatus and then interpreted by the brain1. Because of its high energy efficiency and intrinsic error tolerance, the human brain provides inspiring novel paradigms to achieve better computational performance2,3,4,5,6. In this framework, the auditory system has gained strong attention due to its remarkable features: for instance, sound can reach our ears from all possible directions in space and can be perceived at any time, whether we are awake or asleep. Moreover, sound processing is not performed by the spatial arrangement of sensory-afferent neurons, as in retinotopic or somatosensory maps; rather, it is internally processed by the auditory system thanks to an internal representation of physical features1,7,8. As a result, the auditory system does not rely on the spatial position of the source, unlike vision, where different light rays are focused on different sensors in the retina. Instead, sound is processed via mechanical vibrations that are purely temporal signals. The frequency of natural sounds perceived by mammals usually spans from tens of Hz to tens of kHz, covering about three orders of magnitude. To classify these signals, a spatial representation of this broad range of temporal features is needed. The cochlea solves this task by realizing a tonotopic map of the incoming signals, i.e., a mapping of different frequency components along logarithmically spaced positions of the cochlear channel1,9,10. Emulating this kind of spatiotemporal signal processing through simple and scalable hardware remains an open challenge for neuromorphic computing11.

Resistive switching memory (RRAM) devices have attracted strong interest for their ability to implement artificial neurons and synapses in high-density, energy-efficient artificial neural networks12,13,14,15,16,17. However, the main computing approach adopted in state-of-the-art systems relies on the spatial arrangement of neural elements, i.e., spatial coding. In these systems, the capability to capture the temporal component is introduced through complex auxiliary CMOS circuitry and sophisticated temporal encoding of the programming pulses, thus losing advantages in terms of area occupation, energy efficiency, and biological plausibility. This is because RRAM devices are used as static first-order memristors that are unable to directly cope with spatiotemporal signals, merely playing the role of static memory for mapping the weights of neural networks18. To address this limitation and enable device-level computation over time and frequency, it becomes imperative to explore innovative materials and methodologies within the growing set of memristive devices. Such advancements hold substantial promise for enhancing spatiotemporal pattern recognition by exploiting the intrinsic dynamics of the device to capture the crucial temporal component that is otherwise missing. Moreover, temporal features generally cover broad scales, while current demonstrations of integrated neurons and synapses mostly operate on a limited linear scale due to physical limitations in the mechanisms of conductance change19,20,21. This is in contrast with the brain, which is capable of perceiving and classifying sound over a broad frequency range and in the presence of noisy signals22. By leveraging the dynamic, stochastic response of volatile memristors, we demonstrate a device-level spatial mapping of temporal spike signals on a logarithmic scale, where, similar to biological systems, the device volatility contributes to the system's ability to relax to a resting state, spontaneously becoming ready for a new computation. These characteristics serve as the fundamental ground for replicating the intricate audio-processing functions executed by the human brain.

Results

Stochastic switching of volatile RRAM devices

To enable time and rate computation, we adopted volatile RRAM with a one-transistor/one-resistor (1T1R) structure. Figure 1a shows a schematic illustration of the device, where the select transistor allows the maximum current flow to be controlled (see Supplementary Note 2 for MOSFET characteristics). The device relies on a switching layer made of hafnium oxide (HfOx) interposed between two metallic electrodes. Figure 1b shows the device structure, including a silver (Ag) active top electrode (TE) and a bottom electrode (BE) made of carbon (C). The RRAM device is initially in a high resistive state (HRS) due to the low electrical conductivity of the HfOx layer. The application of a relatively large positive voltage between TE and BE leads to the formation of a conductive filament (CF) of migrated Ag atoms and thus to a set transition to the low resistive state (LRS). When the voltage across the device is removed, the CF spontaneously dissolves after a suitable retention time, bringing the device back to the HRS23,24,25. Thanks to the spontaneous dissolution of the CF, and differently from non-volatile RRAM devices, our device does not need a reset phase and can therefore operate with unipolar voltages. Figure 1c shows the measured quasi-static current-voltage (I-V) curve of the RRAM devices, indicating the presence of a characteristic threshold voltage Vset to initiate Ag migration and build the CF, and a hold voltage Vhold to start the dissolution. The value of Vset can stochastically change from cycle to cycle due to the continuous rearrangement of the material structure at the interfaces and in the switching layer26. The cycle-to-cycle Vset variation generally obeys a normal distribution (see Supplementary Notes 3–4, where device-to-device variability is also reported). The Vset distribution describes the probability of a set transition upon application of a pulse with a specific voltage amplitude and duration. Voltage amplitudes high compared to the median of the threshold-voltage distribution result in a high switching probability, while low voltage amplitudes result in a low switching probability.
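For illustration only, a minimal Python sketch of this picture is given below: the per-pulse set probability is computed from a normally distributed Vset, with a hypothetical median and standard deviation standing in for the measured distribution (see Supplementary Notes 3–4).

```python
# Minimal sketch (not the calibrated device model): per-pulse switching
# probability from a normally distributed threshold voltage Vset.
# The median and standard deviation below are illustrative placeholders.
from scipy.stats import norm

VSET_MEDIAN = 2.0   # V, hypothetical median of the Vset distribution
VSET_SIGMA = 0.3    # V, hypothetical cycle-to-cycle standard deviation

def per_pulse_switch_probability(v_pulse: float) -> float:
    """Probability that a single pulse of amplitude v_pulse exceeds Vset."""
    return norm.cdf(v_pulse, loc=VSET_MEDIAN, scale=VSET_SIGMA)

for v in (1.5, 2.0, 2.5):
    print(f"V = {v:.1f} V -> P(set) ~ {per_pulse_switch_probability(v):.3f}")
```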

Fig. 1: Ag-based resistive random access memory.
figure 1

a 1T1R structure used in the experimental characterization: the programming pulses are applied to the top electrode (TE) while the bottom electrode (BE) is grounded and used to read the current. The transistor is placed in series to obtain better control of the maximum flowing current (see Supplementary Note 1 for the experimental setup). b Schematic description of the arrangement of the silver atoms inside hafnium oxide: in the low resistive state (LRS), or ON state, silver atoms build a bridge between the electrodes, resulting in a high conductance value. In the high resistive state (HRS), or OFF state, there is no conductive path between the top electrode and the bottom electrode, resulting in a low conductance value. c Quasi-static I-V curve of the device: when the applied voltage is above the threshold voltage, the device switches on and moves to the LRS. When the applied voltage is lower than the hold voltage, the RRAM cell relaxes to the HRS.

To characterize the switching probability of our RRAM devices in the pulsed regime, we applied the train of voltage spikes reported in Fig. 2a, with fixed pulse width Tpulse = 2.5 μs and a total duration of Twindow = 25 ms. Figure 2b shows the response current and the evolution of the device conductance G. After an initial phase where the current is zero, corresponding to the device being in the OFF state, the device switches to the ON state after a number of spikes equal to Nset = 38, marked by the onset of a current response and a transition to G = 8 μS. In this experiment, the waiting time between spikes was lower than the retention time, thus ensuring that the device remains in the ON state (see Supplementary Note 5). Figure 2c shows the current response for various spike voltage amplitudes, indicating that Nset (hence the switching time tON) decreases with increasing spike amplitude (details about the waveforms are reported in Supplementary Note 6). Figure 2d shows the average tON as a function of voltage amplitude and spike frequency with fixed Tpulse = 2.5 μs, indicating that tON decreases with increasing voltage amplitude and frequency. We can define the switching probability due to the application of a train of spikes as:

$$P_{\mathrm{switch}}=\frac{N_{\mathrm{trains|ON}}}{N_{\mathrm{trains}}}$$
(1)

where \(N_{\mathrm{trains|ON}}\) represents the count of applied trains that cause the device to switch to the ON state and \(N_{\mathrm{trains}}\) is the total number of applied trains (\(N_{\mathrm{trains|ON}}\le N_{\mathrm{trains}}\)). Note that this definition of switching probability does not refer to a single pulse, but rather to a train of pulses with amplitude V, frequency f, and duration Twindow, representing a more comprehensive and generalized framework for addressing switching probability. Figure 2e shows that Pswitch increases with the spiking frequency, while Fig. 2f shows that Pswitch increases with the applied voltage. Figure 2g summarizes the dependence of Pswitch on f and V. Note that the frequencies on the x-axis of Fig. 2e and on the y-axis of Fig. 2g are logarithmically spaced, spanning 3 orders of magnitude, in analogy with the pitch tonotopic classification in the human cochlea.
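As a hedged numerical illustration of Eq. (1), the sketch below estimates Pswitch by Monte Carlo over repeated trains, assuming independent pulses that each switch the device with a fixed probability p_pulse; this simplifying assumption and the numbers used are illustrative and do not reproduce the calibrated probabilistic model of Supplementary Note 10.

```python
# Idealized sketch of Eq. (1): P_switch = N_trains|ON / N_trains estimated by
# Monte Carlo, assuming independent pulses each with probability p_pulse
# (an illustrative assumption, not the authors' calibrated model).
import numpy as np

rng = np.random.default_rng(0)

def estimate_p_switch(p_pulse: float, freq_hz: float,
                      t_window: float = 25e-3, n_trains: int = 1000) -> float:
    n_pulses = int(freq_hz * t_window)          # pulses per train
    # A train switches the device ON if at least one pulse triggers the set.
    switched = rng.random((n_trains, n_pulses)) < p_pulse
    n_trains_on = np.count_nonzero(switched.any(axis=1))
    return n_trains_on / n_trains

for f in (20, 200, 2000, 20000):
    print(f"f = {f:>5} Hz -> P_switch ~ {estimate_p_switch(0.002, f):.2f}")
```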

Fig. 2: Switching time and switching probability for different operative conditions.
figure 2

a Voltage pulse train for programming: a train of equally spaced voltage pulses of duration Tpulse = 2.5 µs and a fixed time-window duration of Twindow = 25 ms is applied to the device. b Current response to the voltage pulse train: initially, the device is in the OFF state and the current is zero. After a number of spikes, the RRAM switches on and the current increases. The corresponding conductance shows a step as a function of spike number. c Time to switch on: the time needed for the device to turn on depends on the voltage amplitude of the train; increasing the voltage decreases this time. d Heat map of the time to switch on under different operative conditions: changing the spike frequency and the voltage amplitude of the train changes the time to switch on. In particular, the minimum switching time is reached for the maximum tested voltage and frequency, while the device remains in the OFF state for the minimum tested voltage and frequency. e Switching probability versus frequency: increasing the frequency of the train of pulses increases the switching probability due to the larger number of spikes applied to the device, i.e., a larger number of trials to switch it on. f Switching probability versus voltage amplitude: increasing the voltage amplitude of the train of pulses also increases the switching probability. g Switching probability heat map for all the combinations of voltage and frequency, summarizing the experimental characterization.

RRAM circuit for frequency sensing

Based on the spiking-frequency properties of the device, Fig. 3a shows the RRAM circuit providing tonotopic sensing of the auditory signal frequency. In this circuit, the RRAM devices have separate TEs and a common BE that collects the summation current from all devices according to Kirchhoff's law. The gate voltage is common to all devices, thus ensuring that the current is approximately the same for each device in the ON state due to transistor channel saturation. The spike trains applied to the different TEs have the same frequency, while the voltage amplitude VTE decreases from one TE to the next, e.g., TE voltages V1 = 2 V, V2 = 1.5 V, and V3 = 1 V are applied to the three RRAM devices in Fig. 3a. Based on Fig. 2, the application of a signal at relatively low frequency causes the switching of only a small fraction of devices, namely those biased at high VTE, while a high-frequency input signal causes the switching of a large fraction of devices, including those biased at relatively low voltages.
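A minimal sketch of this operating principle is given below, assuming the same idealized per-pulse statistics as above: devices sharing the input frequency but biased at decreasing VTE switch ON in a fraction that grows with frequency. The amplitudes follow Fig. 3a, while the Vset statistics are hypothetical.

```python
# Sketch of the parallel-RRAM frequency-sensing idea (hypothetical Vset
# statistics): devices share the input-train frequency but see decreasing TE
# amplitudes, so a higher frequency switches a larger fraction of the array ON.
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)
V_TE = np.array([2.0, 1.5, 1.0])        # V, per-device amplitudes as in Fig. 3a
VSET_MEDIAN, VSET_SIGMA = 2.0, 0.5      # illustrative Vset statistics

def n_devices_on(freq_hz: float, t_window: float = 25e-3) -> int:
    n_pulses = int(freq_hz * t_window)
    p_pulse = norm.cdf(V_TE, loc=VSET_MEDIAN, scale=VSET_SIGMA)
    p_train = 1.0 - (1.0 - p_pulse) ** n_pulses   # at least one pulse sets the device
    return int(np.sum(rng.random(V_TE.size) < p_train))

for f in (20, 200, 2000, 20000):
    print(f"f = {f:>5} Hz -> {n_devices_on(f)} device(s) ON")
```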

Fig. 3: Experimental demonstration of RRAM circuit for frequency sensing.
figure 3

a Schematic of the arrangement of parallel cells: each device has its own TE while the BE is in common, where the sum of the currents from the devices is collected. The gate voltage is the same for all the MOSFETs and equal to VG = 1 V. The spike trains applied to the top electrodes have the same frequency, while the voltage amplitude is scaled for each device from higher to lower. b Examples of filtered temporal current traces i(t) collected from the BE in the experiment: when f = 20 Hz, none of the devices switch on. As the frequency increases, the number of ON devices also increases. For f = 20 kHz the devices switch on at different times, resulting in 3 steps in the current trace (see Supplementary Note 9 for raw current traces). c Normalized histograms of the number of ON devices at different frequencies: increasing the frequency, the peak of the histogram moves to the right due to an increase in the number of ON cells. d Linear mapping of log-spaced frequencies in our system and in the cochlea after Zwislocki28: applying trains of different frequencies to the system of parallel devices, the number of ON cells is proportional to the logarithm of the frequency thanks to the logarithmic dependence of the switching probability (dark blue curve). The same behavior of mapping the audio frequency range into a linear space has been observed in the cochlea (light blue curve). Here, logarithmically spaced frequencies from 20 Hz to 20 kHz are mapped into different linearly spaced locations from the apex of the cochlea.

Figure 3b shows an example trace of the measured response current for the circuit of Fig. 3a, where spiking trains were applied with increasing frequencies from 20 Hz to 20 kHz. Based on the maximum measured current, it is possible to infer the number of devices in the ON state thanks to the compliance current Ic of the select transistor. The results in Fig. 3b indicate that, in the reported experiment, for f = 20 Hz none of the devices switch within the experimental time window of 25 ms. As the spike frequency increases, the number of devices in the ON state increases, reaching the maximum of 3 ON devices for f = 20 kHz. In this case, it is possible to identify three distinct steps in the current trace, each corresponding to the switching on of a device. Figure 3c shows the experimental histograms of the number of devices switching to the ON state for each spiking frequency: since the histograms are normalized, the y-axis can be interpreted as the probability PN,ON of observing a specific number of ON-state devices (shown on the x-axis) for a particular frequency of the input train. Figure 3d shows the average number of devices in the ON state as a function of the input-train frequency. Note that the number of devices in the ON state increases linearly with the logarithm of the input frequency. Such a logarithmic frequency sensitivity is the key point for processing audio signals from the environment in the auditory system1. In the cochlea, in fact, different frequencies from 20 Hz to 20 kHz are mapped into linearly spaced distances from the apex of the cochlea, as reported in Fig. 3d27,28. This highlights the similarity of the RRAM-based frequency-sensing circuit of Fig. 3a to the biological cochlea.
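As a simple check of this logarithmic behavior, the sketch below fits a hypothetical number of ON devices against log10(f); the counts are illustrative placeholders, not the measured averages of Fig. 3d.

```python
# Hedged check of the log-frequency mapping: fit the number of ON devices
# against log10(f). The values below are illustrative, not measured data.
import numpy as np

freqs = np.array([20, 200, 2000, 20000], dtype=float)   # Hz
n_on = np.array([0, 1, 2, 3], dtype=float)              # hypothetical ON counts
slope, intercept = np.polyfit(np.log10(freqs), n_on, 1)
print(f"N_ON ~ {slope:.2f} * log10(f) + {intercept:.2f}")  # linear in log10(f)
```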

These experimental results were described by a probabilistic model reported in Supplementary Note 10 and validated in Supplementary Note 11. The model enables the simulation of large-scale networks and is used in the following sections to simulate logarithmic integration and tonotopic mapping in bioinspired neuromorphic systems, where the device-to-device variability of the switching probability is included (see Supplementary Note 8). The model also applies to any RRAM device by adjusting its parameters, e.g., the threshold voltages and their distributions, thus providing a general simulation tool for stochastic computing with resistive switching devices.

Cochlea-inspired tonotopic sensing of audio frequency

Figure 4a schematically illustrates the human auditory system, where the acoustic wave reaches the tympanic membrane and the cochlea. Along the cochlear channel, different frequencies of the acoustic wave are detected at different positions by the hair cells, which are specialized biological strain detectors29. The stimulation of hair cells causes the mechanical opening of ion channels, enabling the flow of a small ionic current that converts the mechanical stimulation into an electrical signal30,31, which eventually propagates to the brain through the auditory nerves. High frequencies (up to 20 kHz) are detected in the initial part of the cochlea, while low frequencies (around 20 Hz) are detected in the deepest region of the cochlea, i.e., the center of the spiral. The intermediate values are logarithmically spaced along the length of the cochlear channel28,32. The cochlea thus allows for spatiotemporal processing capable of mapping temporal signals onto different spatial coordinates of the cochlear channel. Such a tonotopic map of frequencies was experimentally demonstrated by von Békésy, work that earned the Nobel Prize in Physiology or Medicine in 196133. Figure 4b shows the calculation results of a model derived after Zwislocki28: the spectral amplitude response shows a peak for a particular frequency, thus enabling frequency detection on a logarithmic scale.

Fig. 4: Tonotopic mapping of spatiotemporal signals to emulate cochlea processing.
figure 4

a Schematic representation of the human ear: the acoustic wave entering the ear canal reaches the tympanic membrane. Here, mechanical oscillations in the air are propagated to the lymph of the cochlear channel. Different oscillation frequencies are detected by hair cells in different locations of the cochlea. Red dots mark regions where higher frequencies are detected, while moving towards the yellow dots in the inner part of the cochlea, lower frequencies are detected. b Relative response of hair cells for different distances from the cochlea apex, model from ref. 28: each region shows a peak for a particular frequency, enabling frequency detection. c Memristive tonotopic map (MTM): each cell is driven with a different spike-train voltage amplitude, from the maximum V1 to the minimum Vn. The P(f) plot shows, for n = 6, how this voltage arrangement results in different probabilities to switch on the devices. Low-index devices (i = 1) are sensitive to low frequencies, while high-index devices (i = 6) are sensitive to high frequencies. The XOR logic enables the detection of the location of the transition between ON and OFF devices. The plot of XOR output versus frequency shows that, for each frequency, the maximum response is related to a specific XOR index, i.e., a specific spatial position. d Normalized response for different XOR locations: similarly to the cochlea, each XOR shows a peak in the response to a specific frequency. Moving to a higher-index XOR, the peak frequency also moves to higher values. e Analogy between the hair-cell arrangement and the stochasticity-mitigation technique in the MTM: in the cochlea, many hair cells are present and this redundancy mitigates damaged cells and stochasticity. Following the same approach, we put in parallel N memristive tonotopic maps fed by the same train to obtain an average response. f Response of the system to “Symphony No. 9 in D minor”: with n = 30 cells and N = 500 parallel MTMs, a higher frequency resolution is reached and the position of the most active XOR can be seen to follow the music sheet, even for a moderate span of frequencies.

Emulating the processing of the auditory system, Fig. 4c shows the schematic of an RRAM circuit that enables tonotopic mapping of different frequencies, which we refer to as a memristive tonotopic map (MTM); here, the system of parallel volatile RRAM devices is complemented by XOR gates comparing the output voltages of each pair of adjacent RRAM devices. As in the previous experiment, the trains applied to the different top electrodes have a common frequency, corresponding to the input signal frequency, while the applied voltage VTE decreases from the highest V1 to the lowest VN. As a result, device i + 1 is sensitive to higher frequencies than device i, with index i = 1, 2, …, N. This property becomes evident when examining the calculated switching probability Pswitch shown in Fig. 4c: we show this behavior for six cells with increasing VTE values, derived from our probabilistic model, which was calibrated using experimental characterization data (see Supplementary Note 11). Specifically, as VTE increases, the probability of switching at lower frequencies also increases. The role of the XOR gates is to identify the boundary between the last ON device and the first OFF device (see Supplementary Note 12). Figure 4c also shows a matrix plot of the simulated average XOR output as a function of frequency, indicating that the response of each XOR gate is maximum for a specific frequency. This behavior is further highlighted in Fig. 4d, showing the normalized simulated response of each XOR output as a function of the train frequency. The first XOR gate has peak activity around f = 20 Hz, while XOR gates of higher order respond at higher frequencies. The maximum frequency of the audio range (f = 20 kHz) is detected by the last XOR.
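A minimal sketch of the XOR read-out logic follows, with illustrative device states: the XOR of each pair of adjacent devices is 1 only at the ON/OFF boundary, which encodes the input frequency as a spatial position.

```python
# Sketch of the MTM read-out: XOR gates between adjacent devices flag the
# boundary between the last ON device and the first OFF device.
# The device states below are illustrative.
from typing import List

def xor_outputs(states: List[int]) -> List[int]:
    """XOR of each pair of adjacent device states (1 = ON, 0 = OFF)."""
    return [a ^ b for a, b in zip(states, states[1:])]

# Example: devices 1-3 switched ON (low-frequency-sensitive side), 4-6 stayed OFF.
states = [1, 1, 1, 0, 0, 0]
print(xor_outputs(states))   # -> [0, 0, 1, 0, 0]: boundary between devices 3 and 4
```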

This approach can be further optimized by emulating the hair-cell redundancy of the biological system: as depicted in Fig. 4e (left), there are on average 15,000 hair cells in the cochlea, working in different locations but also in parallel at the same location, thus introducing redundancy to mitigate the effects of stochasticity and/or malfunctioning hair cells. The same approach can be emulated by introducing a higher number of parallel RRAM devices in the tonotopic circuit to provide better averaging, as reported schematically in Fig. 4e (right). To support this approach, we simulated a larger network with n = 30 cells in the circuit of Fig. 4c and 500 parallel RRAM devices for each frequency channel. As an audio sample, we chose the “Finale” of Beethoven's Symphony No. 9 in D minor due to its reduced set of tones (frequencies), spanning from the C note (260 Hz) to the G note (390 Hz). Figure 4f shows the simulated response of the circuit, clearly indicating that the active XOR changes following the music sheet.
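The averaging effect of parallel MTMs can be sketched as follows, using hypothetical per-device ON probabilities rather than our calibrated model: drawing N independent MTM realizations from the same probabilities and averaging their XOR outputs smooths the stochastic ON/OFF boundary.

```python
# Sketch of redundancy by parallel MTMs (hypothetical probabilistic states):
# averaging the XOR outputs of N identical MTMs fed by the same spike train
# smooths out the stochastic switching of individual devices.
import numpy as np

rng = np.random.default_rng(2)

def average_xor(p_on_per_device: np.ndarray, n_parallel: int = 500) -> np.ndarray:
    # Each parallel MTM draws its own random device states from the same probabilities.
    states = rng.random((n_parallel, p_on_per_device.size)) < p_on_per_device
    xors = np.logical_xor(states[:, :-1], states[:, 1:])
    return xors.mean(axis=0)          # average XOR activity over the N MTMs

# Illustrative per-device ON probabilities, decreasing with the device index.
p_on = np.array([0.99, 0.95, 0.80, 0.40, 0.10, 0.02])
print(np.round(average_xor(p_on), 2))  # peak marks the average ON/OFF boundary
```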

Interpretability of tonotopic map: speech recognition

In the biological auditory system, the electrical signals generated by the cochlea in response to the incoming audio signal need to be interpreted by the auditory cortex34. Here, a classification process takes place to recognize the sound, discriminating the tweet of a bird from the sound of flowing water in a river. The human brain, moreover, shows the capability of speech recognition, i.e., of attributing meaning to a particular combination of phonemes35. To demonstrate such capability in our memristive auditory system, we simulated a speech-recognition task implemented with our MTM. We selected a set of 4 words carrying logical information (“yes”, “no”) or spatial information (“up”, “down”), spoken by a single person and repeated 20 times. Figure 5a shows the spectrograms of the audio traces for these four words: one can notice different time durations, e.g., “up” is shorter than “down”, and different frequency spectra, e.g., the high frequencies in “yes” due to the letter “s”. Also, note that phonemes differ not only in terms of frequencies but also regarding the pressure that they exert on the hair cells. Figure 5b, adapted from ref. 36, shows a map of phonemes located according to their frequency and threshold pressure, highlighting the features used by biological systems for classification. Vowels span a broad spectrum of frequencies and are linked to a higher threshold pressure, such as the higher amplitude of the sound “ye” in “yes” and “o” in “no”. The system for speech recognition is shown in Fig. 5c, with the MTM for n = 3. We kept this value as low as possible to showcase the capabilities of our system with a minimal number of elements, thus reducing power consumption and occupied area (see Supplementary Note 13 for results with different n values and different numbers of cortical-network neurons). Spike trains are applied to each TE based on the raw normalized audio trace using the analog-to-spike (A2S) conversion reported in Supplementary Note 14. The conversion operates with a different threshold for each channel to capture different pressure levels, i.e., signal amplitudes, thus providing an additional feature for the recognition task. The three outputs of the XOR gates are fed to a classical feedforward neural network that plays the same role as the auditory cortex in the biological brain. We selected N = 20 for the MTM parallelization to obtain an average response to the applied trains. Figure 5d shows the MTM output values of each XOR for 100 trials per word. In every cycle, the audio sample of the selected word is randomly chosen from the set of available samples. It is possible to recognize the different patterns, where, e.g., “yes” corresponds to a higher average activity while “up” corresponds to a lower average activity. This behavior can be attributed to the spectrograms of Fig. 5a, where “up” displays a shorter duration and a medium-frequency composition, whereas “yes” displays a longer duration and high-frequency components. We used the first 50 examples of MTM output traces to train the neural network and the last 50 examples for inference. Figure 5e shows the confusion matrix for the inference, reaching an accuracy of 96.5%.
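For the classification stage only, a hedged sketch is given below using scikit-learn's MLPClassifier as a stand-in for the feedforward network trained in MATLAB; the feature vectors are synthetic placeholders rather than the measured XOR outputs, so the printed accuracy is not comparable with the 96.5% reported above.

```python
# Hedged sketch of the classification stage: a small feedforward network
# (scikit-learn MLPClassifier, standing in for the MATLAB toolbox used by the
# authors) trained on MTM-like feature vectors. Features are synthetic
# placeholders, not the measured XOR outputs.
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(3)
WORDS = ["yes", "no", "up", "down"]

# 100 trials per word, 3 XOR-output features each (synthetic class-dependent means).
means = rng.uniform(0.1, 0.9, size=(len(WORDS), 3))
X = np.vstack([rng.normal(m, 0.1, size=(100, 3)) for m in means])
y = np.repeat(np.arange(len(WORDS)), 100)

# First 50 trials per word for training, last 50 for inference (as in the text).
train = np.concatenate([np.arange(i * 100, i * 100 + 50) for i in range(len(WORDS))])
test = np.setdiff1d(np.arange(len(y)), train)

clf = MLPClassifier(hidden_layer_sizes=(16,), max_iter=2000, random_state=0)
clf.fit(X[train], y[train])
print(f"inference accuracy: {clf.score(X[test], y[test]):.2%}")
```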

Fig. 5: Speech recognition from the tonotopic map.
figure 5

a Spectrograms for different spoken words: each combination of phonemes results in a different frequency composition and a different amplitude of the power spectrum. b Schematic map of different phonemes as a function of their principal frequency component and sensation level for hearing (adapted from ref. 36). c Artificial auditory system: audio signals of different words are processed with a simple analog-to-spike conversion. Trains of spikes reach the cochlear processing layer made with an MTM, and its output is fed to a feedforward neural network for classification. d Outputs of the XORs for different spoken words over 100 different cycles. For each cycle, the selected word is randomly picked from the set of available audio samples. e Confusion matrix for the classification of the spoken words.

Discussion

By exploring the building blocks of sensory biological systems, it is possible to identify the features that are responsible for the exceptional computational capabilities of the brain. Implementing these paradigms in hardware, however, requires new technologies and methodologies. Through a system-level CMOS approach, the functional behavior of biological mechanisms can be simulated, however at the cost of large area occupation, complex design, and high power consumption (an indicative comparison is provided in Supplementary Note 15). Furthermore, these systems pose challenges in terms of interpretability, as the biological mapping and meaning are often lost37. This makes it challenging to implement powerful computational-neuroscience models in hardware, thus hindering the potential to attain the efficiency and capabilities of the biological brain38.

Memristive devices mitigate these issues thanks to their high scalability and low power, as well as the capability to compute directly within the memory for reduced latency and energy consumption. Memristive devices are two-terminal devices with tunable conductance, thus providing a realistic hardware description of synaptic plasticity in the brain and paving the way for the emulation of biological neural elements. Major research efforts have gone in this direction, where the memristive element is used as a static first-order memory element to store the synaptic weight of an artificial neural network39. In this framework, memristors can act as accelerators of hardware neural networks within the context of in-memory computing40,41. Other research directions aim at the exploration of biologically plausible paradigms using memristors as dynamic synaptic elements. The main features of biological mechanisms such as spike-timing-dependent plasticity (STDP) and Hebbian learning have been demonstrated through these approaches42,43,44. However, the full potential of memristive devices can be exploited by moving the computation to the device level, thus minimizing the need for external circuitry45 and building explainable and biologically plausible systems through a bottom-up approach.

In our work, the computation in memristors relies on the probability of switching, i.e., the probability of transmission of information in a volatile memory, dealing with bursts of spikes rather than individual spikes. Biological systems, in fact, operate through an ensemble of probabilistic elements that together perform computation in a stochastic way46. Moreover, biological networks make unreliable elements sufficiently reliable by working with spike bursts as the units of neural information47. Thanks to this approach, we have shown that it is possible to perform a stochastic integration of the number of spikes, where the integrable range can be controlled by the voltage amplitude of the spikes without the need for large capacitances. This result enables a new approach for spiking networks, capable of enlarging the space of computation to a logarithmic range of times and/or frequencies. Moreover, just as the brain does not rely on a single synapse, we do not rely on a single device: multiple parallel devices are used to mitigate the stochastic variation. Our methodology, which can map 3 orders of magnitude of temporal features into a linear space of voltages, has a general validity beyond audio signals, with applications ranging from tactile to visual sensors48,49. Also, there is a large margin for improving the system by increasing the number of devices in a single MTM and increasing the number of parallel MTMs to perform more challenging tasks, such as recognizing more phonemes and even complete words. However, large-scale simulations go beyond the scope of this work, which is mainly focused on the proof-of-concept demonstration of the tonotopic classification of audio signals by memristive networks. Nevertheless, our devices promise good energy scaling even in large systems: when the device is in the OFF state, the resistance is in the range of tens of TΩ and no significant current flows through the device. Figure 1c shows that the OFF-current is below the resolution of the instrument, which is in the order of picoamperes (pA). Referring to the same figure, it is possible to see that when the device is in the ON state, the current IC flowing in the 1T1R series is limited by the transistor. In this work, the value of IC has been chosen high enough to be properly readable through a low-impedance channel of the oscilloscope (current resolution in the order of 1 µA). This value also provides a fast estimation of the energy consumption of a single spike through the ON device, given by:

$$E=V_{\mathrm{TE}}\,I_{\mathrm{C}}\,t_{\mathrm{pulse}}=1\,\mathrm{V}\times 16\,\mathrm{\mu A}\times 2.5\,\mathrm{\mu s}=40\,\mathrm{pJ}$$
(2)

However, note that the device can be switched on with much lower currents (see Supplementary Note 16), down to IC = 10 nA, thus providing excellent scalability of the energy consumption. The ultra-low-energy operation also allows for efficient parallelization, accommodating high values of the n and N parameters of the MTM. Additionally, the presented system achieves logarithmic integration through a probabilistic mechanism, thus eliminating the need to supply energy for charging a capacitance with each spike or for integration operations. Thus, energy is consumed by only a reduced number of spikes, namely those applied after the device switches, i.e., when the conductance is in the LRS.
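A back-of-the-envelope restatement of Eq. (2) and of its scaling with the compliance current is given in the short sketch below; the 10 nA figure is the one quoted from Supplementary Note 16.

```python
# Energy per spike through the ON device, Eq. (2), and its scaling with the
# compliance current (10 nA value from Supplementary Note 16).
def spike_energy(v_te: float, i_c: float, t_pulse: float) -> float:
    return v_te * i_c * t_pulse

print(spike_energy(1.0, 16e-6, 2.5e-6))   # 4.0e-11 J = 40 pJ (readout-friendly IC)
print(spike_energy(1.0, 10e-9, 2.5e-6))   # 2.5e-14 J = 25 fJ at IC = 10 nA
```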

The device-level approach also exploits the volatility of the memristive devices, which allows the spontaneous relaxation to a ground energy state of the system where all devices are OFF. As a result, after a sufficient time dictated by the RRAM retention, the system becomes ready again for a new computation without any need for repeated initialization, thus leading to a substantial reduction of energy consumption and system complexity. This property is also fundamental to emulate the asynchronous computation of the biological nervous system, since the arrival of a signal serves as the triggering input for computation, while the absence of a signal lets the system return to the ground state by spontaneous relaxation1,50. Furthermore, since a reset phase is not required for our RRAM, the device can be operated with unipolar voltages, thus providing a significant advantage in the complexity of the programming circuits and in the area scaling of the selector device. Although the area scaling of the 1T1R structure is not the focus of this study, it is useful to highlight certain features of the presented RRAM device that may offer insight for future developments. Specifically, our device can reduce the crucial parameters of programming voltage and programming current when scaling 1T1R structures, in compliance with the requirements for the integration of RRAM in the most recent technological nodes51. Additionally, it is important to emphasize that, due to its volatility, the device does not require bipolar voltages for resetting, since it can operate in a unipolar manner. This characteristic could also pave the way for a transition to a bipolar junction transistor (BJT) selector, similar to the reported phase-change memory (PCM) technology, with related advantages52.

In summary, this work presents neuromorphic circuits for spatiotemporal signal processing with volatile RRAM relying on a device-level approach. We demonstrate the implementation of the main neuromorphic primitives for audio signal processing in the cochlea, namely logarithmic integration and tonotopic mapping of temporal information. Our tonotopic transformation is suitable for speech recognition mimicking its biological counterpart, preserving biological plausibility and explainability. These results have a general validity beyond audio signal processing, thus supporting memristive devices for the hardware processing of temporal signals with logarithmically spaced features and enlarging the set of available neuromorphic primitives necessary to reach the energy efficiency, error tolerance, and high integration density promised by the neuromorphic computing paradigm based on memristive devices.

Methods

Devices fabrication

The volatile resistive devices presented in this work are co-integrated with Si-based transistors fabricated with standard CMOS technology. The bottom electrode is a graphitic carbon pillar with a diameter of 70 nm connected to the transistor drain. The 5 nm hafnium oxide (HfOx) switching layer and the 100 nm silver (Ag) top electrode are sequentially deposited by e-beam evaporation without breaking the vacuum, at a monitored pressure of 3 × 10−6 mbar, to carefully tune the HfOx stoichiometry and the Ag/HfOx interface quality.

Devices characterization

All electrical characterizations and experiments are carried out in a probe station using rhodium-plated tungsten needles for contacting. An Agilent HP4156C semiconductor parameter analyzer is used for the quasi-static characterization of the devices. Dynamic properties, as well as the experiments, are studied using an AimTTi TGA12104 arbitrary waveform generator and a Tektronix MSO58 oscilloscope for the acquisition.

Switching probability measurement protocol

In the presented measurement, the train amplitude and frequency were set randomly in each cycle to avoid correlation effects. We tested every possible combination of selected voltage and frequency for the desired number of cycles. Supplementary Note 7 schematically reports our measurement protocol.

Simulations

All the numerical simulations concerning the switching probability and the MTM for cochlear sensing and speech recognition are carried out in MATLAB R2022b with our developed models. Neural network training and inference for the interpretability of the MTM results are performed with the MATLAB Statistics and Machine Learning Toolbox.

Analog-to-Spike conversion

For our study of speech recognition, the analog audio signals are processed with an analog-to-spike conversion algorithm. The information about the amplitude of the audio signals is captured thanks to 3 different thresholds used to generate 3 different pulse waveforms that are supplied as input signals to the memristive tonotopic map circuit (see Supplementary Note 14 for the block diagram of the analog-to-spike conversion and examples of conversion traces).
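A minimal sketch of such a threshold-based conversion is given below; the thresholds and the toy input signal are illustrative assumptions, and the actual waveform generation is the one detailed in Supplementary Note 14.

```python
# Minimal sketch of a threshold-based analog-to-spike (A2S) conversion,
# assuming one spike train per threshold level (thresholds are illustrative;
# the actual conversion is described in Supplementary Note 14).
import numpy as np

def analog_to_spikes(audio: np.ndarray, thresholds=(0.2, 0.5, 0.8)) -> np.ndarray:
    """Return one binary spike train per threshold for a normalized audio trace."""
    audio = np.abs(audio) / np.max(np.abs(audio))      # normalize to [0, 1]
    return np.stack([(audio > th).astype(int) for th in thresholds])

t = np.linspace(0, 1, 1000)
audio = np.sin(2 * np.pi * 5 * t) * np.exp(-2 * t)     # toy audio-like signal
spikes = analog_to_spikes(audio)
print(spikes.shape, spikes.sum(axis=1))                # spikes per channel
```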