Putting AI in your Ears with 3D Neural Networks
Article written by Matthew Davies, PhD
It’s difficult to communicate with someone when neither of you speak the same language; apps and online tools like Google Translate can help to bridge the gap, but they hardly make for a natural conversation. However, the rise of artificial intelligence (AI) has opened the door for the development of speech-to-speech technology, where spoken language is translated instantly in real time. To bring this idea closer to reality, a collaboration of European researchers led by Professor Cristell Maneux at the University of Bordeaux have proposed concepts for a 3D artificial neural network accelerator. This ultra-compact, highly efficient design could enable the construction of standalone earpieces capable of translating spoken language instantly, with no need for internet access.
Breaking Down Language Barriers with Speech-to-Speech Translation
As our world becomes ever more complex and interconnected, effective communication is more important than ever. In today’s globalised economy, businesses, research collaborations, and even casual conversations often involve speakers of several different languages. This diversity can be enriching, but it can also create barriers to communicating effectively.
The dream of speech-to-speech translation is to break these barriers down entirely. Speech-to-speech refers to the direct conversion of speech in one language to speech in another. This can involve several steps including:
- Automatic Speech Recognition (ASR), where raw spoken language is converted to a digital form such as text,
- Machine Translation (MT), where the generated text is translated to another language,
- and Text-to-Speech Synthesis (TTS), where the translated text is converted into natural-sounding speech.
A device capable of speech-to-speech translation would enable natural conversations between multiple individuals speaking different languages, by translating in real-time. You could speak to someone in your native language and speech-to-speech technology would immediately translate your words as you spoke them to your correspondent’s native language. This has obvious applications to industries such as tourism, but it could also play a vital role in international trade and diplomacy, where it’s vital that communication is precise with minimal loss of meaning.
The Technical Challenges of Constructing EdgeAI Devices
The rise of AI has already facilitated the creation of speech-to-speech technologies. But one drawback of the ones currently available on the market is that they don’t perform the translation themselves. They rely on large, energy-devouring supercomputers with thousands of graphics processing units (GPUs). Typical AI tasks require lots of computing resources, far more than an individual earpiece could supply. It’s these machines which process the speech data, translate it, and then communicate the results via the cloud back to the earpieces. There are several key issues with this way of operating. First of all, since all the conversation data is being sent to the AI model’s servers, there may be concerns over privacy and data protection. Secondly, with so much data being transferred both via the cloud and within the AI model’s own machinery, large quantities of energy are consumed.
The alternative would be an Edge AI device – a small device capable of performing AI tasks by itself without connection to a datacentre. They’re called ‘Edge’ devices because they would operate independently of the large AI models like ChatGPT, on the edge of the AI sphere, much like a satellite orbiting the Earth. Nevertheless, current computer architecture is too inefficient for Edge AI devices to be realistic. This is what a pan-European research team coordinated by the University of Bordeaux wants to change.
The FVLLMONTI project, led by Prof Cristell Maneux, aims to develop an Edge AI device specifically for speech-to-speech translation. This would be a small, lightweight earpiece capable of translating speech from one language to another, without any need for internet access or connection to a supercomputer. The FVLLMONTI team involves experts in nanotechnology and electronic engineering from universities across France, Germany, Austria, and Switzerland. One of their most important results has been the development of a new class of neural network accelerators. These are computational units specifically designed to perform the complex tasks needed by AI with much greater efficiency.
Computing in 3D with the Neural Network Compute Cube
One of the most promising ideas the team have explored is the development of three-dimensional computing architectures. In traditional computers, components are spread out over a 2D processing unit. Thanks to decades of development, 2D designs are extremely reliable and cost-efficient, but they simply aren’t designed to efficiently handle the kind of neural network tasks that are used in AI technologies.
One of the biggest challenges the team identified is the volume of data that needs to be transferred between the computer’s processor and memory. For AI tasks, the amount of data that needs to be transferred is orders of magnitude larger than for traditional computations and, when you’re confined to just two dimensions, this uses up lots of time and energy. Since compact devices, such as earpieces, have significant limitations when it comes to space, power, and battery life, increased efficiency is a paramount necessity. To address this, the FVLLMONTI researchers have been investigating the potential of vertically stacked units. By layering computing elements vertically, as well as horizontally, they found that the speed and efficiency with which a device can perform AI tasks can be vastly improved.
To this end, the FVLLMONTI team detailed the specifications for a new fundamental computing unit they call the Neural Network Compute Cube (N2C2).
The Beating Heart of the Machine: The Systolic Array
To test the potential of their N2C2 units, the team began to investigate how they could be combined in a way that can perform matrix calculations effectively. Working with matrices, which are large arrays of numbers, is the fundamental basis of AI operations. The FVLLMONTI group started by arranging N2C2 units into a structure known as a systolic array. The term ‘systolic’ comes from the name for the rhythmic beats of our hearts. Like a heart, the systolic array operates in ‘pulses’, whereupon data flows through successive N2C2 units in the grid.
Combining the N2C2 units in a systolic array offers several key advantages. Systolic arrays enable higher processing speeds by having the units work together in parallel. Whereas it would take a traditional processor more time to deal with calculations involving larger matrices, systolic arrays perform the tasks in constant time. This means that systolic arrays are ideally suited for dealing with the matrix-heavy computations characteristic of AI tasks. The FVLLMONTI team reported impressive speed-ups when N2C2 units were used to perform tasks. A 4 x 4 arrangement of N2C2 units achieved speeds of over 13 times the baseline, with larger 8 x 8 and 16 x 16 arrangements offering speeds of around 37- and 90-fold the baseline.
Unlocking the N2C2 Units’ Full Potential
But a systolic array can be constructed from older, classic technology. The specific design of the N2C2 units offers great potential for more impressive results. For a start, the systolic array the team used to test the units was still a 2D structure, but one of their main draws is that they can be stacked in 3D too. The team estimates that this alone would offer an additional 10x speed-up.
Another key design feature of the N2C2 units is that they are built using Vertical Gate-All-Around Nanowire Field-Effect Transistors (VNWFETs). A FET is effectively a tiny switch used in traditional computer central processing units (CPUs) or GPUs to control the flow of electricity through the circuit board. Nowadays, FETs are mostly planar and spread out only in two dimensions. On the other hand, a VNWFET is where the switch —or gate— surrounds the entire unit, enabling them to be stacked in both horizontal and vertical directions. This overcomes one of the limitations of traditional FETs, which is that the gate length cannot be reduced much below 10nm. With VNWFETs, the gate is aligned vertically, meaning that horizontal spread of hardware can be reduced. This can lead to ultra-compact processing units with an estimated extra 10x gain in performance.
The N2C2 units were also designed to contain ferroelectric elements, which is what the F in FVLLMONTI stands for. Ferroelectricity means that units can store some of their own data non-volatilely — that is, they have memory which is not lost when the device is switched off. This has obvious benefits: it reduces the distance over which data needs to be transferred, improving speed, and also helps to reuse data during computations, saving energy. The FVLLMONTI team estimate that this could yield a further 20x boost in performance.
Designing Software and Hardware Together for Greater Performance
An additional avenue the team explored was to investigate how the N2C2 hardware and speech-to-speech translation software could be co-optimised to improve energy efficiency and speed. The team employed a strategy called ‘structured pruning’, which compresses data in the neural network by removing blocks with low value, called as such when they have limited impact on the final result. This method of pruning synergises with the underlying systolic array structure, meaning that significant speedups and improvements in energy efficiency can be achieved.
The research team found that each of the tasks involved in speech-to-speech translation could be pruned to improve speed: Automatic Speech Recognition tasks could be pruned by up to 15%, and Machine Translation tasks by up to 85%, with performance reduction of less than 10%. This pruning led to speed improvements of up to 74-fold for a block of 32 x 32 N2C2 units. In addition, the energy consumption was reduced by between 22% and 34%. These results offer encouragement that the team’s N2C2 architecture has the potential to provide the speed and efficiency needed for the development of an Edge AI device like a speech-to-speech translation earpiece.
Towards Speech-to-Speech Translation on Edge AI Devices
The FVLLMONTI team’s proposed N2C2 unit is an exciting development towards the realisation of Edge AI devices capable of performing AI tasks locally. The latest results from the FVLLMONTI project indicate that this alternative architecture can drastically improve the efficiency in performing speech-to-speech related AI tasks. With the fundamentals of the N2C2 unit established, the team are now looking to take the next steps to build a functional, compact earpiece that could break down language barriers instantly, and with no need for an internet connection.
Moreover, beyond speech-to-speech translation, the FVLLMONTI team’s work has the potential to revolutionise computing in much broader ways. The development of alternative computing architecture based on vertical nanowire FET technologies will be critical for the construction of all sorts of chips capable of performing a variety of AI tasks efficiently.
SHARE
DOWNLOAD E-BOOK
REFERENCE
https://doi.org/10.33548/SCIENTIA1310
MEET THE RESEARCHERS
Professor Cristell Maneux
IMS Laboratory, Department of Sciences and Engineering, University of Bordeaux, Bordeaux, France
Prof Cristell Maneux is the Director of the IMS Laboratory in Bordeaux. She received her Master of Science Degree in Electronic Engineering in 1995 and her PhD in 1998, both from the University of Bordeaux where she still works today. With over 20 years of experience in the modelling of advanced and emerging devices, she has worked on a variety of projects and is especially interested in the development of new transistor technology. From 2008 to 2020, she led the MODEL team which pioneered the development of Carbon Nanotube based transistor models. She now leads several international research collaborations including FVLLMONTI and ULTIMATE. In particular, the FVLLMONTI project aims to develop 3D devices and architecture so that AI tasks might be performed efficiently by compact circuits.
CONTACT
E: Cristell.maneux@ims-bordeaux.fr
W: https://www.linkedin.com/in/cristell-maneux-7294945b/
Doctor Jens Trommer
NaMLab gGmbH, Dresden, Germany
Dr Jens Trommer is a Senior Scientist at NaMLab gGmbH in Dresden and leads the emerging devices development team there. He received his Degree in Electronic and Sensor Materials from the Freiberg University of Mining and Technology in 2011 and his PhD in Electrical Engineering from Dresden University of Technology in 2017. His main interest is in the doping of free nanowire and nanosheet devices and its applications in circuits. He’s published extensively on field-effect transistors and is developing reconfigurable and ferroelectric transistors as part of the FVLLMONTI project.
CONTACT
E: jens.trommer@namlab.com
W: https://www.linkedin.com/in/trommerjens/
Doctor Guilhem Larrieu
LAAS-CNRS, Toulouse, France
Dr Guilhem Larrieu is a Director of Research CNRS, working at the LAAS-CNRS laboratory in Toulouse, France. He received his PhD in Electronics in 2004 from the University of Lille and worked as an independent researcher until 2010 when he moved to the LAAS-CNRS. He is also a Research Fellow at the University of Tokyo. He currently leads research into Nano-&Neuro-Electronics aiming to develop new devices based on functional nanostructures that are ever smaller and ever more efficient. His expertise is critical for furthering the goal of the FVLLMONTI project in creating compact architecture that can perform AI tasks.
CONTACT
E: guilhem.larrieu@laas.fr
W: https://www.linkedin.com/in/guilhem-larrieu-1257b653/
Doctor Oskar Baumgartner
Global TCAD Solutions GmbH, Vienna, Austria
Dr Oskar Baumgartner is the Chief Operating Officer of Global TCAD Solutions (GTS), a spin-off company of Vienna University of Technology. It offers virtual environments for the simulation of semiconductor components to test and investigate new designs for electronic devices. Dr Baumgartner received his Degree in Electrical Engineering from Vienna in 2005, going on to obtain a Diplomingenieur in Microelectronics in 2007. He wrote his PhD Thesis on the Numerical Simulation of Quantum Transport. He collaborated with GTS in 2013 before joining it as a senior researcher in 2015. As part of the FVLLMONTI project, he and his team create models and TCAD Tools to build 3D computer architectures from the novel nanotechnologies.
CONTACT
E: o.baumgartner@globaltcad.com
W: https://www.linkedin.com/in/oskar-baumgartner-19754219b/
Professor Ian O’Connor
Lyon Institute of Nanotechnology, Ecole Centrale de Lyon, Lyon, France
Prof Ian O’Connor is a Distinguished Professor in the Department of Electronic, Electrical and Control Engineering at Ecole Centrale de Lyon, France and joint head of the Electronics group at the Lyon Institute of Nanotechnology. He received his PhD in Electronics from the University of Lille, France, in 1997. He has authored, or co-authored, nearly 300 book chapters, journal publications, conference papers and patents, and is a leading authority on chip design. He is particularly interested in designing novel computing architectures, such as the N2C2 that would be used for speech-to-speech translation as part of the FVLLMONTI project.
CONTACT
E: ian.oconnor@ec-lyon.fr
W: https://www.linkedin.com/in/ian-o-connor-7592153/
Doctor Giovanni Ansaloni
Embedded Systems Laboratory, EPFL Lausanne, Lausanne, Switzerland
Dr Giovanni Ansaloni is a researcher and lecturer at the Embedded Systems Laboratory of EPFL Lausanne. He received his Masters’ Degree in Electronic Engineering from the University of Ferrara, Italy, in 2003, an Executive Masters in Embedded System Design from the ALaRI Institute, Switzerland, in 2005, and a PhD from USI, Switzerland, in 2011. His work focuses on the development of architectures and algorithms for edge computing, where traditionally energy-consuming tasks might be optimized to work on compact devices. He is the co-author of more than 70 papers in international conferences and has collaborated in several European research projects including FVLLMONTI.
CONTACT
E: giovanni.ansaloni@epfl.ch
W: https://www.linkedin.com/in/giovanni-ansaloni-61a1871/
Doctor Jean-Luc Rouas
LABRI, University of Bordeaux, Bordeaux, France
Dr Jean-Luc Rouas is a CNRS Researcher and co-head of the Data Processing and Analysis team at the Bordeaux Computer Science Research Laboratory (LaBRI), France. He obtained his PhD from Université Paul Sabatier in Toulouse, France, in 2005 on the subject of automatic language identification. His research topics include automatic speech recognition and prosodic modelling linked to speech variability and expressivity for interaction analysis. He is the co-author of more than 100 peer-reviewed publications and has led or participated in more than 20 national and international collaborative projects including the FVLLMONTI project.
CONTACT
E: jean-luc.rouas@labri.fr
W: https://www.linkedin.com/in/jean-luc-rouas-59787747/
KEY COLLABORATORS
Dr Marina Deng, University of Bordeaux
Dr Chhandak Mukherjee, CNRS
Prof Alberto Bosio, École Centrale de Lyon
Dr Damian Deleruyelle, École Centrale de Lyon
Sara Mannaa, École Centrale de Lyon
Prof Thomas Mikolajick, NaMLab gGmbH
Cigdem Cakirlar, NaMLab gGmbH
Dr Bruno Neckel-Wesling, NaMLab gGmbH
Dr Mischa Thesberg, Global TCAD Solutions GmbH
Dr Christoph Lenz, Global TCAD Solutions GmbH
Sylvain Pelloquin, LAAS-CNRS
Konstantinous Moustakas, LAAS-CNRS
Prof. David Atienza, EPFL
Alireza Amirshahi, EPFL
FUNDING
Funded by the European Union’s Horizon 2020 Research and Innovation Program. Views and opinions expressed are however those of the authors only and do not necessarily reflect those of the European Union or the European Research Council. Neither the European Union nor the granting authority can be held responsible for them.
FURTHER READING
C Maneux, C Mukherrjee, M Deng, et al. “Modelling of vertical and ferroelectric junctionless technology for efficient 3D neural network compute cube dedicated to embedded artificial intelligence.” 2021 IEEE International Electron Devices Meeting (IEDM). IEEE, 2021. DOI: 10.1109/IEDM19574.2021.972057
I O’Connor, S Mannaa, A Bosio, et al., FVLLMONTI: The 3D Neural Network Compute Cube (N2C2) Concept for Efficient Transformer Architectures Towards Speech-to-Speech Translation, 2024 Design, Automation & Test in Europe Conference & Exhibition (DATE), 2024, 1-6. DOI: 10.23919/DATE58400.2024.10546700
A Kumar, J Müller, S Pelloquin, et al., Logic Gates Based on 3D Vertical Junctionless Gate-All-Around Transistors with Reliable Multilevel Contact Engineering, Nano Letters, 2024, 24 (26), 7825-7832. DOI: 10.1021/acs.nanolett.3c04180
M Thesberg, F Schanovsky, Z Stanojevic, et al., Compact Metal-Ferroelectric-Insulator-Semiconductor (MFIS) Approaches Versus TCAD For The Modeling Of Ferroelectric Transistors (FeFETs): Percolation, Steep-Subthreshold and Depolarization, 2023 International Conference on Simulation of Semiconductor Processes and Devices (SISPAD), 2023, 117-120. DOI: 10.23919/SISPAD57422.2023.10319645
JL Rouas, C Brazier, LB Letaifa, et al., Structured pruning for efficient systolic array accelerated cascade Speech-to-Text Translation, 26th Interspeech Conference 2025.
P Palacios, R Medina, JL Rouas, et al., Systolic Arrays and Structured Pruning Co-design for Efficient Transformers in Edge Systems, GLSVLSI ’25: Proceedings of the Great Lakes Symposium on VLSI 2025, 2025, 320-32., DOI: 10.1145/3716368.3735158
C Cakirlar, M Simon, G Galderisi, et al., Cross-shape reconfigurable field effect transistor for flexible signal routing, Materials Today Electronics, 2023, Volume 4, 100040. DOI: 10.1016/j.mtelec.2023.100040
C Cakirlar, B Neckel-Wesling, K Moustakas, et al. “Process Integration of U-Shape Ambipolar Schottky-Barrier Field-Effect Transistors. Advanced Electronic Materials”, 2025, DOI: 10.1002/aelm.202500310
REPUBLISH OUR ARTICLES
We encourage all formats of sharing and republishing of our articles. Whether you want to host on your website, publication or blog, we welcome this. Find out more
Creative Commons Licence (CC BY 4.0)
This work is licensed under a Creative Commons Attribution 4.0 International License.
What does this mean?
Share: You can copy and redistribute the material in any medium or format
Adapt: You can change, and build upon the material for any purpose, even commercially.
Credit: You must give appropriate credit, provide a link to the license, and indicate if changes were made.
SUBSCRIBE NOW
Follow Us
MORE ARTICLES YOU MAY LIKE
Gopal Ranganathan | The Use of Artificial Intelligence in Personalised Airline Services
In today’s competitive airline industry, providing personalised services to passengers is becoming increasingly important for customer satisfaction and business success. Gopal Ranganathan from Quad Optima Analytics has developed an innovative artificial intelligence system to help airline executives implement and govern personalisation programmes. This cutting-edge technology aims to increase profits by tailoring services to individual customers while maintaining sound revenue management principles.
Professor Nancy Burnham | Imaging on the Nanoscale: Improving Techniques in Atomic Force Microscopy
Atomic force microscopy (AFM) provides the means to image surfaces with nanometre resolution, allowing scientists to look at the individual building blocks and forces that make up the world around us. Professor Nancy Burnham of Worcester Polytechnic Institute and her colleagues Lei Lyu and Lily Poulikakos at the Swiss Federal Laboratories for Materials Science and Technology (Empa) have worked on how we can reduce artefacts in these images and ensure they are accurately interpreted. By considering and applying these techniques, high-quality AFM research can be produced.
Dr Sebastian Rabien | Making Membrane Mirrors for Future Space Telescopes
Mirrors play a key role in space telescopes, but to keep increasing the scale of this technology, mirrors need to be light and compact, so they can be transported in spacecraft, but also able to be adaptively corrected and controlled to ensure their accuracy. Dr Rabien and his colleagues from the Max Planck Institute for Extraterrestrial Physics, in Germany, have developed a technique to make extremely thin and lightweight mirrors, which can then be controlled with adaptive optics, making them a potential solution for larger space telescopes.
Probing Electron Dynamics in the Ultrafast Regime
In the atoms that make up the matter around us, negatively charged particles called electrons have properties such as spin and orbital angular momentum. Researchers at Martin Luther University Halle-Wittenberg have developed a theoretical framework which allows them to simulate the dynamics of the spin and orbital angular momentum of electrons in materials when probed with an ultrafast laser pulse. Using this framework, they are able to simulate different materials and improve our understanding of dynamics on an atomic scale.