Audio AI Engineer (human)
Software Engineering, Data Science
Metzingen, Germany
NEURA Robotics • Metzingen
Give Our Robots Consciousness
In the AI Department, you'll develop the cognitive abilities of our robots, enabling them to understand and interact with their surroundings. You'll work on algorithms that make machines intelligent and adaptive—ranging from object recognition and language processing to decision-making in complex environments. You'll be part of a team that pushes technological boundaries and collaborates closely with software, hardware, and product management teams. Using cutting-edge tools and methods from machine learning and AI, you'll work on projects that have a direct impact on our products. If you're excited about making machines smarter and shaping the future of human-machine interaction, the AI Department is the perfect place for you.
- Full-time
Hearing is essential to how a robot understands and responds to the world. At NEURA Robotics, audio is a first-class modality: spoken instructions, contact sounds, and ambient cues all inform autonomous action. As Audio AI Engineer, you own the real-time audio pipeline on the robot, the models that turn sound into meaning, and the voice interface that lets people speak to our humanoids the way they would to another person.
The role can emphasize conversational AI, audio ML modeling, or embedded audio DSP. We expect depth in at least one area and breadth across the others; you will lead where strongest and collaborate with AI and hardware teams on the rest.
Your mission & challenges
- Voice Interaction Stack: You build and own the edge-to-cloud hybrid automatic speech recognition, text-to-speech, wake-word, voice activity detection, and natural language understanding pipelines that connect the human voice to our robot's cognitive core, optimizing for low latency, multi-speaker scenarios, and noisy real-world environments.
- Audio Encoder Research: You design, train, and integrate audio encoders that feed our foundation models, and develop the ambient and contact-acoustic event recognition that gives our robot situational awareness.
- Real-Time Audio Pipeline: You architect the shared audio substrate from microphones to model input - acquisition, denoising, beamforming, source separation, and tight synchronization with vision and proprioception streams - and optimize it for our on-robot compute and latency budgets.
- Models, Data & Evaluation: You evaluate, fine-tune, and deploy state-of-the-art models across speech and general audio, drive data collection from real deployments, and build the evaluation infrastructure that turns recordings into measurable model improvements.
- Sensor Strategy & Integration: You help select and qualify audio hardware (mic arrays, contact and tactile microphones, ADC frontends) with the hardware team, define calibration and mounting requirements, and ensure clean integration with the AI, hardware, and agentic stacks.
- An excellent Master's or PhD in Computer Science, Electrical Engineering, Computational Linguistics, or a related field.
- 3+ years of professional experience in audio-related AI engineering.
- A proven track record: your projects show measurable impact, whether through publications, shipped systems, or both.
- Depth in at least one of the following, with curiosity and breadth across the others:
  - Conversational AI: ASR, TTS, NLP/NLU, dialogue systems, real-time speech systems with LLMs.
  - Audio ML modeling: audio representation learning, multimodal / VLA foundation models with an audio branch, generative audio.
  - Embedded audio DSP: real-time signal processing, mic-array processing, low-level audio I/O, quantized inference for on-device deployment.
- Strong programming skills in Python; solid C/C++ a plus for real-time and on-device work.
- Familiarity with ROS or robotics middleware is a plus.
- Experience with agentic frameworks and LLM tool-use is a plus.
- Experience with audio simulation, room acoustics, or spatial audio is a plus.
- Hands-on experience setting up audio recording equipment for ML data collection (microphone selection, placement, calibration) is nice to have.
- Team spirit, initiative, and the ability and willingness to explore new paths.
- Excellent English skills; German is optional but welcome.
What you can look forward to
Creative Freedom and Agility
Enjoy a dynamic, self-reliant work culture with flat hierarchies, flexible hours, and 30 vacation days. Ideal for anyone seeking an inspiring professional setting, whether you're just starting out or already an experienced executive.
Passion for Winning
A passionate and highly skilled team of international experts aiming to redefine robot assistants.
Attractive Compensation
Enjoy a competitive salary package along with exclusive employee discounts.
One Team
Whether it's a summer party or company town hall meetings, we celebrate our successes together.
Professional Growth
Support for your personal and professional development.
Our values. The cornerstones of our success.
We are a team. We strive to achieve great things by promoting the success of our colleagues and partners.
We strive for technological progress in order to give people back their valuable time for enjoyable activities.
We strive to revolutionize the world of robotics by pushing the boundaries of technology every day.
We show a high level of appreciation through open communication and transparency.
We do our best to always be two steps ahead. We achieve this through empowerment, freedom of action and personal responsibility.
People are at the center of everything we do.
Our Location
Our headquarters in Metzingen and Riederich are the heart of our company. They are home not just to our offices, but also to our production facilities, Academy, logistics, and Tech Labs, all working together to turn ideas into reality. Riederich itself is a small, peaceful town, just a kilometer away from Metzingen, a city with its own unique character. Metzingen is globally renowned as Outlet City, attracting visitors from all over the world. Here, you can enjoy exclusive designer stores in a relaxed and charming setting. The city also offers a variety of restaurants and cafés, along with a down-to-earth Swabian coziness that is perfect for unwinding after work.
Our application process
We keep our application process transparent and efficient, and we look forward to getting to know you along the way.
Your insight into our daily work
Our Mission
"Our goal was to develop the world's first cognitive robot that can work with people, learn from them and provide them with targeted support. And that is exactly what we have achieved. But there is much more to come!"