Audio AI Engineer (human)

NEURA Robotics

Software Engineering, Data Science

Metzingen, Germany

Posted on Jun 17, 2026


Give Our Robots Consciousness

In the AI Department, you'll develop the cognitive abilities of our robots, enabling them to understand and interact with their surroundings. You'll work on algorithms that make machines intelligent and adaptive—ranging from object recognition and language processing to decision-making in complex environments. You'll be part of a team that pushes technological boundaries and collaborates closely with software, hardware, and product management teams. Using cutting-edge tools and methods from machine learning and AI, you'll work on projects that have a direct impact on our products. If you're excited about making machines smarter and shaping the future of human-machine interaction, the AI Department is the perfect place for you.

  • Full-time

Metzingen

Starting immediately

Hearing is essential to how a robot understands and responds to the world. At NEURA Robotics, audio is a first-class modality: spoken instructions, contact sounds, and ambient cues all inform autonomous action. As Audio AI Engineer, you own the real-time audio pipeline on the robot, the models that turn sound into meaning, and the voice interface that lets people speak to our humanoids the way they would to another person.

The role can emphasize conversational AI, audio ML modeling, or embedded audio DSP. We expect depth in at least one area and breadth across the others; you will lead where strongest and collaborate with AI and hardware teams on the rest.

Your mission & challenges
  • Voice Interaction Stack: You build and own the edge-to-cloud hybrid automatic speech recognition, text-to-speech, wake-word, voice activity detection, and natural language understanding pipelines that connect the human voice to our robot's cognitive core, optimizing for low latency, multi-speaker scenarios, and noisy real-world environments.

  • Audio Encoder Research: You design, train, and integrate audio encoders that feed our foundation models, and develop the ambient and contact-acoustic event recognition that gives our robot situational awareness.

  • Real-Time Audio Pipeline: You architect the shared audio substrate from microphones to model input - acquisition, denoising, beamforming, source separation, and tight synchronization with vision and proprioception streams - and optimize it for our on-robot compute and latency budgets.

  • Models, Data & Evaluation: You evaluate, fine-tune, and deploy state-of-the-art models across speech and general audio, drive data collection from real deployments, and build the evaluation infrastructure that turns recordings into measurable model improvements.

  • Sensor Strategy & Integration: You help select and qualify audio hardware (mic arrays, contact and tactile microphones, ADC frontends) with the hardware team, define calibration and mounting requirements, and ensure clean integration with the AI, hardware, and agentic stacks.

What we can look forward to
  • An excellent Master's or PhD in Computer Science, Electrical Engineering, Computational Linguistics, or a related field.

  • 3+ years of professional experience in audio-related AI engineering.

  • A proven track record: your projects show measurable impact, whether through publications, shipped systems, or both.

  • Depth in at least one of the following, with curiosity and breadth across the others:

    • Conversational AI: ASR, TTS, NLP/NLU, dialogue systems, real-time speech systems with LLMs.

    • Audio ML modeling: audio representation learning, multimodal / VLA foundation models with an audio branch, generative audio.

    • Embedded audio DSP: real-time signal processing, mic-array processing, low-level audio I/O, quantized inference for on-device deployment.

  • Strong programming skills in Python; solid C/C++ a plus for real-time and on-device work.

  • Familiarity with ROS or robotics middleware is a plus.

  • Experience with agentic frameworks and LLM tool-use is a plus.

  • Experience with audio simulation, room acoustics, or spatial audio is a plus.

  • Hands-on experience setting up audio recording equipment for ML data collection (microphone selection, placement, calibration) is nice to have.

  • Team spirit, initiative, and the ability and willingness to explore new paths.

  • Excellent English skills; German is optional but welcome.

What you can look forward to

Creative Freedom and Agility

Enjoy a dynamic, self-reliant work culture with flat hierarchies, flexible hours, and 30 vacation days. It is an inspiring professional setting whether you're just starting out or an experienced executive.

Passion for Winning

A passionate and highly skilled team of international experts aiming to redefine robot assistants.

Attractive Compensation

Enjoy a competitive salary package along with exclusive employee discounts.

One Team

Whether it's a summer party or company town hall meetings, we celebrate our successes together.

Professional Growth

Support for your personal and professional development.

Our values. The cornerstones of our success.

STRONGER TOGETHER

We are a team. We strive to achieve great things by promoting the success of our colleagues and partners.

PASSION DRIVES US

We strive for technological progress in order to give people back their valuable time for enjoyable activities.

MAKING A CHANGE

We strive to revolutionize the world of robotics by pushing the boundaries of technology every day.

TRUST AND HONESTY

We show a high level of appreciation through open communication and transparency.

WE SPEED THINGS UP

We do our best to always be two steps ahead. We achieve this through empowerment, freedom of action and personal responsibility.

WE ARE HUMAN

People are at the center of everything we do.

Our Location

Headquarters: Innovate in Riederich, Live in Metzingen and Stuttgart

Our headquarters in Metzingen and Riederich are the heart of our company. They are home not just to our offices but also to our production facilities, Academy, logistics, and Tech Labs, all working together to turn ideas into reality. Riederich itself is a small, peaceful town just a kilometer from Metzingen, a city with its own unique character. Metzingen is globally renowned as Outlet City, attracting visitors from all over the world. Here, you can enjoy exclusive designer stores in a relaxed and charming setting. The city also offers a variety of restaurants, cafés, and a down-to-earth Swabian coziness, perfect for unwinding after work.

Our application process

We ensure a transparent and efficient process and look forward to getting to know you during the application process.


Your insight into our daily work


Our Mission

David Reger
Founder and CEO

"Our goal was to develop the world's first cognitive robot that can work with people, learn from them and provide them with targeted support. And that is exactly what we have achieved. But there is much more to come!"

Sounds interesting?

Apply now