<aside>
💡 Our goal is an AI for voice conversations that's as natural as talking to a person.
MetaVoice is founded by
Today's voice AI fails at real-world conversations. It’s slow, turn-based like a walkie-talkie, breaks with interruptions, and doesn’t understand emotion.
Developers can't build compelling experiences and users disengage. This limits voice AI to simple receptionist tasks and basic customer support, blocking meaningful services (sales, therapy, coaching) where dialogue and emotional intelligence matter most. Scaling current tech does not work.
Our approach is a duplex speech-to-speech model that learns conversational behaviour directly from data.
That’s how we make voice the most natural way to interact with AI.
</aside>
Requirements and Experience
- Experience building infrastructure & distributed data pipelines to process 10s of TBs of data
- Experience working with multimodal data in the context of AI/ML products or systems
- Demonstrated ability to learn quickly and adapt in fast-paced environments
- Experience with batch processing, real-time streaming systems and distributed orchestration (e.g., Spark, Kafka, Flyte, Kubernetes)
Bonus
- Built something yourself (project, startup, side-hustle, etc.) or early-stage startup experience.
- Experience creating transformation pipelines for speech processing (e.g., transcription, diarization, enhancement, filtering)
What we offer
- Change the world (when we succeed)
- Environment to do the best work of your life.
- Small team, great people.
- Opportunity to make cutting-edge research work from scratch for production scenarios.
- Work with 100+ TBs of data and 10B+ parameter models.
Culture
- We're an in-person team in San Francisco & love working together. It helps us learn from one another and make decisions quickly.
- We ship fast & obsess over making customers happy.
- We offer high autonomy, allowing everyone to do their best work.
Compensation & Benefits