
Senior Member of Technical Staff, Multimodal AI

Senior Member of Technical Staff, Multimodal AI

Senior Member of Technical Staff, Multimodal AI
Cohere
Cohere is seeking a Senior Member of Technical Staff focused on Multimodal AI to design and develop advanced AI systems that integrate text, speech, and vision. The role involves conducting research on multimodal representation learning and collaborating with a talented team to push the boundaries of AI technology. Ideal candidates should have strong software engineering skills, expertise in Python, and experience with deep learning frameworks.
Qualification
- Exceptional software engineering skills with a proven track record of building robust systems.
- Strong command of Python and experience with deep learning frameworks like JAX, PyTorch, and TensorFlow.
- Knowledge of distributed training strategies for large-scale multimodal models.
- Familiarity with autoregressive models for tasks like image/video captioning and speech-to-text generation.
- Bonus: Publications in top-tier venues demonstrating expertise in the field.
Responsibility
- Design and develop cutting-edge multimodal AI systems integrating text, speech, and vision.
- Conduct research and experiments on advanced compute infrastructure for multimodal representation learning.
- Explore novel ideas in transfer learning and multimodal capabilities.
- Collaborate closely with world-class teams to enhance expertise in the field.
- Contribute to the development of robust and scalable AI systems.




