Overview

Posted
1 month ago
Internship Type
Remote Status
Location
Remote, USA
Education Level
Education Status
Not specified
Field of Study
All Majors
Categories
Not specified
Tags
Not specified
Cohere's mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. Ship state of the art models to production. Design and implement novel research ideas. Build elegant training/deployment pipelines. Join us at a pivotal moment, shape what we build and wear multiple hats as an intern! To be eligible you should be a student currently enrolled in a post-secondary program, available for a full-time 3-6 month internship, co-op, or research work term. As a Machine Learning Intern, you will: Design, train and improve upon cutting-edge models. Help develop new techniques to train and serve models safer, better, and faster. Train extremely large-scale models on massive datasets. Explore continual and active learning strategies for streaming data. Learn from experienced senior machine learning technical staff. Work closely with product teams to develop solutions. You may be a good fit if you have: Proficiency in Python and related ML frameworks such as Tensorflow, TF-Serving, JAX, and XLA/MLIR. Experience using large-scale distributed training strategies. Familiarity with autoregressive sequence models such as Transformers. Strong communication and problem-solving skills. A demonstrated passion for applied NLP models and products. Bonus: experience writing kernels for GPUs using CUDA. Bonus: experience training on TPUs. Bonus: papers at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).