Staff ML Engineer
We're looking for a
Staff ML Engineer
Full-time · NYC · Hybrid
About the role...
Cosmos is looking for a Staff ML Engineer specializing in personalization and recommendation systems to join our engineering team in New York City. You'll lead the development of cutting-edge machine learning models that enhance user experience, drive engagement, and personalize content for millions of users.
The candidate should have extensive experience in designing and implementing machine learning systems, specifically personalization and recommendation models, that can scale reliably and handle rapid user growth. As Cosmos continues to scale, this role will be crucial in addressing challenges related to growing data volumes, optimizing our ML infrastructure for increased demand, and delivering highly personalized, seamless user experiences. A core part of this role will involve striking the balance between delivering highly aesthetic, curated content and ensuring users feel deeply connected to the recommendations they receive.
What you'll do:
Architect, develop, and scale state-of-the-art personalization and recommendation models leveraging deep learning, collaborative filtering, embeddings, and real-time inference.
You'll be embedded on the product engineering team, collaborating with engineers and company leads to design experiments, evaluate performance, and continuously improve recommendation and personalization quality.
Own end-to-end deployment of ML pipelines, from data exploration and prototyping to deployment, monitoring, and iteration.
Mentor team members, fostering growth in ML expertise across Cosmos engineering.
Stay current with industry trends, actively applying insights and techniques from leading research to practical business solutions.
Who you are:
Significant hands-on experience (5+ years) designing, implementing, and deploying large-scale recommendation and personalization systems in production.
Expert knowledge of ML frameworks and tools such as TensorFlow, PyTorch, or similar.
Proven experience optimizing model performance and latency for real-time applications.
Familiarity with large-scale data systems (Spark, Flink, Kafka, Snowflake, etc.).
Strong coding skills in Python and familiarity with cloud platforms like AWS, particularly SageMaker, ECS, and EKS.
Excellent communication skills, with experience aligning ML strategies to broader company objectives
Benefits & perks:
Premium health, dental, and vision
20 days PTO, plus company holidays, plus a 2-week winter break
Monthly team events and an international offsite in early 2026
Top-spec MacBook Pro, Apple Studio Display
Monthly stipend for software and tools
Fully covered commute in NYC
Daily lunch stipend at the office
Annual comprehensive blood panel
