Posted 9 days ago
Description
Software Engineer, AI/ML Systems – Emerald AI
About Emerald AI
We’re at a pivotal moment for AI and energy. Demand for compute is skyrocketing, but power constraints are becoming a critical bottleneck. Emerald AI sits at the intersection of these worlds, enabling AI data centers to scale without overwhelming the grid. Our Emerald Conductor software platform makes data centers flexible and responsive, allowing them to adjust power usage dynamically. This unlocks massive AI growth without major new infrastructure, while also strengthening the grid and supporting renewable energy expansion. We’re a team of experts across AI, cloud, software, and energy on a mission to scale AI sustainably, backed by leading investors and partners including Radical Ventures and NVIDIA.
About the Role
We are looking for a Software Engineer with a strong background in AI/ML systems to design and build intelligent orchestration models that drive automated decision‑making across compute infrastructure. You will work at the intersection of systems engineering and applied machine learning, developing models and pipelines that optimize workload placement, resource scheduling, and power‑performance tradeoffs in large‑scale data center environments.
Key Responsibilities
- Design and implement approaches for orchestration of AI/ML workloads considering resource allocation, load balancing, data management, and performance.
- Contribute to the design and implementation of performance monitoring for various optimization strategies.
- Plan and conduct experiments with training and inference workloads using state‑of‑the‑art AI models, such as LLMs.
- Develop mechanisms for power optimization and control with deep knowledge of AI systems design.
- Develop predictive models for forecasting various parameters of data center compute jobs and demand.
Minimum Requirements
- Bachelor’s/Master’s in Computer Science, Computer Engineering, or Electrical Engineering with 5+ years of experience, or a PhD with 2+ years of industry experience.
- 3+ years of experience in systems engineering or backend infrastructure.
- Strong programming skills in multiple languages (Python, Go, Rust, C++, etc.) focused on high‑performance and reliable systems.
- Experience with ML workflows and tools (e.g., PyTorch, scikit‑learn), particularly for modeling system behavior.
- Deep understanding of distributed systems, resource scheduling, and telemetry instrumentation.
- Familiarity with platforms such as Kubernetes, Slurm, Ray, or similar schedulers.
Preferred Requirements
- Experience applying ML to systems problems (e.g., load prediction, anomaly detection, reinforcement learning for scheduling).
- Knowledge of workload characteristics in AI/ML pipelines (training, inference, batch vs. real‑time).
- Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry) and data infrastructure (Kafka, Parquet, etc.).
- Contributions to open‑source infrastructure projects.
What We Offer
- Make an impact by solving the AI power bottleneck and shaping how data centers scale sustainably.
- Join a world‑class team of AI, cloud, software, and energy experts in a collaborative, low‑ego environment.
- Build from 0→1— influence strategy, GTM, org design, and customer/investor engagement from day one.
- Competitive pay and equity; stock options to share in the value you help create.
- Comprehensive benefits, including medical, dental, vision, and 401(k) matching.
- Flexible location: work from D.C., Boston, or the Bay Area with one WFH day per week; backed by top investors, including Radical Ventures and NVIDIA.
We are an equal‑opportunity employer and value diversity. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability, and any other protected ground under applicable legislation.
Emerald AI respects the dignity and independence of people with disabilities and is committed to giving them the same opportunity to succeed as all other employees. Inclusiveness is core to our culture, and we strive to provide the best interview experience. We make reasonable accommodations for applicants with disabilities; please reach out to the Talent team if a reasonable accommodation is needed.
#J-18808-Ljbffr