Engineering · New Grad

AI / Software Engineer (New Grad)

BS or MS, 0–4 years · Remote (US)

Build the Nexus validation engine or the Expert Network platform. Two tracks, one role. You indicate preference in your application; we align in interviews.

Ready to apply? Sign up to the Expert Network and select BelmanAI.

Apply now →

About BelmanAI

BelmanAI builds AI evaluation and deployment infrastructure for critical operations. Our software validates whether AI systems perform as specified under real operating conditions, and produces the certified record that auditors, regulators, and operators require.

The role

Both tracks operate at the same level of ownership. Neither is a support role.

ML track. Build the Nexus validation engine: evaluation pipelines, rule corpus tooling, knowledge graph components, certification artifact generation. Your work determines whether AI systems pass or fail the BelmanAI standard.

SWE track. Build the Expert Network platform: APIs, data models, engagement workflow tooling, matching infrastructure. Your work connects businesses with domain experts and routes the engagement from scoping to delivery.

Key responsibilities

ML track: Design and implement evaluation pipelines that assess AI model output against domain-specific rules.
ML track: Build graph traversal and rule coverage scoring components.
ML track: Collaborate with Forward Deployed Engineers to translate field observations into validation logic.
SWE track: Build and maintain Expert Network APIs, data models, and backend infrastructure.
SWE track: Implement engagement matching logic and workflow tooling.
SWE track: Develop platform features that support the full engagement lifecycle, from scoping to delivery.
Both tracks: Maintain test coverage and documentation for all components you ship.

Required qualifications

BS or MS in Computer Science, Electrical Engineering, AI/ML, or equivalent.
Python proficiency.
Ability to own a project end-to-end: design, implement, test, ship, maintain.
ML track: Foundations in machine learning (coursework, research, or project experience), familiarity with LLM APIs (Anthropic, OpenAI, or Google Gemini), exposure to evaluation or benchmarking frameworks.
SWE track: Backend development experience in Python (FastAPI, Django, or equivalent), SQL and relational database fundamentals, REST API design, basic cloud experience (AWS or GCP).

Preferred qualifications

Prior research experience (thesis, lab, independent project).
Experience in or exposure to regulated industry contexts.
Open source contributions or shipped personal projects.
Knowledge graph or algorithmic coursework experience (ML track).

Back to all open roles