Build the control plane for the next generation of AI.
We're a small team solving a hard infrastructure problem. If you've run GPU workloads at scale and know the pain firsthand, we'd love to talk.
We ship to real clusters.
Every line of code runs on production GPU infrastructure. No toy benchmarks, no sandboxed demos — we operate where it matters.
Operators stay in the loop.
We build tools that augment human judgment, not replace it. Every recommendation is reviewable, reversible, and auditable.
Clarity over cleverness.
We explain what our system sees and why it recommends what it recommends. No black boxes — inside or outside the product.
Open roles
Founding Engineer — Platform
Build the core control plane: scanner, rules engine, and remediation workflows. You'll own architecture decisions and ship directly to production GPU fleets.
Responsibilities
- Design and build the piqc scanner — the Kubernetes-native component that discovers and inspects live inference workloads
- Develop and maintain the rules engine that encodes GPU optimization expertise and surfaces actionable recommendations
- Integrate with Temporal to implement durable, human-in-the-loop remediation workflows
- Build multi-cluster telemetry collection across vLLM, TGI, and other inference servers
- Own the full deployment lifecycle — from local dev to production GPU clusters
Qualifications
- 5+ years of backend or infrastructure engineering experience
- Deep familiarity with Kubernetes — controllers, operators, and the scheduling layer
- Experience running or building tooling for GPU workloads (inference, training, or HPC)
- Proficiency in Go or Python; comfort with both
- Strong opinions about observability, reliability, and operational correctness
- Startup DNA — you move fast, own outcomes, and don't wait to be told what to build
Solutions Engineer
Work directly with GPU cloud providers and inference platform teams to onboard, deploy, and get value from Paralleliq. You're the bridge between product and customer.
Responsibilities
- Lead technical onboarding for new customers — from cluster access to first recommendation surfaced
- Diagnose GPU waste patterns in customer environments and translate findings into actionable insights
- Work with the engineering team to close product gaps discovered during customer deployments
- Build repeatable onboarding playbooks and technical documentation
- Run discovery calls and technical demos with prospects at GPU cloud providers and enterprise AI teams
Qualifications
- 3+ years in a solutions engineering, customer engineering, or technical account management role
- Hands-on experience with Kubernetes and cloud infrastructure (AWS, GCP, or Azure)
- Familiarity with AI/ML infrastructure — model serving, GPU utilization, inference optimization
- Ability to read and understand Python and YAML; light scripting for customer environments
- Strong communicator — equally comfortable in a Slack thread and a C-suite demo
- Experience working with early-stage products where the playbook doesn't yet exist
Apply
Don't see your role? Select “General Interest” and tell us what you've built.