ParallelIQ
Careers

Build the control plane for the next generation of AI.

We're a small team solving a hard infrastructure problem. If you've run GPU workloads at scale and know the pain firsthand, we'd love to talk.

We ship to real clusters.

Every line of code runs on production GPU infrastructure. No toy benchmarks, no sandboxed demos — we operate where it matters.

Operators stay in the loop.

We build tools that augment human judgment, not replace it. Every recommendation is reviewable, reversible, and auditable.

Clarity over cleverness.

We explain what our system sees and why it recommends what it recommends. No black boxes — inside or outside the product.

Open roles

Founding Engineer — Platform

Engineering · Remote (US) · Full-time

Build the core control plane: scanner, rules engine, and remediation workflows. You'll own architecture decisions and ship directly to production GPU fleets.

Responsibilities

  • Design and build the piqc scanner — the Kubernetes-native component that discovers and inspects live inference workloads
  • Develop and maintain the rules engine that encodes GPU optimization expertise and surfaces actionable recommendations
  • Integrate with Temporal to implement durable, human-in-the-loop remediation workflows
  • Build multi-cluster telemetry collection across vLLM, TGI, and other inference servers
  • Own the full deployment lifecycle — from local dev to production GPU clusters

Qualifications

  • 5+ years of backend or infrastructure engineering experience
  • Deep familiarity with Kubernetes — controllers, operators, and the scheduling layer
  • Experience running or building tooling for GPU workloads (inference, training, or HPC)
  • Proficiency in at least one of Go or Python, and comfort working in both
  • Strong opinions about observability, reliability, and operational correctness
  • Startup DNA — you move fast, own outcomes, and don't wait to be told what to build

Solutions Engineer

Customer Engineering · Remote (US) · Full-time

Work directly with GPU cloud providers and inference platform teams to onboard, deploy, and get value from ParallelIQ. You're the bridge between product and customer.

Responsibilities

  • Lead technical onboarding for new customers, from cluster access to the first surfaced recommendation
  • Diagnose GPU waste patterns in customer environments and translate findings into actionable insights
  • Work with the engineering team to close product gaps discovered during customer deployments
  • Build repeatable onboarding playbooks and technical documentation
  • Run discovery calls and technical demos with prospects at GPU cloud providers and enterprise AI teams

Qualifications

  • 3+ years in a solutions engineering, customer engineering, or technical account management role
  • Hands-on experience with Kubernetes and cloud infrastructure (AWS, GCP, or Azure)
  • Familiarity with AI/ML infrastructure — model serving, GPU utilization, inference optimization
  • Ability to read and understand Python and YAML; light scripting for customer environments
  • Strong communicator — equally comfortable in a Slack thread and a C-suite demo
  • Experience working with early-stage products where the playbook doesn't yet exist

Apply

Don't see your role? Select “General Interest” and tell us what you've built.
