Company: Bestinfo Systems LLC
Location: San Francisco, CA
Posted on: April 17
(Senior) Software Engineer - Compute Platform _ San Francisco-CA_Full-Time (FTE)_Direct Hire
Position: (Senior) Software Engineer - Compute Platform
Job Type: Full-Time (FTE)
Location: San Francisco-CA
Base Salary: $100,000 to $200,000+Best-in-class benefits
Role Impact:
This is a hybrid role spanning both our developer platform and infrastructure layers. You'll work on two key areas:
* Our developer-facing platform for AI workload management
* The underlying distributed infrastructure that powers our training systems
Core Technical Responsibilities:
Platform Development:
* Build intuitive web interfaces for AI workload management and monitoring
* Develop REST APIs and backend services in Python
* Create real-time monitoring and debugging tools
* Implement user-facing features for resource management and job control
Infrastructure Development:
* Design and implement distributed training infrastructure in Rust
* Build high-performance networking and coordination components
* Create infrastructure automation pipelines with Ansible
* Manage cloud resources and container orchestration
* Implement scheduling systems for heterogeneous hardware (CPU, GPU, TPU)
Technical Requirements:
Platform Skills:
* Strong Python backend development (FastAPI, async)
* Modern frontend development (TypeScript, React/Next.js, Tailwind)
* Experience building developer tools and dashboards
* RESTful API design and implementation
Infrastructure Skills:
* Systems programming experience with Rust
* Infrastructure automation (Ansible, Terraform)
* Container orchestration (Kubernetes)
* Cloud platform expertise (GCP preferred)
* Observability tools (Prometheus, Grafana)
Nice to Have:
* Experience with GPU computing and ML infrastructure
* Knowledge of AI/ML model architecture and training
* High-performance networking implementation
* Open-source infrastructure contributions
* WebSocket/real-time systems experience
What We Offer:
* Competitive compensation with significant equity and token incentives
* Flexible work arrangement (remote or San Francisco office)
* Full visa sponsorship and relocation support
* Professional development budget for courses and conferences
* Regular team off-sites and conference attendance
* Opportunity to shape the future of decentralized AI development
Skills and Certifications:
*Strong Python backend development (FastAPI, async)
*Modern frontend development (TypeScript, React/Next.js, Tailwind)
*Experience building developer tools and dashboards
*Systems programming experience with Rust
*Infrastructure automation (Ansible, Terraform)
*Container orchestration (Kubernetes)
*Cloud platform expertise (GCP preferred)
*Observability tools (Prometheus, Grafana)
Candidate Details:
*Seniority Level - Associate
*Minimum Education - Bachelor's Degree