Hire IT People

Data Science | Machine Learning With MLOps | SRE

Company: Hire IT People

Location: Austin, Texas

Posted on: January 03

Job Seekers, Please send resumes to [email protected]
Must Have Skills:
  • Machine Learning Operations
  • Kubernetes (K8s) for MLOps
  • AI/ML, Jupyter Notebook, and Jenkins
Detailed Job Description:
  • The role is for a Big Data Engineer with MLOps SRE expertise and 7+ years of experience in the role.
  • A solid understanding of AI/ML, Jupyter Notebook, and Jenkins is essential for this role.
  • The associate should also have a basic understanding of Kubernetes (K8s) and experience with Kubeflow for MLOps.
  • The person will be responsible for the end-to-end machine learning lifecycle on our in-house Kubernetes (K8s) cluster.
  • Ensuring the stability and availability of production services is a key responsibility.
  • Resolve incidents when they occur. Maintain a culture of continuous learning and improvement in the incident resolution process.
  • The role involves developing best practices for operations.
  • The individual will be expected to create and maintain documentation as needed.
  • The associate needs to work according to a roster, which may include weekend support.
  • This role includes on-call duties to handle any urgent issues that occur outside of regular business hours.
  • The associate needs to work with team members across different geographical locations.
  • The role involves close collaboration with multiple teams to jointly resolve any major production issues.

Minimum Years of Experience: 8-10 years

Top 3 responsibilities you would expect the Subcon to shoulder and execute:
  1. Support and analysis
  2. Solution implementation
  3. Testing