Experience the HCA Healthcare difference where colleagues are trusted, valued members of our healthcare team. Grow your career with an organization committed to delivering respectful, compassionate care, and where the unique and intrinsic worth of each individual is recognized. Submit your application for the opportunity below: Staff ML Ops Engineer
Job Summary and Qualifications
Position Summary
The Staff MLOps Engineer plays a pivotal role in shaping our MLOps practice within ITG by building and enhancing a scalable, reliable, and cutting-edge Machine Learning Operations (MLOps) platform. This role combines deep cloud architecture expertise with advanced AI/ML knowledge to develop solutions that streamline workflows, enable seamless collaboration, and drive innovation.
As a key contributor to the organization's AI/ML strategy, you will partner with cross-functional teams, including data scientists, product managers, and cloud engineers, to align platform development with business objectives. Your work will directly support the deployment of Responsible AI solutions that prioritize transparency, fairness, and ethical practices.
Major Responsibilities:
Platform Development: Lead the enhancement of the AI platform to improve the developer experience for data and ML engineers. Optimize workflows by integrating state-of-the-art tools and technologies, ensuring scalability and efficiency.
Cloud Infrastructure Design and Management: Architect and manage the cloud infrastructure supporting the MLOps platform, leveraging infrastructure-as-code (IaC) tools like Terraform. Optimize for scalability, security, cost-effectiveness, and high availability.
Cross-Functional Collaboration and Stakeholder Management: Partner with data science, product management, engineering, and business teams to understand their requirements and ensure the MLOps platform effectively supports their needs. Effectively communicate technical concepts and strategies to both technical and non-technical audiences.
AI/ML Reliability and Observability: Collaborate with the AI/ML reliability engineering team to design and implement components that ensure the platform's operational reliability, observability, and fault tolerance.
Cross-Disciplinary Knowledge: Apply knowledge from related disciplines, such as data science and health/biology sciences, to design holistic MLOps solutions that meet the unique needs of the organization.
DevOps for Machine Learning Workloads: Build and maintain robust DevOps pipelines tailored for ML workflows, enabling automated model training, testing, deployment, and monitoring.
Tool Development and System Reliability: Design and manage tools to enhance platform reliability, including dashboards, logging systems, and alerting frameworks, to ensure seamless operations.
Performs other duties as assigned
Practices and adheres to the "Code of Conduct" philosophy and "Mission and Value Statement."
Education & Experience:
Bachelor's degree preferred
Master's degree in Computer Science, Data Science, AI, or related field preferred
5+ years of experience in ML Ops, Dev Ops, or related role required
Knowledge, Skills, Abilities, Behaviors Required:
Service and Quality Excellence: Ability to demonstrate an uncompromising commitment to delivering exceptional care to create an unmatched value proposition for our patients.
Honor our Mission and Values: Ability to build trust and act with authenticity to cultivate a culture of integrity, inclusion, and mutual respect.
Effective Decision Making: Ability to make timely, informed decisions that are in the best interest of our patients, employees, providers, community and HCA.
Attain and Leverage Strategic Relationships: Ability to develop and strengthen collaborative relationships with both internal and external stakeholders to advance the care of our patients and the growth of HCA.
Lead and Develop Others: Ability to lead others to accomplish organizational goals and objectives; provide meaningful coaching and mentoring to increase the capabilities of individuals and teams and drive employee engagement.
Communicate with Impact: Ability to deliver information in a clear, concise, and compelling manner to effectively engage others and achieve desired results.
Achieve Success through Change: Ability to identify opportunities for improvement and innovation, remove barriers and resistance, and enable desired behaviors.
Drive Execution and Financial Results: Ability to commit to the success and financial wellbeing of HCA by challenging others to excel and hold themselves and others accountable for achieving results.
Advanced proficiency in cloud platforms, especially Google Cloud Platform (GCP). Experience with on-premises and edge deployments is a plus.
Solid understanding of AI/ML concepts, technologies, and best practices, with hands-on experience deploying ML solutions at scale.
Proven ability to work closely with peer teams, data scientists, and product managers to align platform development with strategic goals.
Proficiency in Python and other scripting tools for automation and platform optimization.
Strong analytical and troubleshooting skills, with a track record of solving complex problems under pressure.
Proven experience managing and leading cloud architecture and engineering teams preferred
Strong background in AI/ML or data science technologies and platform development.
Demonstrated expertise in leading Responsible AI initiatives, with a focus on ethical AI practices.
Excellent communication, leadership, and project management skills.
Required in office 2 days a week minimum and occasional/ intermittent required
Benefits
HCA Healthcare, offers a total rewards package that supports the health, life, career and retirement of our colleagues. The available plans and programs include:
Comprehensive benefits for medical, prescription drug, dental, vision, behavioral health and telemedicine services
Wellbeing support, including free counseling and referral services
Time away from work programs for paid time off, paid family leave, long- and short-term disability coverage and leaves of absence
Savings and retirement resources , including a 401(k) Plan with a 100% match on 3% to 9% of pay (based on years of service), Employee Stock Purchase Plan, flexible spending accounts, preferred banking partnerships, retirement readiness tools, rollover support and financial wellbeing counseling
Education support through tuition assistance, student loan assistance, certification support, dependent scholarships and a partnership with Galen College of Nursing
Additional benefits for fertility and family building, adoption assistance, life insurance, supplemental health protection plans, auto and home insurance, legal counseling, identity theft protection and consumer discounts
Learn more about Employee Benefits (https://careers.hcahealthcare.com/pages/employee-benefits-and-rewards)
Note: Eligibility for benefits may vary by location.
HCA Healthcare has been recognized as one of the World's Most Ethical Companies® by the Ethisphere Institute more than ten times. In recent years, HCA Healthcare spent an estimated $3.7 billion in cost for the delivery of charitable care, uninsured discounts, and other uncompensated expenses.
"There is so much good to do in the world and so many different ways to do it."- Dr. Thomas Frist, Sr.
HCA Healthcare Co-Founder
If you find this opportunity compelling, we encourage you to apply for our Staff ML Ops Engineer opening. We promptly review all applications. Highly qualified candidates will be directly contacted by a member of our team. We are interviewing - apply today!
We are an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.