At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We're looking for people who are determined to make life better for people around the world.
The Advisor Federated Learning Data Scientist plays an essential leadership role, responsible for identifying, assessing, and implementing cutting-edge algorithmic solutions that leverage diverse datasets while ensuring data privacy and security for our partners. This position requires comprehensive knowledge of small molecule drug development, ADME/Tox, antibody engineering, and/or genetic medicine, combined with expertise in data science and statistical analysis to develop sophisticated models utilizing federated learning. This position will be instrumental in advancing Lilly's pipeline by designing critical algorithms and workflows that expedite the creation of transformative therapies.
This role is centered on creating sophisticated models that can simultaneously learn to predict multiple, related endpoints from decentralized data sources. The candidate will address the challenges of learning from heterogeneous clients, where each may only possess data for a subset of the desired tasks.
Key Responsibilities
Multi-Task Model Design: Architect and implement advanced multi-task learning (MTL) models that effectively leverage shared representations across tasks to improve predictive performance and data efficiency in a federated ecosystem.
Handling Data Heterogeneity: Develop novel algorithms specifically designed to address extreme task and feature heterogeneity across clients. This includes creating personalized models, implementing meta-learning approaches, or designing gradient aggregation methods that are robust to non-IID data.
Knowledge Transfer & Regularization: Investigate and apply techniques to manage the balance between shared and task-specific learning. Implement regularization methods to prevent negative transfer (where learning one task hurts the performance of another) and encourage positive knowledge sharing.
Problem Formulation: Collaborate closely with domain experts and stakeholders to define complex biological or chemical endpoints. Translate these scientific problems into a well-posed multi-task learning framework, identifying relevant tasks and data sources.
Model Validation in MTL: Establish rigorous validation and evaluation frameworks for federated multi-task models. This includes defining appropriate metrics for each task and developing strategies to assess overall model performance and fairness across different clients and tasks.
Interpretability and Explainability (XAI): Implement XAI techniques to understand and explain the predictions of complex multi-task models. Uncover the relationships between different endpoints as learned by the model to generate novel scientific insights.
Code & Model Governance: Write clean, high-quality, and reproducible code. Contribute to internal libraries and ML platforms. Implement version control for data, code, and models to ensure robust and transparent research.
Cross-Functional Collaboration: Work in a collaborative, multi-disciplinary team alongside software engineers, MLOps specialists, privacy experts, and domain scientists to translate research concepts into practical, impactful solutions.
Literature Review & Innovation: Maintain a thorough understanding of the latest advancements in federated learning, deep learning, and related fields to drive innovation and contribute to the team's research strategy.
Basic Qualifications
PhD in a data science field such as Biostatistics, Statistics, Machine Learning, Computational Biology, Computational Chemistry, Physics, Applied Mathematics, or a related field from an accredited college or university.
Minimum of 2 years of experience in the biopharmaceutical industry or related fields, with demonstrated expertise in drug discovery and early development.
Additional Preferences
Experience in developing statistical and machine learning models for complex endpoints.
Broad understanding of emerging scientific and technical breakthroughs.
Exceptional interpersonal and communication skills, with a keen ability to understand, empathize, and navigate complex relationships and dynamics.
Outstanding EQ and strong problem-solving, analytical, and project management skills.
Highly self-motivated and organized.
Demonstrated ability to connect and influence at various levels across disciplines, both externally and internally.
Learning Agility: Ability to quickly adapt to changing circumstances, learn from past experiences, and apply those learnings to new situations.
Portfolio Mindset: Strong ability to think with a portfolio-level mentality, ensuring that individual program decisions align with the overall goals of Catalyze360.
Independent self-starter who is able to work without supervision.
This is a site-based role in Indianapolis (preferred), San Diego (preferred), San Francisco, or Boston. Relocation is provided.
Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form (https://careers.lilly.com/us/en/workplace-accommodation) for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.
Lilly is proud to be an EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status.
Our employee resource groups (ERGs) offer strong support networks for their members and are open to all employees. Our current groups include: Africa, Middle East, Central Asia Network, Black Employees at Lilly, Chinese Culture Network, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ+ Allies), Veterans Leadership Network (VLN), Women's Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups.
Actual compensation will depend on a candidate's education, experience, skills, and geographic location. The anticipated wage for this position is $142,500 - $228,800.
Full-time equivalent employees also will be eligible for a company bonus (depending, in part, on company and individual performance). In addition, Lilly offers a comprehensive benefit program to eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision and prescription drug benefits; flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities). Lilly reserves the right to amend, modify, or terminate its compensation and benefit programs in its sole discretion and Lilly's compensation practices and guidelines will apply regarding the details of any promotion or transfer of Lilly employees.
#WeAreLilly