About this role:
Wells Fargo is seeking a Principal Engineer to lead observability and reliability engineering for Marketing Technology platforms. This role focuses on enabling end-to-end visibility across a complex data ecosystem spanning SaaS platforms, hybrid cloud, and on-prem systems.
This position is responsible for designing and advancing full-stack observability for ETL pipelines, integrations, and data movement across marketing platforms. The goal is to improve reliability, reduce time to detect and resolve issues, and ensure consistent data flow supporting critical customer engagement channels.
You will join a focused team of engineers and application support professionals driving adoption of observability, automation, and Site Reliability Engineering (SRE) practices across Marketing Technology. The team partners closely with data engineering, platform teams, and vendors to deliver stable, scalable, and highly visible systems.
The role supports all aspects of the platform lifecycle-from instrumentation and monitoring design to incident response and continuous improvement-ensuring strong operational health, proactive alerting, and resilient data pipelines across distributed systems.
The team operates across a global footprint, enabling a follow-the-sun support model and consistent platform reliability.
In this role, you will:
Define observability strategy
Establish standards, patterns, and designs for monitoring distributed data systems
Drive adoption of tools such as Grafana, Splunk, Prometheus, and OpenTelemetry
Enable end-to-end visibility
Instrument ETL pipelines, integrations, and APIs with metrics, logs, and traces
Build dashboards for data flow health, latency, throughput, and error tracking
Monitor data movement across SaaS, hybrid, and on-prem environments
Strengthen data reliability
Implement proactive alerting for pipeline failures, latency spikes, and data integrity risks
Identify and isolate issues across multi-step data hops and integrations
Partner with data engineering teams to embed observability into ETL frameworks (Airflow, Informatica)
Lead incident response and problem resolution
Drive triage for high-severity data and platform incidents
Distinguish root causes across data, platform, and vendor layers
Perform root cause analysis and implement long-term fixes
Develop runbooks and playbooks for repeatable issue resolution
Manage change and vendor coordination
Ensure observability readiness for vendor patches, hotfixes, and configuration changes
Validate monitoring coverage post-change to maintain visibility and alert accuracy
Participate in change advisory processes and track vendor release cycles
Document change impacts and lessons learned for audit and compliance
Drive automation and continuous improvement
Automate observability setup for new pipelines and integrations
Implement anomaly detection and predictive alerting
Continuously refine dashboards, thresholds, and alert logic based on trends
Influence architecture and engineering practices
Partner with Enterprise Architecture and engineering teams to align solutions with standards
Advocate for scalable, observable system design across Marketing Technology
Contribute to technology strategy and roadmap decisions
Support operational excellence
Share responsibility for production support of critical data flows and applications
Improve key metrics such as availability, time to detect, and time to recover
Promote SRE practices including SLIs, SLOs, and error budgets
Required Qualifications:
7+ years of Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
5+ years designing and implementing observability solutions (e.g., Splunk, Grafana, Prometheus, OpenTelemetry)
3+ years of experience with cloud and container platforms (Kubernetes, OpenShift)
Desired Qualifications:
Experience with databases (Oracle, SQL Server, PostgreSQL, MongoDB) and strong SQL skills
Familiarity with Java-based applications and performance monitoring (JVM metrics, heap/thread analysis)
Experience with CI/CD pipelines (Jenkins, GitLab, Harness)
Strong understanding of APIs, microservices, event-driven architecture, and messaging systems (Kafka, MQ)
Ability to troubleshoot data movement across SaaS, hybrid, and on-prem systems
Strong communication skills with the ability to work across technical and business teams
Strong experience with distributed systems monitoring, logging, and tracing
Experience supporting complex ETL pipelines and data integration workflows
Proficiency in scripting and automation (Python, Bash, Ansible)
Experience with Linux/Unix and Windows server environments
Experience supporting marketing or SaaS platforms (Adobe, Salesforce, Pega, etc.)
Knowledge of data governance and compliance frameworks
Experience troubleshooting OS-level and infrastructure-related issues
Experience supporting both commercial off-the-shelf and custom-built applications
Strong analytical mindset with a focus on continuous improvement
Experience working in Agile or Kanban environments
Job Expectations:
Ability to work on-site at posted location
Relocation assistance is not available for this position
Visa sponsorship is not available for this position
Pay Range
Reflected is the base pay range offered for this position. Pay may vary depending on factors including but not limited to demonstrated examples of prior performance, skills, experience, or work location. Employees may also be eligible for incentive opportunities.
$159,000.00 - $305,000.00
Benefits
Wells Fargo provides eligible employees with a comprehensive set of benefits, many of which are listed below. Visit Benefits - Wells Fargo Jobs (https://www.wellsfargojobs.com/en/life-at-wells-fargo/benefits) for an overview of the following benefit plans and programs offered to employees.
Health benefits
401(k) Plan
Paid time off
Disability benefits
Life insurance, critical illness insurance, and accident insurance
Parental leave
Critical caregiving leave
Discounts and savings
Commuter benefits
Tuition reimbursement
Scholarships for dependent children
Adoption reimbursement
Posting End Date:
21 Jun 2026
*Job posting may come down early due to volume of applicants.
We Value Equal Opportunity
Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.
Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit's risk appetite and all risk and compliance program requirements.
Applicants with Disabilities
To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo (https://www.wellsfargojobs.com/en/diversity/disability-inclusion/) .
Drug and Alcohol Policy
Wells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy (https://www.wellsfargojobs.com/en/wells-fargo-drug-and-alcohol-policy) to learn more.
Wells Fargo Recruitment and Hiring Requirements:
a. Third-Party recordings are prohibited unless authorized by Wells Fargo.
b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.
Req Number: R-552266