180 Reliability Engineer jobs in Ireland

Quality & Reliability Engineer

Galway, Connacht TE Connectivity

Posted today

Job Viewed

Tap Again To Close

Job Description

Quality & Reliability Engineer
**At TE, you will unleash your potential working with people from diverse backgrounds and industries to create a safer, sustainable and more connected world.**
**Job Overview**
Reporting to the Quality Manager/Quality Supervisor, the Quality Engineer is a member of the Quality group. The successful candidate will be responsible for Quality within their prescribed area of functional responsibility. They will be working as part of a team to maintain high quality/performance standards on all TE Connectivity products.
This challenging position will require an ability to work within a collaborative environment, pursuing continuous improvement and ensuring compliance to the TE Connectivity Quality System. The focus of the role is to supply high-quality medical devices and components to deliver an Extraordinary Customer Experience.
**Job Requirements**
+ Working cross functionally with other departments promote the achievement of the health and safety goals.
+ To deliver on all KPIs that help the business achieve its goal.
+ The Quality Engineer will perform an active role in further development and continuous improvement of the Quality Management system.
+ Establish and maintain risk management principles and methods throughout the product realization process in compliance with the company's Quality Management system and applicable regulations.
+ Maintain relevant documentation to comply with quality standards and customer requirements.
+ Offer quality guidance to the entire team in support of the day-to-day deliverables.
+ Develop strong links with customer organizations and other project stakeholders.
+ Support and initiate projects to ensure continuous improvement.
+ Quality review of responsible area validation documentation.
+ Investigation of root cause and implementation of effective corrective actions to prevent re-occurrence of compliance issues.
+ Overall responsibility for production GMP standards and compliance.Establish inspection standards, sampling plans and test methods where applicable.
+ Prepare and update procedures and associated documentation.
+ Support customer audits and surveillance/accreditation audits
+ Conduct and drive audits ensuring compliance with ISO13485.
+ The Quality Engineer will perform an active role in quality planning and new product introduction from a quality perspective.
+ Develop strong links with customer organizations and other project stakeholders.
+ Quality review of responsible area validation documentation
**What your background should look like**
**Qualifications**
+ Level 8 degree in Quality or Degree in Science / Engineering / Quality field.
+ Minimum of 2 years of industry experience is required.
**Key Requirements**
+ Working knowledge of FDA/ISO/MDD Quality systems for Medical Device companies.
+ Experience within a similar role as Quality Engineer is an advantage.
+ Quality experience in component and device manufacturing desirable.
+ Excellent written and oral communication skills essential.
**Competencies**
Values: Integrity, Accountability, Inclusion, Innovation, Teamwork
**ABOUT TE CONNECTIVITY**
TE Connectivity plc (NYSE: TEL) is a global industrial technology leader creating a safer, sustainable, productive, and connected future. Our broad range of connectivity and sensor solutions enable the distribution of power, signal and data to advance next-generation transportation, energy networks, automated factories, data centers, medical technology and more. With more than 85,000 employees, including 9,000 engineers, working alongside customers in approximately 130 countries, TE ensures that EVERY CONNECTION COUNTS. Learn more at and on LinkedIn ( ,Facebook ( ,WeChat, ( Instagram andX (formerly Twitter). ( TE CONNECTIVITY OFFERS:**
We are pleased to offer you an exciting total package that can also be flexibly adapted to changing life situations - the well-being of our employees is our top priority!
- Competitive Salary Package
- Performance-Based Bonus Plans
- Health and Wellness Incentives
- Employee Stock Purchase Program
- Community Outreach Programs / Charity Events
- Employee Resource Group
**IMPORTANT NOTICE REGARDING RECRUITMENT FRAUD**
TE Connectivity has become aware of fraudulent recruitment activities being conducted by individuals or organizations falsely claiming to represent TE Connectivity. Please be advised that TE Connectivity **never requests payment or fees** from job applicants at any stage of the recruitment process. All legitimate job openings are posted exclusively on our official careers website at te.com/careers, and all email communications from our recruitment team will come **only from** **actual** **email addresses ending in @te.com** . If you receive any suspicious communications, we strongly advise you not to engage or provide any personal information, and to report the incident to your local authorities.
Across our global sites and business units, we put together packages of benefits that are either supported by TE itself or provided by external service providers. In principle, the benefits offered can vary from site to site.
Location:
GALWAY, G, IE, H91 VN2T
City: GALWAY
State: G
Country/Region: IE
Travel: Less than 10%
Requisition ID:
Alternative Locations:
Function: Engineering & Technology
TE Connectivity and its subsidiaries, affiliates, and operating units (collectively, the "Company") is committed to providing a work environment that prohibits discrimination on the basis of age, color, disability, ethnicity, marital status, national origin, race, religion, gender, gender identity, sexual orientation, protected veteran status, disability or any other characteristics protected by applicable law or regulation.
This advertiser has chosen not to accept applicants from your region.

Quality & Reliability Engineer

Galway, Connacht TE Connectivity

Posted today

Job Viewed

Tap Again To Close

Job Description

Quality & Reliability Engineer
**At TE, you will unleash your potential working with people from diverse backgrounds and industries to create a safer, sustainable and more connected world.**
**Job Overview**
Reporting to the Quality Manager/Quality Supervisor, the Quality Engineer is a member of the Quality group. The successful candidate will be responsible for Quality within their prescribed area of functional responsibility. They will be working as part of a team to maintain high quality/performance standards on all TE Connectivity products.
This challenging position will require an ability to work within a collaborative environment, pursuing continuous improvement and ensuring compliance to the TE Connectivity Quality System. The focus of the role is to supply high-quality medical devices and components to deliver an Extraordinary Customer Experience.
**Job Requirements**
+ Working cross functionally with other departments promote the achievement of the health and safety goals.
+ To deliver on all KPIs that help the business achieve its goal.
+ The Quality Engineer will perform an active role in further development and continuous improvement of the Quality Management system.
+ Establish and maintain risk management principles and methods throughout the product realization process in compliance with the company's Quality Management system and applicable regulations.
+ Maintain relevant documentation to comply with quality standards and customer requirements.
+ Offer quality guidance to the entire team in support of the day-to-day deliverables.
+ Develop strong links with customer organizations and other project stakeholders.
+ Support and initiate projects to ensure continuous improvement.
+ Quality review of responsible area validation documentation.
+ Investigation of root cause and implementation of effective corrective actions to prevent re-occurrence of compliance issues.
+ Overall responsibility for production GMP standards and compliance.Establish inspection standards, sampling plans and test methods where applicable.
+ Prepare and update procedures and associated documentation.
+ Support customer audits and surveillance/accreditation audits
+ Conduct and drive audits ensuring compliance with ISO13485.
+ The Quality Engineer will perform an active role in quality planning and new product introduction from a quality perspective.
+ Develop strong links with customer organizations and other project stakeholders.
+ Quality review of responsible area validation documentation
**What your background should look like**
**Qualifications**
+ Level 8 degree in Quality or Degree in Science / Engineering / Quality field.
+ Minimum of 2 years of industry experience is required.
**Key Requirements**
+ Working knowledge of FDA/ISO/MDD Quality systems for Medical Device companies.
+ Experience within a similar role as Quality Engineer is an advantage.
+ Quality experience in component and device manufacturing desirable.
+ Excellent written and oral communication skills essential.
**Competencies**
Values: Integrity, Accountability, Inclusion, Innovation, Teamwork
**ABOUT TE CONNECTIVITY**
TE Connectivity plc (NYSE: TEL) is a global industrial technology leader creating a safer, sustainable, productive, and connected future. Our broad range of connectivity and sensor solutions enable the distribution of power, signal and data to advance next-generation transportation, energy networks, automated factories, data centers, medical technology and more. With more than 85,000 employees, including 9,000 engineers, working alongside customers in approximately 130 countries, TE ensures that EVERY CONNECTION COUNTS. Learn more at and on LinkedIn ( ,Facebook ( ,WeChat, ( Instagram andX (formerly Twitter). ( TE CONNECTIVITY OFFERS:**
We are pleased to offer you an exciting total package that can also be flexibly adapted to changing life situations - the well-being of our employees is our top priority!
- Competitive Salary Package
- Performance-Based Bonus Plans
- Health and Wellness Incentives
- Employee Stock Purchase Program
- Community Outreach Programs / Charity Events
- Employee Resource Group
**IMPORTANT NOTICE REGARDING RECRUITMENT FRAUD**
TE Connectivity has become aware of fraudulent recruitment activities being conducted by individuals or organizations falsely claiming to represent TE Connectivity. Please be advised that TE Connectivity **never requests payment or fees** from job applicants at any stage of the recruitment process. All legitimate job openings are posted exclusively on our official careers website at te.com/careers, and all email communications from our recruitment team will come **only from** **actual** **email addresses ending in @te.com** . If you receive any suspicious communications, we strongly advise you not to engage or provide any personal information, and to report the incident to your local authorities.
Across our global sites and business units, we put together packages of benefits that are either supported by TE itself or provided by external service providers. In principle, the benefits offered can vary from site to site.
Location:
GALWAY, G, IE, H91 VN2T
City: GALWAY
State: G
Country/Region: IE
Travel: Less than 10%
Requisition ID:
Alternative Locations:
Function: Engineering & Technology
TE Connectivity and its subsidiaries, affiliates, and operating units (collectively, the "Company") is committed to providing a work environment that prohibits discrimination on the basis of age, color, disability, ethnicity, marital status, national origin, race, religion, gender, gender identity, sexual orientation, protected veteran status, disability or any other characteristics protected by applicable law or regulation.
This advertiser has chosen not to accept applicants from your region.

Principal Reliability Engineer

Galway, Connacht Life Science Recruitment Ltd

Posted today

Job Viewed

Tap Again To Close

Job Description

Job Title: Principal Reliability Engineer Location: Parkmore Galway Benefits: Bonus, Pension, Healthcare, Hybrid working (1 day from home) Job Purpose Cardiac Ablation Solutions (CAS) is an Operating Units within the Cardiac portfolio with my client. At CAS, the team are developing next generation medical technologies that treat patients with abnormal heart rhythms. Our technologies save lives and improve the quality of living for millions of patients across the world by advancing innovation for the diagnosis and ablation of cardiac arrhythmias and enabling clinicians to perform procedures with superior outcomes. Our growing and innovative portfolio provides solutions that advance and enhance care. This position will be on-site (minimum 4 days per week) based out of Parkmore, Galway, Ireland site. Job Requirements: Works closely with Research & Development in the development of Test methods to ensure that they are ready for the Test Method Validation (TMV) process. Ensures all TMVs are validated to meet the required TMV procedure and standards. Takes direction from the quality core team member in delivering day to day project deliverables as an extended QCTM. Collaborates with engineering and manufacturing functions to ensure quality standards are in place. Perform systematic reliability analysis against features, requirements, architecture, interfaces, and designs, through the appropriate application of reliability engineering techniques (e.g. fault tree analysis, failure trending and analysis, reliability forecasting, etc.) to understand product and process robustness. Understand risk management concepts used throughout the quality system to successfully meet FDA, ANSI/AAMI/ISO , and ANSI/AAMI/ISO requirements. You will lead strategies for test method validations, design verification and shelf-life protocols / reports. Education Requires advanced knowledge of job area combining breadth and depth, typically obtained through advanced education combined with experience. Requires a minimum Level 8 Degree in Engineering or other relevant discipline and minimum 8 years of relevant experience. Or advanced Degree with a minimum of 7 years of relevant experience. May have practical knowledge of project management. Experience in a highly regulated industry, preferably medical devices. Experience with solving complex issues by interacting with cross functional groups. Proven ability to operate in a matrix organization and navigate complex business systems, regulations, standards, and performance requirements. Knowledge of reliability tools and practices that effectively support requirements, design, integration and verification, and validation. Demonstrated critical thinking skills with focus on improved system performance outcomes and positive business impact. Excellent communication and ability to influence is critical to the role. Does this sound like your next career move? To apply and For more info forward your application to the link provided or contact me on OR Benefits: Healthcare + Annual Bonus + Pension
This advertiser has chosen not to accept applicants from your region.

Sr Site Reliability Engineer

Kilkenny, Leinster UKG (Ultimate Kronos Group)

Posted today

Job Viewed

Tap Again To Close

Job Description

Site Reliability Engineers at UKG are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation.
Site Reliability Engineers must be passionate about learning and evolving with current technology trends. They strive to innovate and are relentless in pursuing a flawless customer experience. They have an "automate everything" mindset, helping us bring value to our customers by deploying services with incredible speed, consistency, and availability.
**Job Responsibilities:**
+ Engage in and improve the lifecycle of services from conception to EOL, including system designconsulting, and capacity planning
+ Define and implement standards and best practices related to: System Architecture, Servicedelivery, metrics and the automation of operational tasks
+ Support services, product & engineering teams by providing common tooling and frameworks todeliver increased availability and improved incident response
+ Improve system performance, application delivery and efficiency through automation, processrefinement, postmortem reviews, and in-depth configuration analysis
+ Collaborate closely with engineering professionals within the organization to deliver reliableservices
+ Increase operational efficiency, effectiveness, and quality of services by treating operationalchallenges as a software engineering problem (reduce toil)
+ Guide junior team members and serve as a champion for SiteReliability Engineering
+ Actively participate in incident response, including on-call responsibilities
**Required Qualifications:**
+ Must have at least 3 years of hands-on experience working in Engineering or Cloud
+ Minimum 2 years' experience with public cloud platforms (e.g. GCP, AWS, Azure)
+ Minimum 2 years' Experience in configuration and maintenance of applications and/orsystems infrastructure for large scale customer facing company
+ Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java)
**Preferred Qualifications:**
+ Knowledge of Cloud based applications & Containerization Technologies
+ Demonstrated understanding of best practices in metric generation and collection, log aggregationpipelines
+ Demonstrable fundamentals in 2 of the following: Computer Science, Cloud architecture, Security,or Network Design fundamentals Demonstrable fundamentals in 2 of the following: Computer Science, Cloud architecture, Security, or Network Design fundamentals
**Where we're going**
UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet it's our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow!
UKG is proud to be an equal opportunity employer and is committed to promoting diversity and inclusion in the workplace, including the recruitment process.
Disability Accommodation in the Application and Interview Process
For individuals with disabilities that need additional assistance at any point in the application and interview process, please email
NOTICE ON HIRING SCAMS
UKG will never ask you for a copy of your driver's license, social security card, or passport during a job interview. For new hires, we do not ask for payment for equipment purchase, cost for training, or to receive onboarding documents. UKG does not make job offers outside of our formal hiring process. To help protect yourself against potential hiring scams, learn more about our formal hiring process, outlined here ( .
ABOUT OUR JOB DESCRIPTIONS
All job descriptions are written to accurately reflect the open job and include general work responsibilities. They do not present a comprehensive, detailed inventory of all duties, responsibilities, and qualifications required for the job. Management reserves the right to revise the job or require that other or different tasks be performed if or when circumstances change.
It is the policy of Ultimate Software to promote and assure equal employment opportunity for all current and prospective Peeps without regard to race, color, religion, sex, age, disability, marital status, familial status, sexual orientation, pregnancy, genetic information, gender identity, gender expression, national origin, ancestry, citizenship status, veteran status, and any other legally protected status entitled to protection under federal, state, or local anti-discrimination laws. This policy governs all matters related to recruitment, advertising, and initial selection of employment. It shall also apply to all other aspects of employment, including, but not limited to, compensation, promotion, demotion, transfer, lay-offs, terminations, leave of absence, and training opportunities.
This advertiser has chosen not to accept applicants from your region.

Staff Site Reliability Engineer

Dublin, Leinster ServiceNow, Inc.

Posted today

Job Viewed

Tap Again To Close

Job Description

It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone.
This is an exciting opportunity for someone who is passionate about driving innovation, enhancing service reliability, and making a tangible impact on the organization's success.
**What you get to do in this role:**
+ Provide relief and sustainable resolution to issues within our infrastructure.
+ Use your knowledge and experience in software development, systems engineering, and networking to proactively prevent repeatable issues.
+ Lead internal stakeholders and partner teams to improve the reliability, scalability and performance of the infrastructure through improved system design.
+ Champion and contribute to a culture of intolerance to manual activity, which results in an automation environment delivering repeatable and scalable response to system issues.
**To be successful in this role you have:**
+ Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving. This may include using AI-powered tools, automating workflows, analyzing AI-driven insights, or exploring AI's potential impact on the function or industry.
+ Excellent Knowledge of Linux systems.
+ Comfortable designing, authoring, testing, and debugging code in a team setting in one of the following languages such as Python, Go, Java, or Ruby.
+ Experience working with systems at scale - supporting critical services with focus on automation, observability, availability, and performance.
+ Experience with MySQL and PostgreSQL database administration, troubleshooting, and performance tuning.
+ Develop and maintain telemetry and monitoring solutions using OpenTelemetry standards to gain deep insights into system behaviour, proactively address issues, optimise performance, and improve efficiency through comprehensive data collection, analysis, and visualisation.
+ Proven experience in defining and managing SLAs.
+ Collaborate with development teams to ensure new services align with architectural standards and best practices.
Good to have:
+ Expertise in Observability and Monitoring of applications, services, and networks at scale.
+ Experience with DevOps automation, CI/CD pipeline and agile methodologies such as Gitlab CI-CD.
+ Experience writing test specifications and understand the fundamentals of test automation.
+ Experience working with Cloud technologies such as Azure and AWS.
+ Experience in configuration management of infrastructure using Ansible.
+ Experience with Kubernetes to orchestrate the deployment, scaling, and management of containers.
+ Hands-on experience with Microsoft Azure, Google Cloud (GCP) and Amazon Web Services (AWS), including designing, implementing, and maintaining reliable and scalable systems.
We also have pluses! They are not a 'must', but please highlight them on your resume if you have any of these: experience with cloud engineering, knowledge of core AI/ML techniques and algorithms, familiar with implementing Chaos engineering principles, experience in incident response process, post-mortem practices, or service best practice standards and web applications engineering.
**What you can expect from us:**
At ServiceNow, we make work better for everyone - including our own employees. We know that your best work happens when you live your best life and share your unique talents, so we do everything we can to make that possible for our employees. Win as a Team is part of our culture, and we aspire to wow our customers. We stay hungry and humble and focus on creating belonging. Sustainability, inclusivity, and diversity are key focus areas within our business framework so that we have transparency, equity, and accountability to deliver meaningful, measurable change. With our vision and dedication for a better future already underway. Join us on this journey!
In addition to a competitive salary, supportive teams, and a real opportunity to progress in your career with a forward-thinking organisation, we provide resources to help you and your loved ones be well. From benefits plans and programs, to mental health resources that offer coaching and 24/7 support, to family support resources and parental leave programs - we want to help you take care of yourself and your loved ones. Below is a glimpse into even more of our offerings or click here for a full list: ( Along with holidays, we have company-wide designated global well-being days where everyone is off and can spend time doing what matters most.
+ Good working culture to support the balance you need in both work and life.
+ Parental leave programs.
+ Childcare and caregiving benefits.
+ A learning experience platform built using our own technology, to support your learning and development goals as well as a tuition reimbursement program.
+ A global, cross-functional mentoring program.
+ We also have team building activities, various employee belonging groups, volunteering, and community outreach programs.
**Work Personas**
We approach our distributed world of work with flexibility and trust. Work personas (flexible, remote, or required in office) are categories that are assigned to ServiceNow employees depending on the nature of their work and their assigned work location. Learn more here ( . To determine eligibility for a work persona, ServiceNow may confirm the distance between your primary residence and the closest ServiceNow office using a third-party service.
**Equal Opportunity Employer**
ServiceNow is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status, or any other category protected by law. In addition, all qualified applicants with arrest or conviction records will be considered for employment in accordance with legal requirements.
**Accommodations**
We strive to create an accessible and inclusive experience for all candidates. If you require a reasonable accommodation to complete any part of the application process, or are unable to use this online application and need an alternative method to apply, please contact for assistance.
**Export Control Regulations**
For positions requiring access to controlled technology subject to export control regulations, including the U.S. Export Administration Regulations (EAR), ServiceNow may be required to obtain export control approval from government authorities for certain individuals. All employment is contingent upon ServiceNow obtaining any export license or other approval that may be required by relevant export control authorities.
From Fortune. ©2025 Fortune Media IP Limited. All rights reserved. Used under license.
This advertiser has chosen not to accept applicants from your region.

Principal Site Reliability Engineer

Oracle

Posted today

Job Viewed

Tap Again To Close

Job Description

**Job Description**
OCI Incident Response is the first line of defense for maintaining the high availability of Oracle's cloud. We make customer-impacting events shorter, less frequent, and less impactful by providing large-scale incident management. We are front-and-center in driving down event duration by utilizing our operational experience, knowledge of best practices, and ability to develop tools to automate incident management.
We are looking for a Principal SRE to join our OCI teamThis role is part of a globally distributed team responsible for detecting, triaging, and mitigating OCI service-impacting events as quickly as possible. You will be a part of one of these regional teams and be responsible for minimizing the downtime of OCI services. You will achieve this through delivering excellent major incident management and by operating systems with high scalability, performance, and security that prevent incidents from occurring.
Oracle's Cloud is state-of-the-art and constantly evolving. When it experiences issues, your team will respond within minutes to ensure customer impact is mitigated. This experience will expose you to the inner workings of OCI's systems and organizations. You will interact with and influence leaders from across the Oracle business and will drive broad cross-organization programs meant to iteratively improve OCI-wide service availability. We are an agile team with significant impact. If you want to be a part of a fast-moving team breaking new ground, we would like to speak with you!
Career Level - IC4
**Responsibilities**
Oracle's Cloud is innovative and constantly evolving. When it experiences issues, your team will respond within minutes to ensure customer impact is mitigated. This experience will expose you to the inner workings of OCI's systems and organizations. You will interact with and influence leaders from across the Oracle business and will drive broad cross-organization programs meant to iteratively improve OCI-wide service availability. We are an agile team with significant impact. If you want to be a part of a fast-moving team breaking new ground, we would like to speak with you!
**Responsibilities**
+ Solve complex problems related to infrastructure cloud services and automate common tasks to enable continuous availability with minimal human overhead
+ Command and coordinate SMEs and Service leaders to restore service as quickly as possible during Major Incidents while keeping accurate and timely data on the progress of such incidents
+ Utilize a deep understanding of cloud computing design patterns and their dependencies to mitigate complex Major Incidents.
+ Embed a methodical approach to troubleshoot large, complex, interconnected systems used in Incident Detection & Orchestration
+ Documents pertinent information relating to Incidents that aids process improvement, identifies deviations and enables the creation of an Incident Knowledge Base
+ Monitors and evaluates high-level service and infrastructure dashboards and takes action to address identified anomalies
+ Identifies opportunities and takes ownership for automation and/or continuous improvement of Incident Management process steps and best practices
+ Can define and document technical architecture of large-scale distributed systems.
+ Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
+ Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance.
+ Partner with development teams in defining operational requirements for product roadmaps.
+ Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
+ Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
**Minimum Qualifications**
+ Bachelor's degree or higher in Computer Science or relevant work experience.
+ 5+ years experience in Site Reliability Engineering, DevOps or System Engineering.
+ Must have public cloud operations experience (e.g., AWS, Azure, GCP, OCI).
+ Extensive experience with Major Incident Management in a cloud-based environment.
+ Demonstrate clear understanding of automation and orchestration principles.
+ Experience having worked in at least one modern object-oriented programming language.
+ Experience with professional software engineering standard methodologies such as Agile project management, coding standards, code reviews, source control management, build processes, testing, and operations.
+ Familiarity with infrastructure automation tools such as Chef, Ansible, Jenkins, Terraform
+ Excellent expertise with several of following technologies: Infrastructure-as-a-Service, CI/CD systems, Docker, RESTful APIs, log analysis tools, debugging tools
**Preferred Qualifications**
+ Strong leadership, project planning, communication, and execution skills
+ Strong analytic and problem-solving skills.
+ Proven track record of leading high blast-radius Major Incidents in cloud-based platforms.
+ Strong leadership, project planning, communication, and execution skills
+ Ability to handle multiple competing priorities in a fast-paced environment.
+ Ability to communicate clearly with technical and non-technical stakeholders at all levels.
+ Confidence to drive and manage large conference calls.
+ Experience with distributed service-oriented architectures
Career Level - IC4
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing or by calling in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
This advertiser has chosen not to accept applicants from your region.

Senior Network Reliability Engineer

Dublin, Leinster Oracle

Posted today

Job Viewed

Tap Again To Close

Job Description

**Job Description**
**Job Description**
**About Us:**
At Oracle Cloud Infrastructure (OCI), we're building the future of cloud technology for enterprises. As a team of innovative, diverse creators and engineers, we operate with the agility of a startup, but the scale and customer-first mindset of the leading enterprise software company in the world. We thrive on equity, inclusion, and respect for all, and are deeply committed to creating a positive impact in everything we do. Our values shape the way we work, from delivering excellent products to fostering an environment of continuous learning and career growth.
We are looking for passionate and driven professionals to join our dynamic team where autonomy and collaboration are key to delivering outstanding results. Here, you'll have the support and freedom to excel and push the boundaries of what's possible.
You will be part of a fast-paced, innovative team responsible for swiftly responding to network disruptions, identifying root causes, and collaborating with both internal and external stakeholders to restore services. Your work will also focus on automating daily operations, improving workflow efficiency, and optimizing network performance. With OCI's expansive global footprint, you will manage hundreds of thousands of network devices across a mix of dedicated backbone infrastructure, CLoS networks, and the internet.
**Preferred Qualifications:**
+ **Education & Experience** :
+ Experience working in a large-scale **ISP** or **cloud provider** environment, supporting global network infrastructure.
+ Prior experience in a **network operations** role, with a proven track record of handling complex network events.
+ **Technical Skills** :
+ Strong proficiency in **network protocols** and services, including **MPLS, BGP, OSPF, IS-IS, TCP/IP, IPv4/IPv6, DNS, DHCP, VxLAN, and EVPN** .
+ Extensive experience with **network automation** , scripting, and data center design. **Python** is preferred, though expertise in other scripting or compiled languages is a plus.
+ Hands-on experience with **network monitoring and telemetry solutions** , with the ability to leverage these tools to drive improvements in network reliability.
+ Familiarity with **network modeling and programming** , including **YANG, OpenConfig, and NETCONF** .
+ **Problem-Solving and Collaboration** :
+ Ability to apply **engineering principles** to resolve complex network issues, collaborating across teams to deliver effective solutions.
+ Strong **communication skills** , both written and verbal, with the ability to present technical information clearly to both technical and non-technical stakeholders.
+ Demonstrated experience in influencing product roadmap decisions, priorities, and feature development through sound judgment and technical expertise.
Career Level - IC3
**Responsibilities**
**Responsibilities**
**What You'll Do:**
+ **Support and Operate OCI's Global Network:** Design, deploy, and manage large-scale network solutions that power Oracle Cloud Infrastructure (OCI), ensuring reliability and performance at a global scale.
+ **Collaborate and Drive Change:** Use best practices and tools to develop and execute network changes safely. Work closely with cross-functional teams to continuously improve network performance.
+ **Incident Response and Troubleshooting:** Lead break-fix support for network events, provide escalation for complex issues, and perform post-event root cause analysis to prevent future disruptions.
+ **Automation and Efficiency:** Create and maintain scripts to automate routine network tasks, working with business units and teams to streamline operations and increase productivity.
+ **Mentorship and Knowledge Sharing:** Guide and mentor junior engineers, fostering a culture of collaboration, continuous learning, and technical excellence.
+ **Network Monitoring and Performance Analysis:** Collaborate with network monitoring teams to gather telemetry data, build dashboards, and set up alert rules to track network health and performance.
+ **Vendor Collaboration:** Work with network vendors and technical account teams to resolve network issues, qualify new firmware/operating systems, and ensure the network ecosystem's stability.
+ **On-Call Support:** Participate in the on-call rotation to provide after-hours support for critical network events, ensuring that operational excellence is maintained 24/7.
Career Level - IC3
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing or by calling in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
This advertiser has chosen not to accept applicants from your region.
Be The First To Know

About the latest Reliability engineer Jobs in Ireland !

Senior Site Reliability Engineer

Mulhuddart, Leinster IBM

Posted today

Job Viewed

Tap Again To Close

Job Description

**Introduction**
IBM Trusteer has an opportunity for a senior SRE; We are seeking an experienced and talented individual that is passionate about infrastructure and interested in working with cutting edge technology in a global large-scale environment.
**Your role and responsibilities**
This role is responsible for designing, deploying, and maintaining our infrastructure and CI/CD pipelines. The work includes designing, building and deploying high availability, robust, resilient and supportable products while streamline and automate our software delivery and infrastructure operations in a large-scale SaaS environment. With a focus on the infrastructure and operational elements of designing and deploying large scale solutions, the SRE must ensure the infrastructure is highly available, have sufficient capacity in place and are fully resilient across multiple data centers and cloud architectures.
This role works as part of an operations team to design, deploy and support 24x7x365 operations with day-to-day responsibilities that include:
* Develop and maintain automation scripts and tools using Python, Groovy & bash.
* Create and manage our pipelines, CI/CD infrastructure and automations jobs in Jenkins.
* Manage Development/QA/Production environments with Terraform.
* Integrate, create and maintain monitoring for various flows and components of the system to ensure systems' reliability and observability.
* Occasional off-shift availability to resolve Production issues.
* Work closely with other members of the SRE and R&D teams.
* Responsible for system performance and reliability.
* Ensure proactive engagement in Incident Management process, working with Operational teams to minimize the impact of database outages.
**Required technical and professional expertise**
* Several years experience as SRE
* Several years experience in scripting and automation using Python or similar language
* Experience in Cloud-related environment - AWS preferred
* Experience in CICD tools like Jenkins
* Experience with IaC and configuration management tools like Terraform and Ansible
* Experience with docker-based environments and Kubernetes orchestration (GitOps and ArgoCD are an advantage)
* Experience working in production environments requiring 99.99% availability
* Excellent communication skills including the ability to effectively communicate with technical, non-technical employees and vendors.
* Strong problem solving, testing, and network troubleshooting skills.
* Bachelor's degree in computer science, Information Technology, or a related field.
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
This advertiser has chosen not to accept applicants from your region.

Senior Site Reliability Engineer

Rithum

Posted today

Job Viewed

Tap Again To Close

Job Description

Rithum is the world's most trusted commerce network, accelerating how brands, suppliers, and retailers work together to deliver seamless e-commerce experiences. We provide an unmatched platform for brands and retailers, enabling them to accelerate growth, optimise operations across channels, scale product offerings and enhance margins.
Today, more than 40,000 companies trust Rithum to grow their business across hundreds of channels, representing over $50 billion in annual GMV. Using our commerce, marketing, and delivery solutions, our customers create optimised consumer shopping journeys from beginning to end.
**Overview**
As a Senior Site Reliability Engineer in our Platform Engineering Organization, you help to build and run large-scale, distributed, fault-tolerant systems. In this role, you are involved in the complete lifecycle of our products from inception to operation, ensuring they are reliable, performant and meet appropriate uptime and availability targets. You design and maintain resilient systems, implement robust observability through metrics, logging, and tracing, and build automation that improves deployment, monitoring, and incident response workflows. This includes leveraging AI/ML for intelligent alerting, anomaly detection, and predictive incident response to enhance system reliability and scalability. Working with others in the organization, you help develop and influence operational tooling, best practices, and standards that empower the engineering organization and help ensure Rithum's effective and efficient operations. As a Senior Engineer, you operate independently, self-prioritizing work, design and lead projects from start to completion, engaging with stakeholders for successful delivery. You mentor and assist less experienced people on the team and coach them to help improve their skills.
**Responsibilities**
+ Collaborate with developers, Client Support, and cross-functional teams to build production automation, analysis tools, and improving reliability and performance.
+ Design, implement, and maintain robust application monitoring and observability systems for a distributed, highly available, and scalable software stack leveraging AI/ML to detect anomalies and asset with incidents.
+ Analyse and resolve problems in legacy environments while designing and implementing modern, scalable solutions from the ground up.
+ Participate in the rotating on-call schedule, ensuring that user emergencies, platform alerts, and support requests are addressed.
+ Drives automation and operational efficiency.
**Qualifications**
Minimum Qualifications
+ 3+ years' experience working as an SRE, DevOps Engineer or related
+ Experience with logging and monitoring systems like CloudWatch, Grafana or Prometheus
+ Experience with AWS foundations, including compute, storage, and security
+ Good AWS knowledge including application design, migration support, cost planning, capacity allocation, and application resiliency
+ Expertise in creating multi-region cloud systems with a solid disaster recovery plan
+ Experience with both high-level and scripting languages like Python, Bash or Typescript
+ Experience troubleshooting and debugging complex, distributed applications
+ IaC experience automating infrastructure with CDK, Terraform or Ansible
+ Experience with continuous deployment pipelines and containerization like EKS or ECS
+ Strong understanding of software engineering fundamentals, including object-oriented design, modular architecture, and maintainable coding practices.
Preferred Qualifications
+ You have a bachelor's degree, or higher, in Computer Science or related field; or equivalent practical experience demonstrating strong software engineering fundamentals.
+ Experience working in a highly collaborative environment with both platform and product teams,
+ Excellent collaboration and communication skills, consistently learning new technologies and helps foster an environment of continuous improvement and innovation.
+ Client satisfaction focus.
**Travel Required**
Up to 10%
**Other Duties**
_Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice._
**What it's like to work at Rithum**
When you join Rithum, you can expect to work with smart risk-takers, courageous collaborators, and curious minds.
As part of the Rithum team, you are valued, supported, and included. Guided by a transparent culture and accessible, approachable leadership, we offer career opportunities aligned to your ambitions and talents. To ensure work and life balance works for you, we also offer an array of resources to support you and your families, including comprehensive benefits and wellness plans.
At Rithum you will:
+ Partner with the leading brands and retailers.
+ Connect with passionate professionals who will help support your goals.
+ Participate in an inclusive, welcoming work atmosphere.
+ Achieve work-life balance through remote-first working conditions, generous time off, and wellness days.
+ Receive industry-competitive compensation and total rewards benefits.
**Benefits**
+ Medical coverage provided through Irish Life Health; premiums paid by the company
+ Life & disability insurance
+ Pension plan with 5% company match
+ Competitive time off package with 25 Days of PTO, 11 Company-Paid holidays, 2 Wellness days and 1 Paid Volunteer Day
+ Access to tools to support your wellbeing such as the Calm App and an Employee Assistance Program
+ Professional development stipend and learning and development offerings to help you build the skills and connections you need to move forward in your career.
+ Charitable contribution match per team member
Rithum is an equal opportunity employer. We are committed to providing an environment of mutual respect where equal employment opportunities are available to all applicants and teammates without regard to race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status or any other protected characteristic. All employment is decided on the basis of qualifications, merit, and business need.
We're committed to providing reasonable accommodations in accordance with the law for qualified applicants. If you require assistance during the interview process due to a medical condition or need support accessing our website or completing the application process, please reach out to us by completing the Accommodations Request Form ( . Your comfort and accessibility are important to us, and we're here to ensure a seamless experience as you explore opportunities with our team.
This advertiser has chosen not to accept applicants from your region.

Senior Site Reliability Engineer, Observability

MongoDB

Posted today

Job Viewed

Tap Again To Close

Job Description

MongoDB's mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft Azure. Atlas allows customers to build and run applications anywhere-on premises, or across cloud providers. With offices worldwide and over 175,000 new developers signing up to use MongoDB every month, it's no wonder that leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications.
**Team and Role Overview**
The SRE Observability team is part of the larger Platform Engineering organization, and is dedicated to building and maintaining the observability stack (metrics, logging, tracing) used by all engineering teams to ensure the smooth functioning of their service. We also own related services, including our telemetry pipeline, and our monitoring and alerting infrastructure. Our stack includes VictoriaMetrics, Splunk, QuickWit, Jaeger, Fluentbit, and Vector. In addition to owning our observability infrastructure, as an Engineer on the team, you'll also work closely with other SWE and SRE teams to promote and implement best practices in instrumenting and monitoring their services. This is a highly collaborative role, and you will get to own some of the most relied upon internal infrastructure at Mongo.
This role will be based remotely in Ireland.
**Responsibilities**
+ Define standards and vision for the mission-critical observability platform leveraged by all parts of the engineering organization
+ Design, architect, build and deliver core pieces of our observability services in collaboration with other vested parties
+ Design, implement, and troubleshoot the monitoring of services that seamlessly spans the globe - including several cloud providers
+ Build for reliability, making services and infrastructure available, resilient, fault tolerant and self-healing
+ Identify and configure key metrics to detect incidents and quantify service health, availability and performance.
+ Participate in a week-long on-call rotation and blameless post-mortem process
+ Improve our observability capabilities, optimizing for cost, ease of use, and maintainability
**Requirements**
+ Experience running mission critical services at scale
+ Experience with observability of large scale distributed systems
+ An understanding of information security issues
+ Firm grasp of at least one modern programming language, beyond basic scripting
+ Solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc)
+ Bachelor's degree in Computer Science or equivalent experience
**Nice to haves**
+ Experience with at least one of the major cloud providers (Amazon Web Services, Google Compute, Microsoft Azure)
+ Experience working in a kubernetes-based environment kubernetes clusters
**What's in it for you**
+ Generous compensation package
+ Opportunities to learn on the job (time to up skill in new technologies)
+ High level of independence in your day to day work
To drive the personal growth and business impact of our employees, we're committed to developing a supportive and enriching culture for everyone. From employee affinity groups, to fertility assistance and a generous parental leave policy, we value our employees' wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it's like to work at MongoDB ( , and help us make an impact on the world!
MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter.
MongoDB is an equal opportunities employer.
Req ID:
This advertiser has chosen not to accept applicants from your region.

Nearby Locations

Other Jobs Near Me

Industry

  1. request_quote Accounting
  2. work Administrative
  3. eco Agriculture Forestry
  4. smart_toy AI & Emerging Technologies
  5. school Apprenticeships & Trainee
  6. apartment Architecture
  7. palette Arts & Entertainment
  8. directions_car Automotive
  9. flight_takeoff Aviation
  10. account_balance Banking & Finance
  11. local_florist Beauty & Wellness
  12. restaurant Catering
  13. volunteer_activism Charity & Voluntary
  14. science Chemical Engineering
  15. child_friendly Childcare
  16. foundation Civil Engineering
  17. clean_hands Cleaning & Sanitation
  18. diversity_3 Community & Social Care
  19. construction Construction
  20. brush Creative & Digital
  21. currency_bitcoin Crypto & Blockchain
  22. support_agent Customer Service & Helpdesk
  23. medical_services Dental
  24. medical_services Driving & Transport
  25. medical_services E Commerce & Social Media
  26. school Education & Teaching
  27. electrical_services Electrical Engineering
  28. bolt Energy
  29. local_mall Fmcg
  30. gavel Government & Non Profit
  31. emoji_events Graduate
  32. health_and_safety Healthcare
  33. beach_access Hospitality & Tourism
  34. groups Human Resources
  35. precision_manufacturing Industrial Engineering
  36. security Information Security
  37. handyman Installation & Maintenance
  38. policy Insurance
  39. code IT & Software
  40. gavel Legal
  41. sports_soccer Leisure & Sports
  42. inventory_2 Logistics & Warehousing
  43. supervisor_account Management
  44. supervisor_account Management Consultancy
  45. supervisor_account Manufacturing & Production
  46. campaign Marketing
  47. build Mechanical Engineering
  48. perm_media Media & PR
  49. local_hospital Medical
  50. local_hospital Military & Public Safety
  51. local_hospital Mining
  52. medical_services Nursing
  53. local_gas_station Oil & Gas
  54. biotech Pharmaceutical
  55. checklist_rtl Project Management
  56. shopping_bag Purchasing
  57. home_work Real Estate
  58. person_search Recruitment Consultancy
  59. store Retail
  60. point_of_sale Sales
  61. science Scientific Research & Development
  62. wifi Telecoms
  63. psychology Therapy
  64. pets Veterinary
View All Reliability Engineer Jobs