316 IT Infrastructure jobs in Ireland
Infrastructure Systems Administrator
Posted today
Job Description
My client, based in Drogheda, is looking for an Infrastructure Systems Administrator to manage and support its on-premises and cloud-based IT infrastructure. This role is responsible for maintaining servers, endpoints, and network components, administering user access and security, and ensuring system reliability across a diverse technology stack. The ideal candidate will bring strong troubleshooting skills, hands-on experience with both hardware and software systems, and the ability to provide end-to-end support in a dynamic environment.
Key Responsibilities
- Troubleshoot and resolve hardware, software, and network issues, including diagnosing outages and system performance problems.
- Configure, install, and manage servers, laptops, desktops, mobile devices, printers, and scanners.
- Administer Active Directory, Group Policies, OUs, and Azure AD/Entra ID for user/group management and identity access.
- Manage Microsoft 365 services (Exchange, SharePoint, Teams, OneDrive) and support cloud-based business applications such as Oracle NetSuite.
- Perform system backups, patch management, and implement endpoint protection using Microsoft Defender, MDM, and ATP.
- Support network fundamentals (TCP/IP, DNS, DHCP, VLANs, WAN, Fibre, Wi-Fi standards) and assist with structured cabling and connectivity issues.
- Monitor system performance using Microsoft Purview, Intune, SNMP, syslog, and log analysis tools.
- Provide desktop and software support, including imaging devices, supporting engineering applications (SolidWorks, ZW CAD), ERP (NetSuite), payroll software, and Microsoft Office suite.
- Manage and support remote access tools, ticketing systems, and IT documentation.
- Assist in basic security administration, including certificate generation, installation, and revocation.
- Collaborate with stakeholders to ensure IT systems meet business needs, while maintaining compliance with security and governance standards.
Required Skills & Experience
- 3+ years' experience in systems or infrastructure administration.
- Strong knowledge of Windows Server (2022), Windows 11, Active Directory, Azure AD/Entra ID, and endpoint security solutions.
- Experience with Microsoft 365 administration and cloud services.
- Hands-on knowledge of hardware platforms (HP ProLiant servers, Dell/Lenovo laptops, HP workstations, printers, mobile devices).
- Familiarity with basic networking concepts (TCP/IP, DNS, DHCP, VLANs, Wi-Fi, fibre connectivity).
- Strong troubleshooting, diagnostic, and problem-solving skills.
- Excellent communication and stakeholder engagement abilities.
Desirable Skills
- Experience with macOS and iOS devices in mixed environments.
- Exposure to scripting/automation with PowerShell or Python.
- Familiarity with engineering/CAD applications (SolidWorks, ZW CAD).
- Understanding of ITIL processes and documentation standards.
- Monitoring and reporting experience across on-premises and cloud environments.
Network Production Engineer, Infrastructure
Posted 14 days ago
Job Description
The Network Infrastructure team is responsible for designing, building and operating one of the largest networks in the world. Networking is at the core of all Meta products and experiences, and we are looking for Production Network Engineers who are interested in solving complex technical challenges in the Backbone, Datacenter Network, and AI Network domains. Production Network Engineers at Meta are hybrid software and network engineers who keep reliability and scalability in mind as they work on different parts of the lifecycle (designing, building, and operating our worldwide network). This role offers an opportunity to solve problems ranging from the scaling challenges of supporting billions of people using our family of apps to cutting-edge challenges in the AI workloads that power new Meta products.
**Network Production Engineer, Infrastructure Responsibilities:**
1. Develop operational process improvements and implement them in scalable, automated workflows to enhance operational efficiency
2. Design and develop solutions that scale across a variety of hardware platforms of network equipment
3. Lead enhancements of automation for continuous integration, validations, testing infrastructure, release, and configuration management across our global backbone, data center, and edge networks
4. Conduct thorough investigations into complex technical issues across networks, ranging from automated tooling to hardware failures and network issues
5. Participate in an oncall rotation with the rest of the team
6. Proactively find gaps that impact multiple teams, come up with an execution plan, drive the project, and influence other teams to reach the goal
7. Help increase operational efficiency between peers and cross-functional teams by identifying roadblocks, designing and delivering automation solutions, and driving change
**Minimum Qualifications:**
1. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
2. 2+ years of coding experience in at least one programming language (e.g. Python, Go, C++, or Java), and experience learning new development languages
3. Experience with any of the following areas: planning, designing, building or operating scalable systems/networks
**Preferred Qualifications:**
1. Experience with software and network debugging, profiling, and instrumentation techniques
2. 2+ years of experience building software for managing network infrastructure
3. Experience with developing distributed systems and operating them at scale
4. Experience designing and maintaining automated testing infrastructure
5. MS or Graduate work experience in Computer Science, Computer Engineering, or a related technical discipline
6. 2+ years of experience developing software on operating systems such as Linux
**Industry:** Internet
Infrastructure Engineer
Posted today
Job Description
Technical Infrastructure Analyst
Overview
The Technology Services & Operations Team provides end user support and is responsible for the network and infrastructure, DR and business continuity, and out-of-hours support.
In this role you are responsible for the delivery, support and maintenance of the IT infrastructure environments, which include Production, Business Continuity and Data Centre with a focus on Network, VDI Infrastructure and Exchange infrastructure.
You will also be involved in the delivery of infrastructure projects, changes and improvements and support the existing infrastructure and enterprise systems environment.
- Assist with the monitoring, maintenance, upgrade & support of the SMT IT infrastructure
- Act as a point of escalation for the helpdesk and as the 3rd line ICT specialist.
- Responsible for the day-to-day management of key delivery partners and vendors
The ideal person will have a strong Network Admin background, strong Microsoft server OS and desktop support experience, MS Teams admin experience, and experience in service delivery and in project-managing upgrades and migrations of infrastructure environments, along with qualifications in Microsoft, ITIL and/or project management.
Technical knowledge
- Networking, e.g. TCP/IP, DNS, DHCP, routers, switches and VLANs, and maintaining both cloud-based and on-prem IT infrastructure
- Citrix Full Desktop & Applications Mgmt & VMware design, implementation and support
- Backup Strategy & Disaster Recovery
- Data Centre Management / Migrations
- ITIL certification, Prince/PMP certification, MCSE/MCSA, VCP-DCV 2020, CCNA and CompTIA are of benefit
If interested, please contact Fergal Keys at The Panel
Infrastructure Engineer
Posted today
Job Description
(LAMP & Container) Infrastructure Engineer
We are a leading digital commerce platform delivering exceptional uptime exceeding 99.95% while serving 10 million page views daily. We are dedicated to providing seamless and reliable experiences for our global customer base. As we evolve, we aim to transition from a traditional high-availability LAMP stack to a containerized infrastructure to enhance scalability, efficiency, and performance.
Job Description:
We are seeking a skilled Infrastructure Engineer to join our international team. The successful candidate will play a pivotal role in migrating our existing high-availability LAMP stack to a containerized environment. You will collaborate with teams across our range of products to standardize approaches and tooling, ensuring consistency and optimization throughout our operations.
Key Responsibilities:
- Lead the migration of our LAMP stack to a containerized platform.
- Manage high-availability infrastructure to maintain >99.95% uptime.
- Implement and maintain container-based orchestration tools
- Engage with third parties to conduct penetration testing and implement security best practices to safeguard our infrastructure.
- Collaborate with teams to standardize tools, processes, and best practices.
- Participate in an on-call rotation to provide 24/7 support for critical systems.
- Monitor and optimize system performance, troubleshoot issues, and ensure system reliability and security.
- Develop automation scripts to streamline operations and deployments.
- Stay updated with the latest industry trends and integrate new technologies as appropriate.
Experience:
- Proven experience in high-availability VM management.
- Hands-on experience with container technologies and migration processes.
- Strong background in LAMP stack environments
Skills:
- Proficiency with containerization tools like Docker and orchestration platforms like Kubernetes and Helm.
- Experience with SSH and managing systems running on AlmaLinux.
- Knowledge of penetration testing and security best practices.
- Proficiency with cloud platforms and iPaaS (AWS, Azure, Google Cloud).
- Strong scripting skills (e.g., Bash).
- Excellent problem-solving and analytical abilities.
- Effective communication skills and ability to work within an international team.
Preferred:
- Scaling LAMP based applications
- Experience in migration from VMs to container environments
- Knowledge of CI/CD pipelines and DevOps practices.
- Azure certifications
Infrastructure Engineer
Posted today
Job Description
Position Summary
We are seeking a highly skilled Infrastructure Engineer to join our Technology Operations group in Dublin. This is a hands-on engineering role with strong elements of operational engagement and technical ownership.
The successful candidate will provide expertise across Google Cloud Platform (GCP), VM Infrastructure, Terraform, Kubernetes, and configuration management tools (Ansible or equivalent), while helping to define and drive Infrastructure standards, resilience, and reliability.
The role spans build, run and operations of our cloud & VM environment, supporting the full lifecycle from infrastructure-as-code through to day-to-day operations, on-call, and continuous improvement initiatives.
Key Responsibilities
1. Cloud Infrastructure Engineering
- Implement and operate GCP infrastructure using Terraform (IaC) and Kubernetes (GKE / container orchestration).
- Own infrastructure configuration management (Ansible or equivalent).
- Ensure security, scalability, and high availability across infrastructure platforms.
- Support cloud cost optimisation and capacity planning.
2. Operations & On-Call
- Participate in the on-call rotation, providing tier-3/4 support for critical infrastructure services.
- Participate in incident response and root-cause analysis for infrastructure and cloud issues.
- Drive automation of monitoring, alerting, and remediation workflows.
3. Technical Governance & Standards
- Define, document, and enforce minimum technical standards across the Infrastructure, VM & Cloud landscape.
- Establish / Implement "Ready for Service" and service acceptance criteria for all new projects.
- Enhance / Implement existing processes for vulnerability management, patching, and change control.
4. Delivery & Process Improvement
- Partner with cross functional teams to align delivery models (Kanban for BAU, Agile for projects).
- Jira governance for Infrastructure projects — workflow / agile methodology
- Produce technical assessments / documentation upholding standards and providing recommendations for management.
5. Cross-Team Collaboration
- Act as a technical point of contact for Infrastructure teams.
- Provide ownership in workshops, technical design reviews, and operational readiness sessions.
- Support knowledge sharing, mentoring, and technical upskilling within the team.
Core Deliverables
VM & Cloud Provisioning
- Build and manage GCP compute instances, storage, and networking using Terraform.
- Harden / Patch / Maintain VM images for Linux and Windows workloads.
- Implement monitoring, logging, and backup policies for VM fleets.
Kubernetes Operations
- Deploy and maintain Kubernetes clusters (GKE).
- Configure namespaces, RBAC, ingress controllers, and service mesh as required.
- Build and deploy containerised workloads; manage scaling, upgrades, and patching.
- Troubleshoot pod, node, and networking issues in production clusters.
Configuration Management (Ansible)
- Write and maintain Ansible playbooks and roles for VM and cluster configuration.
- Automate patching, security hardening, and system updates.
- Standardise environment builds and enforce consistency across dev/test/prod.
Monitoring & Reliability
- Implement and tune alerting rules (Stackdriver/Prometheus/Grafana).
- Perform root cause analysis for incidents and feed improvements back into IaC.
- Create runbooks and playbooks for common operational scenarios.
On-Call & Incident Response
- Participate in on-call rotation for infrastructure and cloud services.
- Lead troubleshooting during critical incidents and restore service quickly.
- Document incident findings and implement automation/preventative fixes.
Day-to-Day Ops
- Track and remediate vulnerabilities, including OS/kernel upgrades.
- Optimise resource usage and costs through tuning and automation.
Required Skills & Experience
Technical Expertise
- 5+ years' experience as an Infrastructure Engineer, SRE, or Cloud Engineer in enterprise environments.
- Proven expertise in Google Cloud Platform (GCP) / AWS
- Strong skills in VM workloads, Terraform (IaC) and Kubernetes (GKE or equivalent).
- Hands-on with Ansible (or equivalent configuration management).
- Broad knowledge of networking, databases, Linux systems, and hybrid infrastructure.
- Experience with monitoring, observability, and incident response tooling.
Delivery & Process
- Demonstrated success implementing Kanban and Agile delivery within Infra/Cloud Ops an advantage
- Ability to document and enforce technical standards.
- Proven track record of improving operational performance and cross-team delivery.
Soft Skills
- Strong communicator — able to influence both technical engineers and senior stakeholders.
- Organised and pragmatic, with strong ownership and accountability.
- Collaborative style — balances governance with delivery speed.
Preferred Qualifications
- GCP Professional Cloud Architect or equivalent certification, or equivalent hands-on experience.
- Experience in regulated environments (PCI-DSS, ISO, SOC2, etc.) an advantage
- DevOps / SRE hands-on experience
Engagement Details
- Location: Dublin (Hybrid – typically 2 days in office / 3 days remote).
- Contract Type: Full-Time, Permanent.
- Working Model: Hands-on engineering
- On-Call: Participation in rota with additional allowance.
Infrastructure Engineer
Posted today
Job Description
Position:
Infrastructure Support Engineer Level 3 (Azure focus)
Type:
4 month Day-rate contract (strong scope for extension)
Location:
Dublin/Hybrid
Please note, you must be based in Ireland with the relevant visa to be considered for this role.
We are seeking a skilled Infrastructure Engineer (Level 3) to manage, support, and optimize our client's IT operations and project delivery. The ideal candidate will have strong Level 3 technical support and Azure Active Directory (Entra ID) experience, and will be comfortable working with both cloud and on-premise environments.
Key responsibility:
- Manage and maintain Microsoft Azure environment
Key functions of the role:
- Oversee and maintain both Microsoft Azure and on-premise infrastructure, ensuring stability and optimal performance.
- Deliver Level 3 technical support for escalated issues and guide the service desk team in problem resolution.
- Lead or assist in infrastructure projects, including system upgrades, deployments, and patch management.
- Implement and monitor security controls, manage assets, and maintain accurate infrastructure documentation.
- Collaborate with internal teams and vendors to improve systems, enhance automation, and ensure compliance with organisational standards
Experience Required:
- Strong experience in infrastructure engineering or systems administration, preferably across hybrid cloud environments
- Excellent experience in Microsoft Azure, Windows Server, Azure Active Directory, and virtualised systems
- Ability to troubleshoot complex issues, manage patching and security hardening, and monitor systems effectively
- Familiarity with asset management, development/test environments, and infrastructure lifecycle processes
- Great communication, documentation, and collaboration skills, with the ability to handle multiple priorities in a fast-paced setting
Please apply with your CV in the strictest of confidence. All applicants must have valid working rights in Ireland and be able to prove them.
Infrastructure Manager
Posted today
Job Description
Infrastructure/Support manager
We're looking for an experienced Infrastructure/Support manager to join a growing engineering company. You'll be the go-to person for all things IT, from day-to-day support to shaping the company's infrastructure across 4–5 international offices.
You'll play a key role in an upcoming office relocation, helping define IT requirements, manage setup, and ensure everything runs smoothly.
What you'll be working on:
- Onboarding setup and device management
- Email and SharePoint configuration
- Security protocols and best practices
- ISO certification support
- Implementing and maintaining IT support systems
Who you are:
You're hands-on, proactive, and comfortable juggling both technical and strategic work. You'll have experience across IT support, infrastructure, and systems administration — ideally within a multi-site or international setup.
If you like variety, ownership, and helping a growing company stay connected and secure, this role's for you. So just hit that apply button
Infrastructure Inspector
Posted today
Job Description
Company Description
Turner & Townsend is a global professional services company with over 22,000 people in more than 60 countries.
Working with our clients across real estate, infrastructure, energy and natural resources, we transform together delivering outcomes that improve people's lives. Working in partnership makes it possible to deliver the world's most impactful projects and programmes as we turn challenge into opportunity and complexity into success.
Our capabilities include programme, project, cost, asset and commercial management, controls and performance, procurement and supply chain, net zero and digital solutions.
We are majority-owned by CBRE Group, Inc., the world's largest commercial real estate services and investment firm, with our partners holding a significant minority interest. Turner & Townsend and CBRE work together to provide clients with the premier programme, project and cost management offering in markets around the world.
Job Description
Turner & Townsend are seeking a Light Rail Network Inspector to carry out weekly network infrastructure condition surveys (excluding Energy and Systems) and to monitor and report on the progress of 3rd party developments adjacent to and near the light rail network.
Responsibilities
- Carry out regular on-site inspections of the light rail network, jointly with the Light Rail Maintenance Contractor as required.
- Provide weekly infrastructure inspection reports to Adjacent Development Manager/Senior asset manager, detailing issues of concern related to the condition of the asset and/or activities of 3rd party developers working in proximity to the network.
- Notify relevant manager immediately of any issues requiring immediate corrective action and monitor progress in implementing agreed actions. Provide close out reports where required.
- Provide weekly inspection report to maintenance staff.
- Maintain a register of faults/defects on rail infrastructure, noting date, location, fault identification, possible action required, issue assigned to and proposed rectification date.
- Undertake regular monitoring of 3rd party adjacent development activity that has not received authorisation and/or impacts the safe operation of the network.
- Undertake weekly site inspection visits to record and monitor the progress of adjacent developments and ensure their compliance with agreements, ensuring the safe operation of the network.
Qualifications
- At least 5 years of project and/or contract management experience in an engineering, construction, or asset management role, preferably within the transport or infrastructure sectors, with relevant site experience.
- Knowledge of asset management principles, combined with a good understanding of lifecycle management.
- Knowledge of working in railway operating environments and / or working within safety critical environments.
- Good communication and interpersonal skills, with the ability to present complex information clearly and concisely to a variety of audiences.
- Level 7 or equivalent engineering type qualification and experience gained in a similar site-based environment.
- Experience of hands-on construction works, in a demanding environment (rigorous safety environment, unsocial hours)
- Strong history of comprehensive training in the use and application of construction tools, materials, plant and on track plant.
- A good understanding of programme and project management and risk management.
- A strong commitment to public safety and to the safety of staff.
- Ability to work unsocial hours when required and nightshifts if required.
Additional Information
- Full time, permanent
- Competitive remuneration and attractive range of benefits
- 1 volunteering day
- Gym discount
- Company funded social club
- Opportunity to work on impactful and innovative projects
- Career development opportunities both in Ireland and globally
- Opportunity to work with a diverse group of talented and collaborative colleagues
Our inspired people share our vision and mission. We provide a great place to work, where each person has the opportunity and voice to effect change.
We want our people to succeed both in work and life. To support this we promote a healthy, productive and flexible working environment that respects work-life balance.
Turner & Townsend is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees and actively encourage applications from all sectors of the community.
It is strictly against Turner & Townsend policy for candidates to pay any fee in relation to our recruitment process. No recruitment agency working with Turner & Townsend will ask candidates to pay a fee at any time.
Any unsolicited resumes/CVs submitted through our website or to Turner & Townsend personal e-mail accounts, are considered property of Turner & Townsend and are not subject to payment of agency fees. In order to be an authorised Recruitment Agency/Search Firm for Turner & Townsend, there must be a formal written agreement in place and the agency must be invited, by the Recruitment Team, to submit candidates for review.
Infrastructure Engineer
Posted today
Job Description
Every day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant services. Our worldwide team helps over 3 million companies, more than 1,300 financial institutions and over 600 million cardholders grow with confidence and achieve amazing results. We are driven by our passion for success and we are proud to deliver best-in-class payment technology and software solutions. Join our dynamic team and make your mark on the payments technology landscape of tomorrow.
Position Summary
We are seeking a highly skilled Infrastructure Engineer to join our Technology Operations group in Dublin. This is a hands-on engineering role with strong elements of operational engagement and technical ownership.
The successful candidate will provide expertise across Google Cloud Platform (GCP), VM Infrastructure, Terraform, Kubernetes, and configuration management tools (Ansible or equivalent), while helping to define and drive Infrastructure standards, resilience, and reliability.
The role spans build, run and operations of our cloud & vm environment, supporting the full lifecycle from infrastructure-as-code through to day-to-day operations, on-call, and continuous improvement initiatives.
Key Responsibilities
1. Cloud Infrastructure Engineering
- Implement and operate GCP infrastructure using Terraform (IaC) and Kubernetes (GKE / container orchestration).
- Own infrastructure configuration management (Ansible or equivalent).
- Ensure security, scalability, and high availability across infrastructure platforms.
- Support cloud cost optimisation and capacity planning.
2. Operations & On-Call
- Participate in the on-call rotation, providing tier-3/4 support for critical infrastructure services.
- Participate in incident response and root-cause analysis for infrastructure and cloud issues.
- Drive automation of monitoring, alerting, and remediation workflows.
3. Technical Governance & Standards
- Define, document, and enforce minimum technical standards across the Infrastructure, VM & Cloud landscape.
- Establish / Implement "Ready for Service" and service acceptance criteria for all new projects.
- Enhance / Implement existing processes for vulnerability management, patching, and change control.
4. Delivery & Process Improvement
- Partner with cross functional teams to align delivery models (Kanban for BAU, Agile for projects).
- Jira governance for Infrastructure projects (workflow / agile methodology)
- Produce technical assessments / documentation upholding standards and providing recommendations for management.
5. Cross-Team Collaboration
- Act as a technical point of contact for Infrastructure teams.
- Provide ownership in workshops, technical design reviews, and operational readiness sessions.
- Support knowledge sharing, mentoring, and technical upskilling within the team.
Core Deliverables
VM & Cloud Provisioning
- Build and manage GCP compute instances, storage, and networking using Terraform.
- Harden / Patch / Maintain VM images for Linux and Windows workloads.
- Implement monitoring, logging, and backup policies for VM fleets.
Kubernetes Operations
- Deploy and maintain Kubernetes clusters (GKE).
- Configure namespaces, RBAC, ingress controllers, and service mesh as required.
- Build and deploy containerised workloads; manage scaling, upgrades, and patching.
- Troubleshoot pod, node, and networking issues in production clusters.
Configuration Management (Ansible)
- Write and maintain Ansible playbooks and roles for VM and cluster configuration.
- Automate patching, security hardening, and system updates.
- Standardise environment builds and enforce consistency across dev/test/prod.
Monitoring & Reliability
- Implement and tune alerting rules (Stackdriver/Prometheus/Grafana).
- Perform root cause analysis for incidents and feed improvements back into IaC.
- Create runbooks and playbooks for common operational scenarios.
On-Call & Incident Response
- Participate in on-call rotation for infrastructure and cloud services.
- Lead troubleshooting during critical incidents and restore service quickly.
- Document incident findings and implement automation/preventative fixes.
Day-to-Day Ops
- Track and remediate vulnerabilities, including OS/kernel upgrades.
- Optimise resource usage and costs through tuning and automation.
Required Skills & Experience
Technical Expertise
- 5+ years' experience as an Infrastructure Engineer, SRE, or Cloud Engineer in enterprise environments.
- Proven expertise in Google Cloud Platform (GCP) / AWS
- Strong skills in VM workloads, Terraform (IaC) and Kubernetes (GKE or equivalent).
- Hands-on with Ansible (or equivalent configuration management).
- Broad knowledge of networking, databases, Linux systems, and hybrid infrastructure.
- Experience with monitoring, observability, and incident response tooling.
Delivery & Process
- Demonstrated success implementing Kanban and Agile delivery within Infra/Cloud Ops an advantage
- Ability to document and enforce technical standards.
- Proven track record of improving operational performance and cross-team delivery.
Soft Skills
- Strong communicator, able to influence both technical engineers and senior stakeholders.
- Organised and pragmatic, with strong ownership and accountability.
- Collaborative style that balances governance with delivery speed.
Preferred Qualifications
- GCP Professional Cloud Architect or equivalent certification, or equivalent hands-on experience.
- Experience in regulated environments (PCI-DSS, ISO, SOC2, etc.) an advantage
- DevOps / SRE hands-on experience
Engagement Details
- Location: Dublin (Hybrid – typically 2 days in office / 3 days remote).
- Contract Type: Full-Time, Permanent.
- Working Model: Hands-on engineering
- On-Call: Participation in rota with additional allowance.
Global Payments Inc. is an equal opportunity employer. Global Payments provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, sex (including pregnancy), national origin, ancestry, age, marital status, sexual orientation, gender identity or expression, disability, veteran status, genetic information or any other basis protected by law. If you wish to request reasonable accommodations related to applying for employment or provide feedback about the accessibility of this website, please contact
Infrastructure Engineer
Posted today
Job Description
About Moonvalley
Moonvalley's mission is to solve Visual Intelligence in the age of generative AI. We are building technology that can tell stories, scale creativity, and understand both the physics and semantics of the world. With Marey, our first high-definition foundation model trained exclusively on licensed data, we are powering the next era of cinematic, commercial, and enterprise-grade creation.
Our team is an unprecedented convergence of talent across industries. Our elite AI scientists from DeepMind, Google, Microsoft, Meta & Snap have decades of collective experience in machine learning and computational creativity. We have also established the first AI-enabled movie studio in Hollywood, filled with accomplished filmmakers and visionary creative talent. We work with the top producers, actors, and filmmakers in Hollywood as well as creative-driven global brands. So far we've raised over $100M from world-class investors including General Catalyst, Bessemer, Khosla Ventures & Y Combinator – and we're just getting started.
Job Summary
We're hiring an Infrastructure Engineer to design and maintain the systems that power Moonvalley's generative AI research and product development. You'll be joining at a pivotal moment, helping to define the foundations of our infrastructure as we train and deploy cutting-edge video foundation models.
In this role, you'll work closely with researchers, engineers, and cross-functional partners to ensure our infrastructure is scalable, reliable, and efficient. From managing GPU clusters to optimizing ETL pipelines, you'll be instrumental in ensuring the technical performance and productivity of our entire AI platform.
What you'll do
- Build, manage, and scale GPU infrastructure using tools like Kubernetes, Terraform, or Pulumi
- Maintain and optimize ETL pipelines using Spark, Ray, or Airflow
- Operate and improve our telemetry and monitoring stack (Datadog, Grafana, Weights & Biases)
- Manage CI/CD pipelines and development tooling (GitHub, PyTorch, Python)
- Track and optimize datasets, checkpoints, compute utilization, and related assets
- Automate repetitive tasks to improve efficiency and reduce friction across engineering workflows
- Participate in an on-call rotation to resolve infrastructure issues and ensure uptime
- Provide tooling, documentation, and support to accelerate internal engineering productivity
What we're looking for
- Strong generalist with experience managing large-scale, high-performance infrastructure
- Skilled in designing scalable systems for compute, data, and developer tooling
- Comfortable in high-urgency environments with the ability to prioritize for impact
- Familiar with infrastructure stacks for AI model training and experimentation
- Experienced with Kubernetes, Terraform/Pulumi, Spark/Ray, and observability tools
- Pragmatic problem-solver who favors automation and simplicity over complexity
- Open to using and contributing to open-source tooling when appropriate
- Bonus: experience as a Cluster Engineer, Data Engineer, or Developer Advocate in AI/ML environments
What we offer (compensation & benefits)
- Competitive salary and equity
- Private health coverage
- Pension contribution
- Unlimited paid vacation
- Fully-distributed, async-first culture
- Hardware setup of your choice
- Stipends for phone, internet, and meals
In our team, we approach our work with a dedication similar to that of Olympic athletes. Anticipate occasional late nights and weekends dedicated to our mission. We understand this level of commitment may not suit everyone, and we openly communicate this expectation.
If you're motivated by deeply technical problems, a seemingly never-ending uphill battle and the opportunity to build (and own) a generational technology company, we can give you what you're looking for.
All business roles at Moonvalley are hybrid positions by default, with some fully remote depending on the job scope. We meet a few times every year, usually in London, UK or North America (LA, Toronto) as a company.
If you're excited about the opportunity to work on cutting-edge AI technology and help shape the future of media and entertainment, we encourage you to apply. We look forward to hearing from you.
The statements contained in this job description reflect general details as necessary to describe the principal functions of this job, the level of knowledge and skill typically required and the scope of responsibility. It should not be considered an all-inclusive listing of work requirements. Individuals may perform other duties as assigned, including work in other functional areas to cover absences, to equalize peak work periods, or to otherwise balance organizational work
Moonvalley AI is proud to be an equal opportunity employer. We are committed to providing accommodations. If you require accommodation, we will work with you to meet your needs.
Please be assured we'll treat any information you share with us with the utmost care, only use your information for recruitment purposes and will never sell it to other companies for marketing purposes. Please review our privacy policy and job applicant privacy policy located here for further information.