Filters (Clear filters)
Salary
Categories
Prometheus
Add
Company
Work model
Employment type
Find your next tech job
Most relevant

Prometheus jobs

Lead Software Engineer (PostgreSQL)Lead Software Engineer (PostgreSQL)
Tripadvisor
London, UK
Kubernetes
Postgres
Prometheus
Jenkins
AWS
Software engineer
Grafana
Python
Docker
GitLab
Agile
GitHub
Typescript
Cloud
Ansible
Posted 2 days ago
Linux System EngineerLinux System Engineer
Geneva Trading
Chicago Office
Kubernetes
Network
GCP
Prometheus
AWS
Grafana
Python
Docker
Terraform
Bash
Salt
Cloud
Linux
Ansible
Posted 3 days ago
Principal Cloud Engineer - Remote USPrincipal Cloud Engineer - Remote US
Seamless.AI
United States
Kubernetes
AI
Network
Solutions Architect
DevOps
Prometheus
AWS
EC2
Docker
Terraform
Search
Big Data Engineer
S3 Bucket
Lambda
Cloud
Sales
Posted 3 days ago
Lead Cloud Engineer NewLead Cloud Engineer New
Kobie
India
Kubernetes
Azure
GCP
DevOps
Jenkins
Prometheus
AWS
Docker
GitLab
Cloud
Ansible
Chef
Posted 4 days ago
Site Reliability Engineer-IIISite Reliability Engineer-III
Innovaccer Inc.
Noida, Uttar Pradesh, India
Kafka
Site reliability engineer
GCP
Linux
Architect
Cloud
ElasticSearch
DevOps
Python
AWS
Postgres
MongoDB
Kubernetes
Azure
Prometheus
Jenkins
Posted 6 days ago
Senior DevOps EngineerSenior DevOps Engineer
Altium
Poland Remote
Terraform
Site reliability engineer
S3 Bucket
Linux
Cloud
DevOps
EC2
AWS
Docker
GitLab
nginx
Kubernetes
Prometheus
Jenkins
Posted 6 days ago
Senior DevOps EngineerSenior DevOps Engineer
Altium
United Kingdom Remote
Terraform
Site reliability engineer
S3 Bucket
Linux
Cloud
DevOps
EC2
AWS
Docker
GitLab
nginx
Kubernetes
Prometheus
Jenkins
Posted 6 days ago
Software Engineer 2 - Docker/Kubernetes/Ansible/ArgoCD/HelmSoftware Engineer 2 - Docker/Kubernetes/Ansible/ArgoCD/Helm
Captivation Software
Annapolis Junction, MD
$130k - $270k
Docker
Grafana
Helm
Software engineer
Kubernetes
Prometheus
Ansible
Posted 8 days ago
AI & Software Engineer Lead - BackendAI & Software Engineer Lead - Backend
PMG
Dallas, TX
$100k - $130k
Marketing
Grafana
Node.js
GCP
Cloud
Solutions Architect
LLM
Postgres
Back-end
Javascript
Kubernetes
Django
GitHub
Prometheus
Terraform
Express.js
Software engineer
React
AI
Python
Ansible
AWS
MongoDB
Docker
Flask
Git
Scrum
MySql
Redis
Posted 11 days ago
Senior Staff Cloud EngineerSenior Staff Cloud Engineer
Elite Technology
US - Remote
$150k - $200k
Grafana
Terraform
Architect
Cloud
DevOps
Python
CircleCi
GitLab
Kubernetes
GitHub
Azure
Prometheus
Datadog
Posted 14 days ago
Cloudstack Engineer - Public Cloud Scalability TeamCloudstack Engineer - Public Cloud Scalability Team
Leaseweb
Amsterdam
Python
API
DevOps
Developer
Apache
Bash
Cloud
Shell
Kubernetes
Chef
Docker
Scrum
Prometheus
Linux
Jenkins
Grafana
Java
Git
Posted 15 days ago
Senior Solutions Architect, APAC (Remote, India)NewSenior Solutions Architect, APAC (Remote, India)New
Grafana Labs
India (Remote)
$6M - $7M
Grafana
Sales
Helm
GCP
Kubernetes
Prometheus
Azure
Open Source
Cloud
Solutions Architect
AWS
Posted 15 days ago
Senior Solutions Architect, APAC (Remote, Australia)Senior Solutions Architect, APAC (Remote, Australia)
Grafana Labs
Australia (Remote)
$181k - $218k
Grafana
Sales
Helm
GCP
Kubernetes
Prometheus
Azure
Open Source
Cloud
Solutions Architect
AWS
Posted 15 days ago
Site Reliability Engineer II (SRE II)Site Reliability Engineer II (SRE II)
OppFi
Remote
$102k - $153k
Grafana
Terraform
Node.js
Site reliability engineer
C
GCP
Linux
Cloud
Chef
Bash
DevOps
Ansible
Python
AWS
Ruby
CircleCi
Javascript
Splunk
Kubernetes
GitHub
Prometheus
Azure
Java
Datadog
Posted 16 days ago
Senior Cloud Platform EngineerSenior Cloud Platform Engineer
Collectors
Guadalajara, Jalisco, Mexico
AWS
GitHub
Cloud
Architect
Sales
Lambda
S3 Bucket
DevOps
Jenkins
GCP
Docker
CircleCi
Kubernetes
Datadog
Prometheus
Terraform
Video
DynamoDB
EC2
Agile
Posted 16 days ago
Data EngineerData Engineer
Xebia
Xebia
Django
Front-end
CSS
Postman
Redis
Kotlin
ML Engineer
Back-end
Android
Datadog
Ruby
AWS
HTML
Azure
Data Analyst
MySql
C
Typescript
Python
Jenkins
Java
Git
Terraform
Maven
Kafka
GCP
Express.js
GraphQL APIs
Swift
Linux
Data science
Kubernetes
AI
Node.js
Ansible
Grafana
Big Data Engineer
Helm
jUnit
Computer Vision
BlazeMeter
Vue.js
API
Prometheus
Business Intelligence
Postgres
Gradle
Flask
SQL Server
Angular
ElasticSearch
.NET
Shell
Scrum
PHP
Jira
Golang
Agile
Marketing
Javascript
Cloud
Databricks
Docker
GitHub
REST APIs
iOs
Oracle
ESLint
Spring Boot
SQL
Data Warehouse
DevOps
CircleCi
React
MongoDB
Posted 16 days ago
Senior DevOps EngineerSenior DevOps Engineer
Duetto Research
USA
Terraform
Site reliability engineer
S3 Bucket
API
React
GCP
jQuery
Chef
Cloud
Bash
DevOps
EC2
Ansible
AWS
Ruby
MongoDB
Javascript
GitHub
Java
Azure
Prometheus
Lambda
Datadog
Jenkins
Posted 17 days ago
Senior DevOps Engineer NewSenior DevOps Engineer New
Arine
Remote (United States of America) - Must Be Available Pacific Time Zone
$150k - $170k
ML Engineer
Terraform
S3 Bucket
AI
Cloud
Bash
DevOps
EC2
Python
Solutions Architect
AWS
Docker
Git
Data science
Javascript
Prometheus
Lambda
Datadog
Jenkins
Posted 17 days ago
Senior Site Reliability EngineerSenior Site Reliability Engineer
Aerospike
Bengaluru, India
Grafana
Terraform
Site reliability engineer
Linux
Unix
Cloud
Bash
ElasticSearch
DevOps
Python
Solutions Architect
AWS
Docker
Kubernetes
Azure
Prometheus
Datadog
Developer
Posted 17 days ago
Senior Solutions Architect (Remote, EST)Senior Solutions Architect (Remote, EST)
Grafana Labs
United States (Remote)
$165k - $200k
Grafana
Cloud
Helm
Solutions Architect
Open Source
Prometheus
Golang
AWS
GCP
Azure
Sales
Kubernetes
Typescript
Posted 17 days ago
Senior Site Reliability Engineer, RunwaySenior Site Reliability Engineer, Runway
GitLab
Remote, APAC; Remote, EMEA
GCP
Site reliability engineer
Terraform
AI
Golang
Back-end
Kubernetes
GitLab
Cloud
Prometheus
AWS
DevOps
Grafana
Posted 18 days ago
Dev Ops EngineerDev Ops Engineer
AI Squared
India
Git
Azure
AI
Docker
Python
Kubernetes
GitHub
Bash
Cloud
Prometheus
AWS
DevOps
Grafana
Sales
ML Engineer
Posted 19 days ago
Senior Software Engineer ISenior Software Engineer I
Freenome
Remote
$131k - $201k
GCP
Terraform
Azure
AI
Back-end
Docker
Python
Kubernetes
Cloud
Prometheus
Software engineer
Grafana
ML Engineer
Apache
Posted 19 days ago
Staff Site Reliability EngineerStaff Site Reliability Engineer
Wikimedia Foundation
Remote
$129k - $200k
Site reliability engineer
Terraform
C
Open Source
Docker
Ansible
Python
Kubernetes
Helm
Tensorflow
Prometheus
DevOps
Grafana
PyTorch
ML Engineer
Posted 19 days ago
IT & Security Admin in DevOps TeamNewIT & Security Admin in DevOps TeamNew
April
Tel Aviv
Site reliability engineer
DevOps
AWS
grpc
Docker
Kubernetes
Grafana
GCP
Linux
Prometheus
Azure
Terraform
Git
Cloud
Python
Bash
Posted 20 days ago
Data EngineerData Engineer
Roadie
REMOTE
GitHub
Docker
Business Intelligence
Cloud
Postgres
Kafka
SQL
Swift
Redis
Objective-C
Data Warehouse
Golang
Helm
Apache
Prometheus
CircleCi
Front-end
Ruby on rails
AWS
Data science
React
DevOps
Big Data Engineer
Terraform
Kubernetes
Java
Git
Grafana
Databricks
Python
Android
Network
Posted 22 days ago
Senior DevOps EngineerSenior DevOps Engineer
LivePerson
Poland
Cloud
Grafana
Python
DevOps
Network
Terraform
Kubernetes
AI
Bash
GCP
Prometheus
Posted 23 days ago
Principal, Software EngineerPrincipal, Software Engineer
Broadvoice
Portugal (Remote)
Typescript
Node.js
DevOps
Prometheus
Cloud
GCP
GraphQL APIs
Kubernetes
Ruby on rails
Front-end
Agile
Grafana
AWS
Software engineer
Azure
Docker
Posted 24 days ago
Senior Software Engineer - PlatformSenior Software Engineer - Platform
Prodigal
Mumbai (Powai)
Software engineer
Architect
EC2
MongoDB
Databricks
Lambda
Postgres
Python
Prometheus
AWS
Data Warehouse
Golang
AI
ML Engineer
DynamoDB
GPT
SQL
Docker
Kubernetes
DevOps
Posted 27 days ago
Senior Software EngineerSenior Software Engineer
Prodigal
Bengaluru
AI
Back-end
React
S3 Bucket
Databricks
Software engineer
Lambda
MongoDB
Prometheus
Grafana
Python
Redis
SQL
ML Engineer
AWS
Node.js
EC2
Posted 27 days ago
Lead Site Reliability EngineerLead Site Reliability Engineer
Roadie
REMOTE
Ruby on rails
Agile
ElasticSearch
Architect
Git
Redis
DevOps
Grafana
Network
Swift
Ruby
GCP
Golang
Objective-C
Prometheus
Kafka
Terraform
AWS
Postgres
Docker
React
CircleCi
Kubernetes
Android
Helm
Python
Site reliability engineer
S3 Bucket
Bash
Posted 29 days ago
Staff Site Reliability Engineer tags.newStaff Site Reliability Engineer tags.new
primer.ai
Pasadena, California, United States; Remote; San Francisco, California, United States; Washington, District of Columbia, United States
$180k - $230k
AI
Datadog
DevOps
Linux
Prometheus
Cloud
Kubernetes
Python
Site reliability engineer
Architect
AWS
Bash
Posted 29 days ago
Backend EngineerBackend Engineer
PartsTech
Remote - Lisbon, Portugal
Jenkins
ML Engineer
Rust
MongoDB
DynamoDB
SQL
jUnit
MySql
ElasticSearch
Front-end
Spring Boot
Software engineer
NLP
Git
GitLab
Redis
Node.js
Grafana
Search
GraphQL APIs
Cloud
Prometheus
grpc
Back-end
Kafka
AWS
AI
Kotlin
CircleCi
Helm
Kubernetes
Java
Python
GitHub
MariaDB
API
Posted 1 month ago
Senior Solutions Architect, EMEA (Remote, UK)Senior Solutions Architect, EMEA (Remote, UK)
Grafana Labs
United Kingdom (Remote)
€100k - €121k
Solutions Architect
Sales
Prometheus
Cloud
Grafana
Kubernetes
Helm
AWS
Azure
Open Source
GCP
Posted 1 month ago
Senior Site Reliability EngineerSenior Site Reliability Engineer
Reach Financial
United States
Datadog
Cloud
Python
Docker
AWS
DevOps
Javascript
Grafana
GitHub
Prometheus
Site reliability engineer
Typescript
Posted 3 months ago
Published: 2025-05-07  •  London, UK
AWS
Docker
Kubernetes
Typescript
Python
Postgres
Cloud
Software engineer
GitHub
GitLab
Agile
Grafana
Prometheus
Jenkins
Ansible
On-site
Full-time

We believe that we are better together, and at Tripadvisor we welcome you for who you are. Our workplace is for everyone, as is our people powered platform. At Tripadvisor, we want you to bring your unique perspective and experiences, so we can collectively revolutionize travel and together find the good out there.

 

The Site Operations team at Tripadvisor is responsible for maintaining and enhancing the core systems that power and support the tripadvisor.com website. This includes systems in both private data centers and over a hundred accounts in AWS. Our scope of responsibilities is vast and would take an entire page to list here. Suffice it to say that we are the go-to team for questions about the interface boundaries that lie between these two halves of the company, as well as the deep inner workings of the legacy half. Data at Tripadvisor is hugely important, and as a result, we have over 600 on-premise logical databases running on over 100 database hosts serving petabytes of data. As a Principal Software Engineer/DBA on the SiteOps team, you will be a force multiplier for our engineering & operations teams, delivering tooling & infrastructure that not only has a direct impact on day-to-day operations but also helps contribute to the future evolution of Infrastructure & Engineering here at Tripadvisor. You'll be part of a dynamic team responsible for ensuring the high availability, reliability, and scalability of our data maintenance and delivery.

 

We are looking for passionate engineers with deep experience in Postgres, as well as AWS DMS, RDS, and Aurora, to help us optimize and automate our infrastructure and deployment processes around our databases. We are currently involved in several types of systems migrations, within both the scope of on-prem to AWS/cloud-native migrations, as well as on-prem data centers to alternate AWS-based data center migrations. As a Principal Software Engineer/DBA, you will be involved in designing and implementing how we perform those migrations, testing those migrations, and then performing them with a “no surprises in production” mindset. In addition, you will have a major role in evolving the infrastructure as code and configuration management we use to both keep the lights on for our existing on-prem databases and transition them into the cloud. This is a business-facing role, and as such, significant leadership and communication experience is required.

 

What you'll do:

  • Infrastructure Automation: Design, implement, and maintain automated infrastructure provisioning and configuration management using Python, Ansible, and Typescript CDK to ensure consistency and scalability.
  • Strong programming skills in these areas is a must have.
  • Monitoring and Alerting: Set up monitoring and logging systems to proactively detect and address potential issues, ensuring optimal performance and reliability, in environments like on-prem Prometheus/Thanos, as well as Grafana Cloud and Loki.
  • Database Management: Manage hundreds of on-prem PostgreSQL databases, including performance tuning, backups, disaster recovery strategies, and their active/passive counterparts in AWS.
  • Collaboration: Work closely with cross-functional teams, including developers, system administrators, and technical managers, to improve the overall development and deployment processes, and keep everyone in sync as to deliverables and timelines.
  • Troubleshooting and Incident Management: Assist in identifying and resolving operational issues and participate in on-call rotations.

 

Skills & Experience:

  • 10 years of expertise in database operations with a focus on building and maintaining scalable infrastructures around data.
  • 5 years of working directly with PostgreSQL at a Senior level is essential.
  • 5 years of experience in leadership and communicating with the business.
  • Strong programming experience with Python is essential
  • Strong problem-solving skills and the ability to work in a fast-paced, agile environment.
  • Solid understanding of AWS-based data management technologies.
  • Experience in configuration management using Ansible.
  • Experience with infrastructure as code using CDK.
  • Understanding of CI/CD tools like Jenkins, GitLab CI, and GitHub Actions.
  • Understanding of networking concepts such as load balancing and DNS is also a plus.
  • Knowledge of containerization technologies like Docker and container orchestration tools such as Kubernetes is a plus.
  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).

 

If you need a reasonable accommodation or support during the application or the recruiting process due to a medical condition or disability, please reach out to your individual recruiter or send an email to [email protected] and let us know the nature of your request. Please include the job requisition number in your message.

  

 

 

#LI-AMCVAY

#LI-Hybrid

#LI-Remote

Looking for talent?

Get in front of thousands of skilled ML/AI Engineers and discover a suitable candidate for your job opening.