Svitla Systems Inc. is looking for a Senior Site Reliability Engineer for a full-time position (40 hours per week) in Chile. Our client is a venture-backed, B2B enterprise software company based in Silicon Valley. They offer a machine learning data catalog to help people find, understand, and trust data across their organizations. Organizations across various industries use the client’s platform, including finance, healthcare, retail, and technology. The platform aims to empower data-driven organizations by improving data visibility, collaboration, and governance. Thanks to its powerful behavioral analysis engine, built-in collaboration capabilities, and open interfaces, the solution combines machine learning with human insight to tackle even the most demanding data and metadata management challenges. More than 450 enterprises, including Cisco, Nasdaq, Pfizer, Salesforce, and Virgin Australia, build data culture and improve data-driven decision-making with the client’s platform.

You will ensure the software platform’s operational efficiency, scalability, and reliability. You will work closely with the engineering, operations, and customer support teams to design, develop, and maintain the infrastructure, focusing on operational processes and incident response. Your primary focus will be improving the services’ availability, performance, and security.

This is a 24/7 rotational role with on-call responsibilities. You will work a standard work week with two designated off-days that rotate throughout the year to ensure optimal work-life balance.

Requirements

  • 8+ years of experience in SRE or equivalent roles, with a strong background in cloud computing (AWS/GCP), containerization (Kubernetes/Docker), and automation tools.
  • Strong knowledge of Linux, networking fundamentals, and system administration principles.
  • Strong understanding of cloud computing, containerization, and microservices.
  • Strong knowledge of at least one programming language, such as Python, Go, or JavaScript.
  • Experience with monitoring and logging frameworks (e.g., Prometheus, Grafana) and data visualization tools.
  • Excellent problem-solving skills, with the ability to analyze complex issues and provide actionable recommendations.
  • Effective communication & collaboration skills, with experience working in cross-functional teams.

Nice to have

  • Understanding of infrastructure automation tools such as Terraform, Ansible, or Chef.
  • Experience in routine task automation with JavaScript and Python.
  • Experience in setting up an observability platform (e.g., Prometheus, Grafana, Datadog, New Relic).
  • Understanding of continuous integration and continuous delivery (CI/CD) pipelines.

Responsibilities

  • Design, build, deploy, and maintain highly available, scalable, and secure infrastructure components (e.g., Kubernetes clusters, containerized applications) on cloud providers like AWS or GCP.
  • Respond promptly to production incidents, collaborating with cross-functional teams to identify root causes, implement fixes, and communicate updates to stakeholders.
  • Analyze system performance metrics to predict capacity needs, plan for scaling, and execute upgrades or reconfigurations as necessary.
  • Develop scripts, tools, and automation frameworks (e.g., Ansible, Terraform) to streamline infrastructure management tasks, reduce manual effort, and improve overall efficiency.
  • Work closely with engineering teams to design scalable systems, participate in code reviews, and contribute to developing documentation and knowledge bases.
  • Implement monitoring tools (e.g., Prometheus, Grafana) to track system performance, identify areas for improvement, and provide actionable feedback to stakeholders.
  • Ensure that all infrastructure components meet Alation’s security standards and compliance requirements.

We offer

  • Competitive compensation based on skills and experience.
  • Flexibility in the workspace and remote-friendly culture.
  • Free webinars, meetups and conferences organized by Svitla.
  • Awesome team, friendly and supportive community!

About Svitla

Svitla Systems is a trusted global IT solutions company headquartered in California, with business and development offices throughout the US, Latin America, Europe, and Asia. Svitla is an outspoken advocate of workplace flexibility, best known for its well-established remote culture, individual approach to its teammates’ professional and personal growth, and family-like environment.

Since 2003, Svitla has served a wide range of clients, from innovative start-ups in California to large corporations such as Ingenico, Amplience, InvoiceASAP, and Global Citizen. At Svitla, developers work directly with clients’ teams and integrate seamlessly with on-site processes, building lasting and successful partnerships.

Svitla Systems’ global mission is to build a business that contributes to the well-being of our partners, personnel and their families, improves our communities, and makes a lasting difference in the world. Join us!

If you are interested in our vacancy, please send your CV.
We will be happy to see you in our friendly team :)

Let's meet in person

Daria Verbovska
Recruiter