Svitla Systems Inc. is looking for a Senior DevOps Engineer with Azure and Datadog for a full-time position (40 hours per week) in Argentina.
Our client is a leading expert network that provides business and government professionals with opportunities to communicate with industry and subject-matter experts to answer research questions. Customers consult with these experts over the phone and in person at conferences, teleconferences, custom events, and workshops. They may also gather their primary research data through surveys, polls, or web-based offerings. Experts are categorized into six main industry sectors: healthcare; financial and business services; consumer goods and services; energy, industrials, and basic materials; tech, media, and telecom; and legal and regulatory. Since 2003, the company has provided its customers with primary research services, helping professionals comprehensively understand a topic before making significant investment and/or business decisions. Their multinational customer list includes nine of the top 10 consulting firms, hundreds of hedge funds, and many of the largest private equity firms and Fortune-ranked companies.
The engineering team thrives on solving complex challenges and continuously improving the systems that power the services. You will join the CloudOps team and help drive the optimization and performance of the infrastructure monitoring and observability practices. As a Datadog Administrator, you will manage, maintain, and optimize Datadog for comprehensive monitoring and observability across the Azure infrastructure, Kubernetes environments, and application services. You will play a key role in ensuring the cloud-based systems’ high availability, reliability, and performance by leveraging Datadog’s monitoring, alerting, and automated remediation tools.
Requirements
- 5+ years of experience with cloud platforms (AWS, GCP, Azure).
- 3+ years of strong experience with Datadog as a monitoring and observability platform.
- At least 2 years of hands-on experience managing and optimizing Azure cloud infrastructure.
- Experience in automation using Datadog’s integration features (alerts, monitoring dashboards, and automated remediation).
- 2+ years of experience in Azure cost management (FinOps) and cloud cost optimization practices.
- Understanding of scripting with Bash, PowerShell, Python, or similar languages.
- Strong troubleshooting and debugging capabilities in an agile software development environment.
- Strong problem-solving skills and a proactive approach to system monitoring and issue resolution.
- Excellent interpersonal and communication skills for cross-team collaboration.
- Independent and self-motivated with the ability to drive tasks to completion.
- Team-oriented with a collaborative mindset and able to work in a fast-paced, agile environment.
- Strategic thinking with the ability to balance operational needs with long-term goals.
- Ownership of tasks and a strong sense of accountability.
- Proven experience managing projects and meeting deadlines while maintaining high-quality standards.
- Ability to prioritize tasks effectively and exhibit good judgment when managing resources.
- Upper-intermediate level of English.
Nice to have:
- Experience with other monitoring tools (e.g., Splunk, Prometheus, Zabbix).
- Knowledge of Infrastructure as Code using tools like Terraform, ARM templates, or Azure CLI is a huge plus.
- Azure Solutions Architect Expert certification or equivalent.
- Azure Security Engineer certification (Associate level).
- Familiarity with Ansible for automation and configuration management.
- Advanced knowledge of Kubernetes and container orchestration best practices.
- Experience in CI/CD pipelines and integrating Datadog with DevOps processes.
Responsibilities:
- Take full ownership of Datadog to monitor infrastructure, services, and applications across multiple environments (production, development, test). Ensure optimal configurations for observability and alerting.
- Using Datadog, monitor infrastructure and application performance, identify potential issues, and create automated remediation workflows to resolve them.
- Use Datadog and other cloud tools to optimize and monitor Azure cloud costs, tracking and improving resource usage and cost efficiency.
- Leverage Datadog’s alerting system and integrations to automate the remediation of common infrastructure and application issues.
- Collaborate with CloudOps and Engineering teams to monitor and optimize Kubernetes environments, ensuring containers, pods, and services run efficiently.
- Work closely with Engineering, AppOps, and CloudOps teams to address complex infrastructure challenges, ensuring smooth deployments and high availability.
- Ensure security and compliance best practices are followed for monitoring and logging, participating in security audits and incident response activities as required.
- Support the automation and deployment of infrastructure using tools like Terraform and Azure Resource Manager (ARM).
- Contribute to FinOps activities by tracking resource usage and optimizing cloud costs, providing data-driven insights into cost-saving opportunities.
- Continuously review and improve monitoring configurations, workflows, and processes for maximum efficiency, performance, and security.
We offer
- US and EU projects based on advanced technologies.
- Competitive compensation based on skills and experience.
- Annual performance appraisals.
- Remote-friendly culture and no micromanagement.
- Bonuses for recommendations of new employees.
- Bonuses for article writing, public talks, and other activities.
- 15 vacation days, 10 national holidays, sick leave.
- Platzi unlimited training account.
- Free webinars, meetups and conferences organized by Svitla.
- Fun corporate celebrations and activities.
- Awesome team, friendly and supportive community!
About Svitla
Svitla Systems is a globally trusted IT solutions company headquartered in California, with business and development offices throughout the US, Latin America, Europe, and Asia. Svitla is an outspoken advocate of workplace flexibility, best known for its well-established remote culture, individual approach to teammates' professional and personal growth, and family-like environment.
Since 2003, Svitla has served a wide range of clients, from innovative start-ups in California to large corporations such as Ingenico, Amplience, InvoiceASAP, and Global Citizen. At Svitla, developers work directly with clients' teams, building lasting and successful partnerships through seamless integration with on-site processes.
Svitla Systems’ global mission is to build a business that contributes to the well-being of our partners, personnel and their families, improves our communities, and makes a lasting difference in the world. Join us!