DevOps ELK Stack Grafana IoT DevOps New Relic
SensorFlow is at the forefront of IoT-driven building automation, optimizing energy usage across a global fleet of sensors and gateways. We’re seeking a DevOps Engineer with expertise in monitoring, logging, and observability to ensure our cloud infrastructure and worldwide fleet operate with high reliability. In this role, you’ll set up and maintain the infrastructure needed to proactively catch and resolve issues, enabling quick response and maintaining uptime across all our services and devices. Responsibilities: - Design, implement, and manage monitoring, logging, and observability systems to ensure high uptime for our cloud infrastructure and global fleet of IoT devices. - Set up automatic alerting systems to notify the tech support and customer success teams of issues requiring action, focusing on minimizing downtime and optimizing fleet performance. - Collaborate with engineering and support teams to troubleshoot sensor and gateway issues, lead incident response, and drive post-incident analysis to prevent recurrences. - Monitor network health and uptime across all deployed gateways and sensors to ensure consistent performance for customers worldwide. - Extend and enhance existing monitoring and observability tools (currently using New Relic and Grafana) with additional tools like ELK, Prometheus, or Datadog, as needed, to improve visibility and response times. - Develop and maintain documentation for monitoring and incident management processes.
- Bachelor’s degree in Computer Science, Engineering, or a related field. - 5+ years of experience in DevOps, with a focus on monitoring, logging, and observability. - Fluent English - Proficiency with monitoring tools such as New Relic, Grafana, ELK, Prometheus, and Datadog. - Strong understanding of cloud infrastructure and experience setting up and managing alerting and logging systems. - Hands-on experience with incident management, including setting up automated alerts and troubleshooting network and device issues. - Excellent collaboration skills, with the ability to work cross-functionally with engineering, tech support, and customer success teams. - Strong problem-solving skills and a proactive, detail-oriented mindset.
At SensorFlow, we’re made up of dreamers, achievers, and visionaries whose passion and belief in a greater cause drive us to do more and push the boundaries of innovation every day. If you want to work with fun-loving and diverse personalities in an environment that prioritizes your learning, development, and autonomy, then SensorFlow is the perfect place for you. We also have amazing benefits that include a generous annual leaves, medical coverage, birthday leave, fitness benefits, and Flexi benefits. *Total compensation package will include fixed salary + incentive (according to company's performance)
- Bonus: Experience with Balena or other similar IoT fleet management tool
Direct Manager
Interview with HR -> Interview with Hiring Manager -> Take-home assignment -> Culture fit