Sr. Cloud Engineer – Observability

Sr. Cloud Engineer – Observability

ProViso Consulting

Story Behind the Need:

• Business group: Cloud Engineering – Cloud Engineering – main teams are responsible for creating platform (Kubernetes) and all connections for logging monitoring chargeback and setting up all processes around creating/establishing Cloud resources for different projects; creating processes for automating releases; spearheading creation and management of new resources coming to Bank; some teams are responsible for CICD processes – encompassing Bank’s security and auditable processes to go from development/non-prod to production environment; DevOps teams assisting in SDLC; troubleshooting and acting as a point person between development and operations teams.
• Project: Cloud Acceleration Program – moving towards Google Cloud, team will be focusing on virtual learning machines in the Cloud, initiative to move virtual machines from private cloud environment VMWare to GCVE/GCE environments; in initial phases; program will take a few years before is fully completed
• As a Senior Engineer – Observability you are responsible for the operation and maintenance of our logging and metrics infrastructure. You ensure that the cloud platform users are aware of the health and performance of their workloads running on the cloud platforms. You keep our cloud platform stable to provide an essential and dependable service that millions of customers use every day. If you’re passionate about cutting-edge technologies, cloud-native solutions, and driving innovation, this role offers an exciting opportunity to shape the future of digital banking.

Candidate Value Proposition:

• The successful candidate will have the opportunity to work with Google’s latest services and technologies to build enterprise grade solutions.

Typical Day in Role:

• Design, develop, and deploy advanced tooling and systems that enhance the reliability, scalability, and efficiency of our cloud platforms
• You build software to automate infrastructure platform operations and management of cloud platforms
• You find opportunities to improve our cloud platforms by using metrics and monitoring the operations
• Participating in design discussions focused on building robust large scale distributed systems.
• Develop robust, scalable, and efficient observability solutions by providing technical mentorship within the team and lead by example
• Document infrastructure operations processes and insights, identify repeatable actions, and lead the automation of repetitive tasks
• Level 3 support responsibilities are required

Candidate Requirements/Must Have Skills:

• 5+ years experience in loud (with at least 5 in GCP), software or other related engineering roles, with a strong improvement-focused mindset
• 3+ years of hands-on technical working experience with Bash and/or Powershell to develop automation & deployment utilities
• 3+ years’ experience with Operations and Monitoring (Cloud Logging, Cloud Monitoring, Cloud Profiler, Cloud Trace, Log Router, Cloud Audit Logs)
• 2+ years’ experience in in roles focused on observability, monitoring, and systems performance analysis, with a proven track record of technical leadership
• 1+ year experience in deploying and managing monitoring, alerting, and logging systems at massive-scale, such as Prometheus, Grafana, Kibana, OpenTelemetry, ELK, Jaeger etc.

Nice-To-Have Skills:

• 3+ years of hands-on technical working experience of using Continuous Integration/Continuous Delivery (CI/CD) Tools (e.g., Git plus Azure DevOps and/or Jenkins and/or GitHub Enterprise, etc.) for the purposes of maintaining pipelines.
• 1+ years of hands-on technical working experience in the use of Configuration Management & Automation tools (e.g., Saltstack, Perforce’s Puppet and/or Red Hat Ansible and/or Progress® Chef®).
• Advanced proficiency in creating production-ready code in high-level languages, such as Go, Python
• Extensive experience operating large-scale infrastructure in public cloud environments, such as Azure or GCP

Soft Skills Required:

• Self-sufficient, works under the supervision of a more senior engineer.
• Strong communication skills, both written and spoken; of specific importance the strong communication to a technical audience.
• Attention to details, high standards for quality.
• Writing and maintaining related documentation.


• Bachelor’s degree in computer science, engineering, or a related field (master’s degree preferred).

Best VS. Average Candidate:(BNSJP00034426)

• Ideal candidate has experience migrating workloads from existing cloud and on-prem; strong GCP experience

Candidate Review & Selection:

• In person interviews
o 1st round – 30 mins – HM interview
o 2nd round – 1-hour technical panel interview – deep dive on skillset, technical questions on solutioning

Job Details



4.5 months



Latest Blogs

© 2020 ProViso Consulting - Toronto Recruitment and Staffing Agency

× Chat

Send this to a friend