Platform Engineer


Job ID:

465752

Location:

Kitchener, ON 

Category:

Computers/Info Tech, Engineering, Other

Postal Code:

N2G1B9

Employment Type:

Full Time

Posted:

02.05.2024

Job Description:

Platform Engineer - Platform Guild
 
Smile.io is the world’s largest loyalty platform, providing easy-to-use reward programs that help ecommerce brands scale and turn one-time buyers into loyal, repeat customers. Over 100,000 brands use Smile to turn transactional buyers into passionate repeat shoppers.
 
Smile.io is seeking a highly skilled and experienced Platform Engineer with a strong background in Site Reliability Engineering to join our dynamic Platform Engineering Guild. As a Platform Engineer, you will play a pivotal role in designing, building, and maintaining our platform infrastructure, ensuring its reliability, scalability, and performance. You'll collaborate closely with our cross-functional engineering squads to optimize our systems and processes, enhancing the overall stability and efficiency of our platform. This remote role is full-time. 
 
We invite you to join our mission to create technical solutions that securely, reliably, and quickly provide value to our customers. We're a fully remote company and believe in giving you the freedom to work from anywhere. Our team spans the globe, and we’re as connected as ever.
 
Our engineering culture is rooted in code ownership, pride in what we ship, and the agile methodology. Currently, we're focused on elevating our cloud infrastructure capabilities, and we’re looking for someone who can join our existing team as an individual contributor.
 
About the Team:
The Platform Engineering Guild was born out of our previous Site Reliability Engineering team. The guild is focused on providing a secure, reliable, and performant cloud infrastructure that enables Smile to deliver product updates and other enhancements to our merchants and their customers quickly and efficiently.
 
We're looking forward to receiving your application and learning more about how you can contribute to making Smile.io the go-to platform for customer loyalty solutions.

 

Job Requirements:

Role Responsibilities:
  • Platform Infrastructure: Architect, develop, and maintain robust and scalable infrastructure solutions, leveraging best practices in cloud technologies (e.g., AWS) and container orchestration (e.g., Kubernetes).
  • Site Reliability Engineering: Implement SRE principles to enhance the reliability, resilience, and availability of our platform. Establish monitoring, alerting, and incident response procedures to proactively address issues and minimize downtime.
  • Automation & Tooling: Drive automation initiatives to streamline deployment, configuration, and monitoring processes. Develop and maintain tools to support continuous integration and continuous deployment (CI/CD) pipelines.
  • Performance Optimization: Identify performance bottlenecks and optimize system components for improved scalability and efficiency. Conduct regular performance assessments and implement enhancements as needed.
  • Collaboration & Mentorship: Collaborate closely with engineering squads to understand their infrastructure needs and provide guidance on best practices. Mentor junior team members and contribute to the development of a strong engineering culture.
  • Security & Compliance: Work closely with the security working group to implement and maintain robust security measures and ensure compliance with industry standards and regulations.
  • On-call: Participate in the on-call rotation, responding to and resolving incidents as they arise.

What you'll help us achieve:
  • Help scale, operate, and improve the loyalty layer of the internet with our teams.
  • Review code and provide thoughtful, constructive feedback for your team.
  • Collaborate closely with Product, Design, and partner Engineering teams to build new features.
  • Analyze how your features are performing in production to learn and make the product better.
  • Support a highly available service used by millions of users daily.
  • Write, test, instrument, document, and maintain infrastructure as code, automation, and software for Smile.io.

Who you are:
  • Bachelor's or Master's degree in Computer Science, Engineering, or related field.
  • Experience as a Platform Engineer, with a focus on SRE practices.
  • Extensive experience with the Linux operating system.
  • Proficiency with a cloud platform (AWS) and with container tooling and orchestration (Docker, Kubernetes).
  • Strong scripting and programming skills (Bash, Python, Go, etc.) for automation and tooling.
  • Experience with infrastructure as code (Terraform, Packer, etc.) and CI/CD pipelines, preferably GitHub Actions.
  • Experience with DevOps and GitOps practices and tools (e.g., ArgoCD, Helm, Kaniko), Git branching strategies, and blue-green and/or canary deployments.
  • Deep understanding of networking, security principles, and best practices.
  • Excellent problem-solving abilities and a proactive approach to troubleshooting complex systems.
  • Strong communication skills and the ability to collaborate effectively in a cross-functional team environment.

Bonus if you have:
  • An existing AWS certification.
  • Strong skills with AWS services such as EKS, Lambda, CloudWatch, API Gateway, SQS, SNS, MSK (Kafka), VPC/Networking/Peering, EC2, ElastiCache, RDS (PostgreSQL), Route 53, SSM, KMS, and IAM.
  • Hands-on experience maintaining Infrastructure as Code, Terraform modules, Terragrunt, and Terraform unit testing.
  • Knowledge of secure coding practices, including OWASP guidance, secrets management (e.g., HashiCorp Vault), and vulnerability remediation.
  • Prior experience in the e-commerce industry.

Apply here: https://jobs.lever.co/Smile.io/9a59cd2a-ed8c-4773-8271-f057c6304ad2/apply
 
Company Info
Smile.io
305 King St W, Kitchener, ON, Canada

Phone: +18556999322
Web Site: https://smile.io
