Company Overview:
At Forward Earth, we are dedicated to making sustainability and decarbonization core elements of every business. Our software equips partners with carbon management tools that simplify compliance, reduce emissions, and support success in a sustainable economy. We are committed to reducing emissions across products and supply chains to help combat global climate change. Our goal is to automate carbon management, making it accessible and scalable so that users can optimize their processes and achieve substantial emission reductions.

About the Role
We are looking for a Site Reliability Engineer (SRE) to join our team and help ensure the reliability, scalability, and performance of our systems. In this role, you will focus on optimizing infrastructure, automating processes, and collaborating closely with development teams to maintain a highly available and efficient platform.

- 5 years of experience in SRE, DevOps, or a related field.
- Strong understanding of cloud infrastructure (preferably AWS) and services such as EC2, S3, Lambda, and CloudWatch.
- Experience with Infrastructure-as-Code (IaC) tools such as CloudFormation or Terraform.
- Proficiency in Python and Bash scripting for automation.
- Hands-on experience with monitoring and observability tools (e.g., Prometheus, Honeycomb, Grafana, Datadog).
- Strong understanding of CI/CD pipelines and best practices.
- Excellent communication, problem-solving, and collaboration skills.
- Professional-level English and eligibility to work in Germany.
- Berlin-based, or able to relocate from elsewhere in Germany, and available to start soon.

- Reliability Engineering: Implement and maintain monitoring, alerting, and logging systems to proactively identify and address potential issues. Develop and enforce SLOs (Service Level Objectives) and error budgets.
- Incident Response: Participate in incident response efforts, conduct root cause analysis, and implement preventative measures to minimize future occurrences.
- Performance Optimization: Analyze system performance, identify bottlenecks, and implement optimizations to improve efficiency and scalability.
- Automation: Develop and maintain tools and scripts for automating tasks such as deployment, scaling, and incident response to reduce manual effort and improve operational efficiency.
- Collaboration: Work closely with development teams to ensure reliability and performance are considered throughout the software development lifecycle (SDLC).
- Cloud Infrastructure: Manage and optimize our AWS cloud infrastructure, ensuring cost-effectiveness, security, and high availability.

Bonus Points
- Experience with containerization technologies (e.g., Docker, Kubernetes).
- Familiarity with database administration (e.g., PostgreSQL, MySQL).
- Knowledge of security best practices and compliance frameworks.

- A role in a company dedicated to advancing carbon management technology and making a positive impact on the environment.
- Hybrid/remote-flexible work environment (Berlin-based team, but remote-friendly).
- Significant growth potential within a rapidly expanding company.
- A dynamic, supportive work environment that champions innovation, initiative, and diversity.

Ready to make an impact? Apply now.