Question: How do you run 1000s of long-running and CPU intensive machine learning models every month in a cost-efficient and stable way? What do you do if that number increases by 10x or 100x? Blackwood Seven is a fast growing AI company. And we are looking for a bright and enthusiastic SRE or DevOps engineer to join our team and help scale our infrastructure for the future.
Come and join our Site Reliability Team as we are currently hiring for our Copenhagen office.
You will be in charge of managing our Kubernetes clusters running in AWS, as well as various additional infrastructure, such as databases, logging, monitoring and networking. Together with the rest of the SRE and development teams, you will help develop our infrastructure so it meets the future demands of stability and scale.
Blackwood Seven works with cutting-edge AI-systems and to support those our infrastructure uses some of the newest tech. Our systems are based on a microservice architecture, everything runs in AWS cloud, and we have a large Kubernetes setup for running machine learning models and many other products.We have a “you build it, you run it” approach to building software, meaning your main responsibility will be to provide an infrastructure for other developers.
You will also be developing and managing our build and deployment systems, where the developers define their deployments.
Finally you will be building tools and helping out the developers deliver value faster.
The team works in close collaboration with the AI team and the Development team - all product development teams are located in our Copenhagen office.
Requirements:
Linux administration experience (Primarily Debian)
Kubernetes experience (Orchestrated by kops)
Docker experience
AWS experience
Networking experience
Programming experience in one or more relevant languages (Python, Go, etc.)
Bonus competences:
Unified logging (Graylog)
Distributed systems
Ansible
Vulnerability scanning (Alienvault)
Monitoring (Prometheus)
CI / CD systems
About Blackwood Seven
We are very agile and follow modern software engineering best practices. We believe that technology and creativity are the most powerful tools to disrupt and set the standards of a new era. Most of all, we believe in people. The way that we work is guided by some fundamental values:
Trust
We believe that our colleagues will do their best to perform their tasks and to realize Blackwood Seven’s goals. We believe that our colleagues are trustworthy, hardworking, and honest.
Equal worth
We believe that everyone has the same right to make and influence decisions. This means that we safely trust our colleagues to make the majority of the decisions relevant for the role they are filling in the team.
Transparency
We believe that everyone needs full access to all knowledge in order to have the best possible basis for making decisions. We believe that they may have valuable ideas that we otherwise might miss.
If you see yourself as a part of our team at Blackwood Seven and believe that you can contribute with something valuable, apply for the position!
This job comes with several perks and benefits