Site Reliability Engineer
We are looking for a Site Reliability Engineer who has a strong interest in automation and scalability, who likes to take ownership of projects (i.e., doesn’t want to be micromanaged), and who wants to see the impact they make on Collective[i].
Your primary responsibility will be to research, design, build, and deploy solutions supporting the rollout and maintenance of a large-scale Linux infrastructure, which is the platform for a petabyte-scale data analytics and collaboration environment.
You’ll be an integral part of our tech ops team, which is currently four people and growing. You should have integrity, drive, dedication, ingenuity, a collaborative nature, the ability to laugh, a commitment to excellence, and a get-things-done attitude.
- Research, recommend, and implement solutions in specific subject areas (e.g., infrastructure monitoring; Nagios preferred)
- Identify ideal integration methods with existing infrastructure and tools
- Operate, integrate, and optimize incumbent systems (e.g., Puppet, Foreman, RackTables)
- Build system configurations for at-scale deployment and management
- Debug OS as well as software and hardware issues
- Coordinate implementation and execution of configuration management infrastructure
- Triage system alerts; incorporate solutions into infrastructure
- Contribute to developing and implementing security policies
- Participate in a standard 24×7 “on-call” rotation
- BS in Computer Science or a related field, or equivalent experience
- Experience configuring and using Puppet or similar configuration management infrastructure
- 5+ years working on large-scale (hundreds to thousands of systems), production-grade IT projects
- Proficiency in Linux/UNIX, shell scripting, software development and deployment, system architecture, and technology integration
- A track record of learning and deploying new technologies
- Experience with Hadoop
- Integrity, intelligence, drive, charisma
In lieu of a cover letter, please answer any two of the following questions. Please try to keep each answer to two paragraphs or fewer.
1. If you had five extra hours in every day, what would you do?
2. Pick a product you use every day and tell us how you’d make it better.
3. What subject makes your eyes light up when you talk about it?
4. Tell us what sets you apart from others (or include a standard cover letter).
5. Write a program that evaluates how good our job postings are. Please submit your source code, a URL to a demo of your solution, and a short (1–2 sentence) description of what you did. Focus on providing useful answers, not aesthetics.
Also, feel free to concentrate on a single way to solve this problem to show your depth or expertise in a particular domain.
Collective[i] is an equal opportunity employer.