Lead DevOps Engineer
Ann Arbor, MI
Posted: September 17, 2021
Application Deadline: Open Until Filled
Job DescriptionJob Summary
The Inter-university Consortium for Political and Social Research (ICPSR, https://www.icpsr.umich.edu/icpsrweb/content/about/) is part of the Institute for Social Research (ISR) at the University of Michigan. The consortium maintains the world’s largest archive of social science data with 10,000+ studies relating to education, aging, criminal justice, substance abuse, terrorism, and more. A global leader in data science, ICPSR also supports continuing education in research design, statistics, and data analysis.
ICPSR operates a diverse environment across on-premise and cloud systems, with an emphasis on security. We are looking for an experienced DevOps Engineer with a security bent to help us continue on our journey to systems automation.
You will report to the DevOps team lead in Computing & Network Services.
Provide technical leadership to a team of DevOps engineers and collaborate with a team of architects to set future technical direction
Securely design, implement and operate a Kubernetes-based container orchestration platform hosting application containers in AWS
Securely implement and operate Linux servers and cloud environments, and automate their builds and configuration
Document your work and the computing environment, both for internal use and to satisfy regulatory requirements
Support software development efforts including CI/CD pipelines
On-call availability, as part of a scheduled weekly rotation (25-33%), is required and may involve working during non-business hours and on weekends
Other tasks as assigned
Bachelor’s degree in Computer Science, MIS, or a similar relevant field, or an equivalent combination of education and experience
Demonstrated experience documenting infrastructure design and implementation details
2 or more years demonstrated experience with operations of production services in AWS, GCP, or Azure, and container technology including Kubernetes and Docker
5 or more years demonstrated experience with operations of UNIX/Linux systems, UNIX/Linux performance monitoring and security techniques, including UNIX/Linux system security vulnerabilities and mitigations
Experience with systems automation tools such as Terraform, Helm, Ansible, Packer, or Puppet, and programming/scripting in Python, bash, Powershell, or other high-level programming language
Experience with CI/CD / GitOps tools such as Gitlab, CDK8s, Flux v2, or ArgoCD
CISSP or related certification and experience securing and documenting systems subject to a regulatory regime such as FISMA, HIPAA, SOX, or PCI
2 or more years of experience in operations of Windows systems, and experience with VMWare Horizon VDI, AWS Workspaces, and/or Azure Virtual Desktop
DBA experience with Oracle and/or Postgres
Experience with metrics collection and observability tools such as Prometheus, Grafana, Fluentd, Kibana, Elasticsearch, Logstash, or Beats
Part of this job may require some work outside normal working hours to analyze and correct critical problems that arise in ICPSR's 24 hours per day operational environment.
This role may be underfilled at the senior level. Candidates with lesser experience are encouraged to apply.