Cloud Engineer II (ucar)

ucar    Building, Lab, Mesa    2024-08-31

Job posting number: #145872 (Ref:REQ-2024-172)

Job Description

Job Description Summary:

UCAR is excited to announce the job opening for a Cloud Engineer II role. This position is responsible for supporting on-premise cloud computing resources and facilitating the use of commercial cloud environments such as AWS, Azure, and Google Cloud. This role focuses on helping users transition to and utilize cloud computing technologies that enhance their workflows, with a particular emphasis on the needs of the HPC community.

The Cloud Engineer will provide system engineering support for CISL’s on-premise cloud resources (including Kubernetes and other software stacks and technologies) and integrate commercial off-premise cloud resources as needed. As part of this role, the engineer will provide system administration support for the on-premise cloud infrastructure, including monitoring and troubleshooting. The engineer is also expected to research and evaluate cloud technologies, and strong, effective communication with stakeholders is a key requirement.

Production systems supported are located at the NCAR Wyoming Supercomputing Center (NWSC) in Cheyenne, Wyoming. Test and research systems are located at the NSF | NCAR Mesa Lab in Boulder, Colorado. The engineer may be required to work at the NWSC during periods of system installation, upgrades, or troubleshooting.

NSF | NCAR’s Computational and Information Systems Laboratory (CISL) is a leader in supercomputing and data services necessary for the advancement of atmospheric and geospace science. CISL’s mission is to remain a leader at the forefront of ensuring that research universities, NCAR, and the larger atmospheric, oceanographic, and related research communities have access to the computational resources they need for their research. To fulfill the need for a stronger workforce at the intersection of High Performance Computing (HPC) and geoscience problems, CISL engages in education and outreach activities to inspire and attract a diverse future workforce.

Position Details:

Visa Sponsored Job:

No

Relocation Assistance Eligible:

Yes

Job Location:

Boulder, Colorado

Position Type & Term:

Full time, Term - 6 months or more (Fixed Term)

Compensation Range:

Salary Range: $85,676 - $107,095

*Final salary and rates are based on education, experience, skills relevant to the role, and internal equity.*

Application Notes

Job Location: Occasional travel to/onsite work at Mesa Lab (Boulder, CO) and NWSC (Cheyenne, WY); Remote option available to eligible hire; Hybrid and Fully Onsite options also available

Position Type & Term: Full Time, Regular; 2-year term with possibility of extension

Application Deadline:

  • This position will be posted until 11:59 PM MT, on Friday, September 13, 2024. Applications will not be accepted past this date.

Required application materials: (preferably in PDF Format)

  • Resume

  • Questionnaire (included in application)

Background Checks: Conducted for candidates selected for hire.

Work Location: Regardless of flexible work arrangements, UCAR requires ALL positions to be performed within the U.S., excluding U.S. Territories.

What You Will Do

The following is a brief summary of the responsibilities of this role.

1. Cloud System Engineering and Development

  • Develops, implements, and documents new features or capabilities in cloud system administration software.

  • Maintains and develops systems software for deploying and managing both on-premise and commercial off-premise cloud resources.

  • Integrates cloud technologies into high-performance supercomputers, clusters, service infrastructures (e.g., JupyterHub, Globus, containers, OSDF), and network fabrics.

  • Performs installation and necessary hardware and software integration for cloud and HPC infrastructure deployments and upgrades.

  • Helps define standards and guidelines for cloud system maintenance, automation, and documentation.

  • Writes code to enhance cloud system management capabilities and automate system administration tasks.

  • Develops acceptance testing criteria and applications for cloud system procurements.

2. Cloud Research and Evaluation

  • Researches new and emerging cloud technologies (e.g., multi-cloud, CI/CD, containers, networks).

  • Evaluates the potential impact of new cloud software and hardware technology on workflows and plans.

  • Makes recommendations for future procurement of cloud hardware and software products, configurations, and functional enhancements or upgrades.

  • Performs evaluations and compiles reports on new cloud hardware and software systems.

  • Participates in design and procurement decisions for cloud systems, including developing systems-level code to support various aspects of cloud infrastructure.

  • Contributes to the RFP process by defining technical specifications and requirements, reviewing decisions, and implementing future procurements.

3. Cloud Operational Monitoring and Troubleshooting

  • Operates and monitors the behavior of cloud-managed supercomputers, clusters, software services/servers, storage systems, and network fabrics to ensure proper and efficient operations.

  • Alerts relevant staff of abnormal conditions or behaviors and takes remedial actions as necessary.

  • Diagnoses and may repair failed cloud software and hardware components, or mentors/assists other staff in such tasks.

  • Provides 24/7 on-call service for troubleshooting and resolving cloud system-related problems.

  • Documents troubleshooting and operational techniques and best practices, and mentors other team members as necessary.

4. Cloud and HPC Systems Administration

  • Provides systems support for diverse cloud and HPC hardware and software architectures.

  • Leads the installation and upgrades of cloud and HPC system hardware and software, including computational systems, clusters, standalone machines, storage systems, and network fabrics.

  • Helps define standards and guidelines for cloud operation and maintenance and produces systems operation and procedural documentation.

  • Compiles, installs, and maintains commercial and open-source cloud application software.

  • Documents cloud system administration tasks and mentors other team members as necessary.

5. Project Management

  • Leads cloud-related team projects utilizing standard project management tools and techniques.

  • Provides project coordination, technical expertise, and planning for cloud system deployment projects under the direction of the HSG group lead.

  • Reviews the tasks of team members and provides guidance as necessary.

  • Participates in cross-group and cross-division cloud projects as necessary, including taking a lead role.

6. Organizational Representation and Reporting

  • Provides regular HSG activity reports to management and contributes to CISL or NSF NCAR annual reports and development plans.

  • Attends group, division, and laboratory meetings and represents HSG and its cloud activities.

  • Represents the group at larger organizational meetings and broader community events as appropriate.

  • May interact at the national level with sponsors and present at conferences.

Who We'd Love To Join Our Team

Successful candidates will ensure their application materials speak to the following criteria:

Education and Experience:

Required:

Bachelor's degree in a computer-related or engineering field and progressive relevant experience, which is typically gained by four to eight years of work experience. Desired, yet not required: Educational background in Computer Science, Mathematics, Computer/Electrical Engineering, Information Sciences, Software Engineering, or equivalent related field.

Desired but not required:

  • Experience with infrastructure as code solutions, such as Ansible.

  • Experience with both on-premise and commercial clouds.

  • Experience with infrastructure for CI/CD workflows.

  • Experience with high-performance computing and related technologies.

  • Experience with project management.

Knowledge, Skills, and Abilities (Required/Desired):

  • Demonstrated skill in the installation, configuration, administration, troubleshooting, and securing of cloud environments.

  • Demonstrated skill in deploying and maintaining infrastructure for services such as AWS, Azure, Google Cloud, and Kubernetes.

  • Demonstrated skill in the configuration and troubleshooting of high-performance network fabrics (e.g., Ethernet, InfiniBand).

  • Demonstrated skill in operating and managing container infrastructure (e.g., Docker, Kubernetes).

  • Proficiency in common scripting and programming languages (e.g., Python, Bash) and general software engineering practices.

  • Strong organizational skills and attention to detail.

  • Excellent written and verbal communication skills, with the ability to write and interpret technical documentation.

  • Effective communication with various teams and stakeholders across the organization and externally.

  • Ability to explain complex technical concepts to individuals with varying technical backgrounds, including risks, controls, and impacts.

  • Active listening skills to understand and address technical needs at a high level of complexity.

  • Demonstrated skill in making formal presentations and advocating for technical solutions.

  • Ability to work collaboratively with teams of different skill levels and backgrounds.

  • Ability to mentor and supervise students, interns, team members, and collaborators.

  • Ability to function effectively within a matrixed, multidisciplinary team.

  • Maintains professional contact with industry members and sponsors.

  • Familiarity with cloud automation and orchestration tools (e.g., Terraform, CloudFormation).

  • Knowledge of cloud security best practices and compliance requirements (e.g., IAM, encryption, GDPR, HIPAA).

  • Familiarity with cloud cost management and optimization strategies.

  • Familiarity with serverless architecture and services (e.g., AWS Lambda, Azure Functions).

  • Understanding of microservices architecture and implementation.

Desired but not required:

  • Occasional travel to the NCAR Wyoming Supercomputing Center, which is approximately 90 miles north of Boulder

  • Periodic 7x24 on-call support in rotation with other staff

  • Providing assessment and feedback on vendor technology roadmaps and RFIs/RFPs to the HSG group head and the HPCD division director

Benefits Overview 

UCAR affirms its commitment to employees through competitive benefits. In addition to medical, dental, vision, retirement, and life insurance, UCAR offers a variety of programs focused on work-life balance and professional and personal development. These include:

  • Tuition Assistance, time off allowance to attend classes, and other professional development opportunities

  • UCAR contributes 10% of your eligible pay into your retirement account; 100% fully vested on day one

  • Starting minimum accrual of 20 days of personal time off each year (prorated for less than full-time positions)

  • 10 paid holidays

  • 10 days of sick leave each year

  • 12 weeks of paid parental leave

  • Short-term medical leave paid at 100% of your regular salary

  • EcoPass for local Colorado residents to use the Denver and Boulder-area transit system at no cost

Commitment to Diversity, Equity & Inclusion

Our organization is committed to creating a diverse, equitable, and inclusive work environment and fostering a culture where everyone feels welcome and supported. To learn more about these efforts, visit the Office of Diversity, Equity & Inclusion Strategic Plan and our Diversity & Inclusion: A Welcoming Workplace site. 

Research shows that women and people of color are less likely to apply for a position if they do not meet almost 100% of the desired skills and experience. Please note this is not necessary! If you meet the minimum requirements and have a passion for the work, you are encouraged to apply. We can provide on-the-job training for the rest!

Commitment to Job Application Fairness

Applicants are not required to provide age or age-related information and may redact information related to age, date of birth, or dates of attendance at or graduation from an educational institution from any submissions during the initial application process.

Some Final Considerations

At UCAR|NCAR|UCP, you will work alongside a dedicated team of professionals conducting critical research and community outreach to solve complex Earth system science problems, including climate change, air pollution, extreme weather, floods, drought, wildfires, and space weather, all with the goal of improving human life and reducing economic loss. Each of us, from scientists to the professionals who support their work, serves the public and a collaborative community of scientists in our mission to understand the complex processes that make up the Earth system, from the ocean floor to the Sun’s core.

Flexible Work

At UCAR, we are committed to supporting our mission by giving staff the flexibility to find the schedule and location that works best to maintain their own work-life circumstances and reach their full potential as professionals. Many positions within our organization are eligible for fully on-site, hybrid, fully remote, and/or flexible work schedules.

Equal Opportunity Employer

UCAR is committed to providing equal opportunity for all employees and applicants for employment and does not discriminate on the basis of race, age, creed, color, religion, national origin or ancestry, sex, gender, disability, veteran status, genetic information, sexual orientation, gender identity or expression, or pregnancy. Whatever your intersection of identities, you are welcome at UCAR.

Export Control

All positions are required to comply with U.S. export compliance regulations and work location requirements regarding access to facilities and research systems.

Visa Wait Times

Please consider the length of visa procurement when applying for this posting, understanding that you will not be able to begin employment until you are able to get a visa and enter the U.S.



Employer Info

Job posting number: #145872 (Ref:REQ-2024-172)
Application Deadline: 2024-09-30
Employer Location: ucar