Linux Systems Engineer - Senior HPC

Founded in 2002 and headquartered in Washington DC, Federated IT is dedicated to enhancing national security We leverage our extensive experience and technical expertise to deliver tailored solutions to federal customers. We offer robust, leading-edge information technology and cybersecurity solutions Our project portfolio includes the customization and delivery of optimized cloud computing, data center operations and migration, enterprise architecture, scientific research and analyses, and cyber security solutions. We serve the DoD, the IC, Federal Law Enforcement, and select federal civilian customers worldwide. Federated IT offers a productive, and collaborative work environment and competitive compensation packages including medical and dental insurance, paid time off (15-days) and holidays, tuition reimbursement, 401K, short and long-term disability, HSA/FSA, employee life insurance, and more.


Our Core Principles:  We value our PEOPLE, their integrity, their skills, and their professionalism - they enhance our reputation and ensure the success of our company. We value our position of trust with our clients --we strive to increase it in all interactions. We strive to provide the highest quality products and services at competitive prices; we constantly seek "best value" solutions for our clients.

Summary:


Federated IT seeks a highly qualified HPC Linux System Administrator to conduct Satellite Data Management Support in support of the Fleet Numerical Meteorology and Oceanography Center (FNMOC) in Monteray, California. The objective of this position is to provide system administration support for all aspects of the FNMOC systems that support high-volume processing of satellite data. The position requires the ability to integrate new types of satellite-related data flows (various formats, file sizes, addressing storage space issues, and file quantities) and to ensure down-stream applications can handle and manipulate the data. The performer shall provide maintenance and support to multicore HPC and virtualized and hardware-based infrastructure (typically running mail, Domain Name System (DNS), an identity management system (Red Hat Identity Management System [IdM] or Light Weight Directory Protocol [LDAP]), and data routing) systems, to include all associated subsystems. Support also extends to supporting FNMOC’s backup processes. The performer shall participate in the investigation of relevant systems/subsystems performance including interconnecting networks, storage, patch management, the data flow on the networks, the performance of computers and the overall flow and handling of information across the hardware and software enterprise. Work will consist of the effort required to support the operations and maintenance of HPC systems at FNMOC and shall include operational testing, validation, and documentation necessary for transition to operations. Work will extend to Cyber Security compliance and ensuring system backups occur correctly and on a regular basis. The performer shall be familiar with the High Performance Computing techniques, Linux Systems, system provisioning, Cyber Security Compliance, and be capable of addressing general system administration issues including addressing capacity issues that may affect application and processing performance. The performer shall participate in the investigation of relevant systems/subsystems including interconnecting networks, the data flow on the networks, the

performance of computers and the overall flow and handling of information across the hardware and software enterprise.


Essential Duties and Responsibilites:

  • All work will be performed on site (remote support is not an option)
  • SYSTEM INTEGRATION & SUPPORT
    • Provide end-to-end system integration and provide recommendations to support the performance of systems and their associated subsystems in support of HPC, satellite, and related government requirements. The contractor shall participate in the investigation of relevant systems/subsystems including interconnecting networks, the data flow on the networks, the performance of computers and the overall flow and handling of information across the hardware and software enterprise. Infrastructure support extends to identity management (LDAP/IdM) and node-provisioning via Red Hat Satellite Server.
    • Performance Standard: Provide system integration for systems and sub-systems as they relate to management of satellite data.
    • Assessment Method: Direct observation and review of system integration contributions for relevant area.
  • ARCHITECTURE GUIDANCE
    • Provide technical assistance to HPC Linux customer base regarding system/code/performance issues. This work may require problem resolution or working with an escalation team for problem resolution.
    • Performance Standard: Provide architectural guidance for systems and subsystems as they relate to management of satellite data.
    • Assessment Method: Direct observation and review of architectural contributions for relevant area.
  • NETWORK ATTACHED STORAGE SUPPORT
    • Provide system administration support to Network Attached Storage [NAS] (Currently NetApp systems) to ensure optimal NAS file system support for alpha/beta/operations systems for NIPR/SIPR environments. Resolve issues related to file system performance with subject matter expertise.
    • Performance Standard: Ability to support standard NAS performance issues; routine file system mounting support; and hardware support (e.g. disk drive replacement).
    • Assessment Method: Direct observation and review of supported systems.
  • BACKUP SUPPORT
    • Provide daily backup support to NIPR/SIPR backup systems (currently managed by managed by the IBM Spectrum Protect [a.k.a. Tivoli] product). Ensure regular configuration and verification of backups by management of backup policies to address system and user file backups. Support maintenance of backups, backup-media rotation, and support hardware/firmware/software upgrades.
    • Performance Standard: Ability to keep backup systems are functional and cyber compliant, revision current, and policies are correct allowing recoverable data.
    • Assessment Method: Inspection of backup processes and SW/firmware versions based on Cyber Security requirements.
  • DISA STIG REQUIREMENTS
    • Ensure work performed under this task order complies with applicable Defense Information Systems Agency (DISA) Application Security and Development Security Technical Implementation Guides (STIG) and/or Application Services STIG requirements.
    • Performance Standard: Work shall conform to DISA guidelines.
    • Assessment Method: Observation of work performed.
  • HPC ENGINEERING SUPPORT
    • Provide related engineering support to implement the NIPR/SIPR A2 HPC/Infrastructure systems at FNMOC. This support shall include operational testing, validation, and documentation necessary for transition to FNMOC IT environment.
    • Performance Standard: Ability to provide meaningful engineering support for the complexity of the utilized HPC systems...
    • Assessment Method: Technical review of provided engineering input.
  • g. TECHNICAL INPUT TO PRODUCTION RELATED MEETINGS
    • Participate in production-related technical meetings and provide input to plans, schedules, documents, and engineering tasks as required facilitating system upgrade efforts.
    • Performance Standard: sound engineering and technical input to production related meetings.
    • Assessment Method: Observation of ability to provide meaningful and useful input to technical meetings.

Required Qualifications, Education, and Experience:

  • Active TOP SECRET Clearance
  • DoD 8570 IAT/IAM II (e.g., Security +)
  • Experience with supporting Linux systems and subsystems, including storage and backup solutions, in an operational environment
  • Experience with provisioning Linux systems and performing operational testing
  • Experience with securing Linux systems using tools, including STIGS and vulnerability scanners
  • Knowledge of project management processes and documentation
  • Ability to draft technical documentation targeted at various reader levels, including users, operators, and system analysts

The Successful Candidate will Possess:

  • Prospective candidates should have strong risk management skills, excellent communication, teamwork, and conflict management skills.
  • The candidate must be analytical and effectively able to prioritize needs, requirements, and other issues.
  • Ability to communicate and interact effectively at all levels of staff and management.
  • Ability to exercise independent judgment, develop relationships, and obtain consensus among interested parties.
  • Critical thinker with strong technical skills, diagnostic skills and problem-solving ability
  • Solid written and verbal communication skills to negotiate direction, drive projects and projects to successful conclusion and deliver knowledge to team members verbally and via clear designs, runbooks and technical engineering and exchange sessions
  • Self-starter, flexible, adaptable, collaborative and motivated to champion continuous improvement
  • Ability to develop peer networks across an enterprise to maintain technology awareness and to support resolution of problems
  • Ability to operate across traditional technical boundaries, comfortable working in the compute space as well as the storage space in an operational capacity
  • Technically curious and driven to learn new skills.

General Factors:

  • Depending on project requirements, may be required to work within a compressed schedule; overtime should be expected when schedules demand it.
  • Willing to travel, if needed.
  • No Relocation.

The pay range for this role is:

130,000 - 150,000 USD per year (USN - CNMOC - FNMOC)

Defense

Monterey, CA

Share on:

Terms of servicePrivacyCookiesPowered by Rippling