Cloud Infrastructure Engineer - CDC

Full Time
Remote
Posted
Job description
Description:


PingWind is seeking a Cloud Infrastructure Engineer to support the NCCDPHP contract. The Centers for Disease Control and Prevention (CDC) National Center for Chronic Disease Prevention and Health Promotion (NCCDPHP), Office of Informatics and Information Resources Management (OIIRM) provides information technology and informatics portfolio management services for the Center. This involves analysis, design, development, implementation, support, and evaluation of all information systems residing on the NCCDPHP Platform, and under the OCIO consolidated platform allocated to NCCDPHP or other Centers. The applications in this portfolio continuously evolve and are subject to change throughout the period of performance of this contract. OIIRM also provides data management, integration services, and consultation to support NCCDPHP lines of business as well as integration with programmatic functions. OIIRM also provides knowledge management services including information retrieval, information mapping, information sharing, daDevta categorization, infrastructure support, and knowledge capture.

Responsibilities

  • Responsible for Production Monitoring expertise and implementing automation components including tools, platforms, process, and policies
  • Roll-out best practices to product and support teams for setting up alerts, monitoring queues, reviewing logs
  • Work with development teams throughout the software life cycle ensuring sustainable software releases
  • Design and build new features for infrastructure and services observability. Dive into new technologies and figure out how to best monitor them
  • Collaborate across Application Development, product, and production management to establish and maintain Service Level Objective (SLO), Service Level Indicator (SLI) for key production services
  • Hands-on experience with cloud-based technologies and tools in configuration management, deployment, monitoring and operations
  • Involved in in Incident, problem and change management processes and tools
  • Troubleshoot key technical issues or escalate and work with appropriate technology teams to provide solutions
  • Perform analysis on logs and use problem solving techniques
  • Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions
  • Maintain / upgrade / patch tracking and documentation software
  • Building a release pipeline to enable fast, but safe delivery of critical business software to Production
  • Develop & Maintain sound version control best practices-based CM systems (GIT) [Gitlab , Azure DEVOPS], including branching and merging strategies
  • Build terraform scripts of resource automation and managing configuration drifts
  • Experience working in a cross-functional team in a dynamic environment:
  • Ability to work independently and deliver to deadlines
  • Ability to solve problems with minimal direction
  • Strong deductive reasoning ability
  • Great attention to detail and accuracy
  • Ability to work in a dynamic team environment using AGILE methodology
  • Ability to mentor juniors
Requirements:
  • Bachelors in Engineering, Information Systems, Computer Science or Information Technology or equivalent experience. Preferably 5+ years relevant experience, will accept 3+ for a strong candidate with Azure Certifications.
  • Must have Demonstrated ability to install, configure, manage & maintain Kubernetes based systems in Azure or AWS (Azure Kubernetes Service – AKS and Elastic Kubernetes Service EKS)
  • Must have Experience with designing Virtual Networks with understanding of Networking on Azure including routing tables, fire wall rules
  • Must have Experience with Azure Private Links and DNS Concepts
  • Must have Demonstrated ability to work with Docker and its orchestration in Kubernetes
  • Working knowledge for Kubernetes, Operators and HELM for managing services on Kubernetes
  • Must have Demonstrated experience with Azure Storage Accounts and Event Service bus
  • Must have Scripting and automation experience using Shell / Perl / Python scripting. Scripting experience with Unix shell, Python, Perl, C
  • Knowledge of Security monitoring code scans and image scans
  • Establish standard backup / recovery policies and procedures
  • End-to-end accountability for Kafka, Elasticsearch environment stability, performance & availability
  • Ability to identify Kafka tracing and data flow issues and help developers prioritize their data flows
  • Strong Admin experience supporting Kafka, Zookeeper, Elastic Search on Kubernetes
  • Familiarity / experience with standard DBMS systems like Postgres, (Synapse or Redshift), Databricks, SQL
  • Familiarity of eco systems like Hadoop, Hive, Spark, zookeeper etc.
  • Ability to implement audit, security and risk controls to secure data in both on-prem & cloud
  • Participate in design discussions and offer consulting services to Business and Architecture community including Logical/Physical design discussions with Architects and Application teams
  • We are looking for proficiencies and architectures related to High Availability, Disaster Recovery & Business Continuity, Backup and Recovery procedures
  • Domain specialist and point-of-contact in core technical capabilities, onboarding new projects, operationalizing procedure, preparing SoP docs
  • Experience providing enterprise production operations support and 24/7 support
  • Your able to bring Application authorities and other infrastructure teams together for finding efficient solutions to issues related to capacity, security, performance
  • Strong dedication/commitment to automation, simplicity, and smooth-running systems

About PingWind

PingWind is focused on delivering outstanding services to the federal government. We have extensive experience in the fields of cyber security, development, IT infrastructure, supply chain management and other professional services such as system design and continuous improvement. PingWind is a VA CVE certified Service-Disabled Veteran-Owned Small Business (SDVOSB) and SBA HUBZone Certified with offices in Washington DC and Northern Virginia. www.PingWind.com

Our benefits include:

  • Paid Federal Holidays
  • Robust Health & Dental Insurance Options
  • 401k with matching
  • Paid vacation and sick leave
  • Continuing education assistance
  • Short Term / Long Term Disability & Life Insurance
  • Employee Assistance Program through Sun Life Financial EAP Guidance Resources

Veterans are encouraged to apply

PingWind, Inc. does not discriminate in employment opportunities, terms and conditions of employment, or practices on the basis of race, age, gender, religious or political beliefs, national origin or heritage, disability, sexual orientation, or any characteristic protected by law

colinoncars.com is the go-to platform for job seekers looking for the best job postings from around the web. With a focus on quality, the platform guarantees that all job postings are from reliable sources and are up-to-date. It also offers a variety of tools to help users find the perfect job for them, such as searching by location and filtering by industry. Furthermore, colinoncars.com provides helpful resources like resume tips and career advice to give job seekers an edge in their search. With its commitment to quality and user-friendliness, colinoncars.com is the ideal place to find your next job.

Intrested in this job?

Related Jobs

All Related Listed jobs