Job Summary:
The Senior Server Engineer will be responsible for designing, implementing, managing, and maintaining the company’s IT server infrastructure. This role requires a deep understanding of server technologies, virtualization, storage solutions, and network protocols. The Senior Server Engineer will ensure optimal server performance, availability, and security, as well as support the deployment of critical applications and services across the organization’s infrastructure.
As a Senior Server Engineer, you will also mentor junior engineers, collaborate with cross-functional teams, and lead initiatives to improve infrastructure scalability, reliability, and security.
Key Responsibilities:
- Server Infrastructure Management:
- Design, deploy, configure, and maintain physical and virtual servers across the company’s infrastructure.
- Ensure the stability, reliability, and performance of all server hardware and software components.
- Monitor server performance, identify bottlenecks, and proactively implement improvements.
- Perform regular system updates, patching, and troubleshooting to ensure systems are running smoothly.
- Server Virtualization:
- Implement, manage, and maintain virtualized environments (e.g., VMware, Hyper-V).
- Optimize server resources for virtual machines and manage virtual machine provisioning and life cycles.
- Troubleshoot and resolve virtualization-related issues and performance problems.
- Storage and Backup Solutions:
- Design and manage scalable storage solutions (SAN, NAS) and backup systems for critical data.
- Ensure data protection and disaster recovery procedures are in place and routinely tested.
- Implement and manage backup policies and schedules for all critical infrastructure.
- Network and Security Management:
- Collaborate with network and security teams to ensure that servers are securely configured and protected from external and internal threats.
- Implement and manage firewalls, intrusion detection/prevention systems, and encryption protocols on servers.
- Ensure compliance with company security policies and industry standards (e.g., ISO 27001, GDPR, etc.).
- Automation and Scripting:
- Utilize automation tools and scripting languages (e.g., PowerShell, Python, Bash) to streamline server management tasks.
- Create scripts for system monitoring, automation of routine tasks, and alerts for system performance or security issues.
- Improve system administration efficiency by automating repetitive processes and configurations.
- Collaboration and Support:
- Collaborate with IT teams, including network, database, and security specialists, to deliver integrated and reliable infrastructure solutions.
- Provide technical guidance and mentoring to junior engineers and IT support staff.
- Respond to and resolve complex server-related issues escalated by helpdesk teams.
- Work with project managers to ensure the timely delivery of infrastructure-related projects.
- Disaster Recovery and Business Continuity:
- Develop and implement disaster recovery plans and ensure regular testing of recovery procedures.
- Maintain business continuity strategies and infrastructure resilience to minimize downtime during incidents.
- Ensure that data is protected and can be quickly restored in case of a failure or disaster.
- Performance Monitoring & Reporting:
- Implement server performance monitoring tools to track usage, performance, and health of infrastructure.
- Regularly analyze server performance data and provide reports with recommendations for improvements.
- Maintain documentation for server configurations, processes, and network topologies.
- Server Upgrades and Technology Evaluation:
- Continuously evaluate the latest server technologies, hardware, and industry trends.
- Lead server infrastructure upgrades and transitions to newer technologies when necessary.
- Assess hardware and software needs and recommend appropriate solutions based on performance and cost-effectiveness.
Skills and Qualifications:
- Education:
- Bachelor’s degree in Computer Science, Information Technology, or a related field. Master’s degree is a plus.
- Experience:
- 5+ years of experience as a Server Engineer or System Administrator in a large-scale IT infrastructure environment.
- Proven experience in managing and optimizing Windows and Linux-based servers.
- Experience with virtualization technologies (e.g., VMware, Hyper-V).
- Expertise in server hardware configuration, troubleshooting, and optimization.
- Hands-on experience with storage solutions, backup systems, and disaster recovery planning.
- Technical Skills:
- Proficiency in managing and supporting both physical and virtualized server environments.
- Solid experience with cloud-based infrastructure (e.g., AWS, Azure, Google Cloud) is an advantage.
- Advanced knowledge of server operating systems (e.g., Windows Server, Linux).
- Experience with scripting languages (PowerShell, Bash, Python).
- Familiarity with infrastructure-as-code (IaC) tools such as Ansible or Terraform is a plus.
- Strong understanding of network protocols and concepts (TCP/IP, DNS, DHCP, etc.).
- Certifications:
- Microsoft Certified: Windows Server (e.g., MCSA, MCSE) or equivalent certification.
- VMware Certified Professional (VCP) or other relevant certifications in virtualization.
- ITIL certification (Foundation level or higher) is a plus.
- Any relevant Cloud certifications (e.g., AWS Certified Solutions Architect, Microsoft Azure Administrator) is a bonus.
- Soft Skills:
- Strong analytical and problem-solving abilities.
- Excellent communication and collaboration skills to work with various teams.
- Ability to prioritize tasks effectively and manage multiple projects simultaneously.
- Attention to detail and a commitment to high-quality work.
- Leadership skills, with the ability to guide and mentor junior team members.