This is the section where round two of the screen actually happens, the closing gate before an
interview hits your inbox. A recruiter takes their time here, and even at that, the current
role still drives around 95% of the result.
That tracks: nothing proves what you can run in production today like the seat you sit in
right now. To earn a "yes", the section has to hit every entry on the
Infrastructure Engineer role profile, one bullet per domain you named in Domain
Expertise above. Every bullet has to come off something you genuinely held in production,
never a ticket that landed on your queue.
1
Compute & Virtualization
The flagship work of the role. Show the compute platform you built, the hypervisor or
bare-metal fleet underneath, and the workloads it now carries. Name the platform and what
it enabled, not "managed servers".
Techniques
Hypervisor design
Bare-metal provisioning
Live migration
Capacity sizing
Tools
VMware vSphere, ESXi
KVM, Proxmox
MAAS, Foreman
Metrics
Hosts under management
Provisioning time cut
Workloads onboarded
2
Networking & Connectivity
The fabric that ties the estate together. Show the network design you built (BGP/OSPF
fabric, top-of-rack topology, VPN and Direct Connect), the load-balancing tier, and the
DNS and edge layer underneath it. Name the design and the SLA it holds, not "worked
on networking".
Techniques
BGP / OSPF
VLAN / VXLAN
L4 / L7 load balancing
VPN & Direct Connect
Tools
Cisco, Arista, Juniper
F5, HAProxy, Nginx
BIND, Infoblox
Metrics
Network availability
Latency cut
Packet loss reduced
3
Storage & Data Services
How the estate stores and protects data. Show the storage tier you designed (SAN, NAS,
object, block), the backup and replication policy, and the DR posture behind it. Name the
platform and the data class it serves, not "managed storage".
Techniques
Block, file, object
Backup & replication
Snapshot lifecycle
RPO / RTO design
Tools
NetApp, Pure, Dell EMC
Ceph, GlusterFS
NFS, iSCSI, S3
Metrics
PB under management
Restore success rate
Cost per TB cut
4
Identity & Security Hardening
What keeps the estate trusted. Show the directory and authentication backbone you run
(LDAP, AD, IdP), the OS hardening baseline, and the patch cycle behind it. Name the
control you enforced and the audit it closed, not "worked on security".
Techniques
LDAP / AD / SSO
OS hardening (CIS)
Patch & vuln cycle
PKI & certificates
Tools
FreeIPA, Active Directory
Vault, sssd
OpenSCAP, Lynis
Metrics
Hosts on baseline
CVEs closed
Audits passed
5
Linux & OS Engineering
The depth that separates an Infrastructure Engineer from someone clicking through a
console. Show the Linux distro you standardize on, the kernel-level work you did
(tuning, networking stack, filesystem), and the troubleshooting story behind a real
outage. Name what you actually tuned, not "Linux admin".
Techniques
Kernel tuning (sysctl)
Systemd, cgroups
Filesystem internals (XFS, ext4, ZFS)
strace, perf, eBPF
Tools
RHEL, Ubuntu, Debian
Bash, Python
kickstart, cloud-init
Metrics
Fleet under standard image
Issues debugged
Performance lifted
6
Automation & IaC
What takes the estate out of click-ops and into versioned code. Show the IaC modules you
authored, the configuration-management policy, and the hosts now provisioned and managed
from code. Name the workflow and what it replaced, not "used Ansible".
Techniques
Reusable IaC modules
Configuration management
Plan-based PR review
Idempotent playbooks
Tools
Terraform, Pulumi
Ansible, Puppet, Chef
Packer, cloud-init
Metrics
Hosts under code
Provisioning time cut
Drift incidents down
7
Capacity, Performance & DR
The discipline that keeps the estate ready for the next quarter and the next failure.
Show the capacity model you wrote, the performance baseline, and the DR drill you ran
against it. Name the incident or audit and what it shifted, not "handled capacity".
Techniques
Capacity modeling
Headroom planning
DR drills
Performance profiling
Tools
Nagios, Zabbix, Prometheus
Grafana, Datadog
iperf, sar, perf
Metrics
RPO / RTO held
Headroom protected
Performance lifted
8
Tooling & Workflow
The setup that lets one Infrastructure Engineer carry a thousand-host fleet. Show the
internal CLI or runbook library you maintain, the change-management workflow underneath,
and the docs that cut on-call ramp. Name the workflow, not "a modern stack".
Techniques
Internal CLI / runbooks
Change advisory boards
Infra PR review
Self-serve docs
Tools
Git, GitLab
Bash, Python
ServiceNow, Jira
Metrics
Runbooks maintained
PR cycle time
On-call ramp cut