DevJobs

Site Reliability Engineer

Overview
Skills
  • Python Python
  • Bash Bash
  • Elasticsearch Elasticsearch
  • Linux Linux ꞏ 3y
  • Jenkins Jenkins
  • Docker Docker
  • Kubernetes Kubernetes
  • Grafana Grafana
  • HTTPS
  • TCP/IP
  • Proxy technologies
  • load balancers
  • HTTP
  • firewalls
  • DNS
  • Prometheus Prometheus
  • cloud platforms
About The Job

We are looking for a hands-on Senior SRE – Network & Proxy Infrastructure to take ownership of a large-scale, global proxy infrastructure.

Our system consists of hundreds of Linux servers distributed worldwide, handling high-volume, real-time traffic. This role sits at the intersection of systems, networking, and production reliability.

You will be responsible for ensuring the stability, performance, and scalability of the platform - from deep packet-level troubleshooting to building monitoring, automation, and operational workflows.

This is not a traditional IT or networking role - it is a production engineering position focused on real-world traffic, performance, and reliability at scale.

Responsibilities

  • Configure, install, and manage proxy systems to optimize network and data flow performance.
  • Investigate and resolve production issues involving latency, connectivity, and routing behavior
  • Monitor system and network health using tools like Prometheus, Grafana, ELK, or similar observability stacks. Implement alerts and dashboards to proactively detect and mitigate service degradation.
  • Write and modify scripts/code to automate network and system operations.
  • Troubleshoot technical issues related to proxies, networks, and server performance.
  • Analyze network traffic (TCP/IP, HTTP/S) to identify bottlenecks and anomalies
  • Work with DevOps tools to manage and maintain server infrastructure.
  • Analyse network traffic data and identify patterns to enhance overall system efficiency.
  • Collaborate with R&D and IT teams to improve system architecture and performance.

Requirements

  • 3+ years of hands-on experience in production systems / SRE / network-oriented roles.
  • Strong understanding of:
    • TCP/IP fundamentals
    • HTTP/HTTPS behavior
    • DNS resolution and networking basics (firewalls, load balancers and Proxy technologies)
  • Strong Linux skills (processes, networking tools, system performance)
  • Proficiency in scripting languages (Bash, Python); experience with automation and CI/CD workflows (e.g., Jenkins).
  • Hands-on experience with DevOps tools (e.g., Linux, Docker, Kubernetes, cloud platforms).
  • Background in customer service/support is an advantage.
Why This Role Is Interesting

  • Work on a globally distributed system at a massive scale
  • Solve real network problems - not just dashboards
  • High ownership and impact on core product performance
  • Close collaboration with engineering on architecture decisions

NetNut takes pride in building a cutting-edge residential proxy network, ensuring unmatched speed and stability for any data collection project. Get access to 85M+ residential IPs worldwide and never get blocked or misled again.
NetNut