Java SRE (Site Reliability Engineer) Job at Openkyber, Texas

VWtzTWhjZXYzQUZJOVc4MFVaYTc0TTJWUFE9PQ==
  • Openkyber
  • Texas

Job Description

SRE Engineer with strong Expertise in Alert site Instrumentation

Austin, TX [Hybrid]

Exp: 12+ years

We are currently seeking a highly skilled SRE Sr Engineer with expert-level proficiency in Alert Site Instrumentation & Implementation to lead transformational initiatives in IT operations. This role focuses on designing, implementing, and evolving synthetic alerting for GUI and API endpoints and observability practices driving operational excellence.

Responsibilities:

  • Serve as the enterprise expert for Alert Site Instrumentation & Implementation, ensuring accurate, proactive, and actionable alerts across hybrid environments.
  • Architect and implement end-to-end alert lifecycle management from signal ingestion, deduplication, correlation, enrichment, and routing to resolution.
  • Define and manage alert thresholds, suppression rules, and noise reduction mechanisms to minimize false positives and alert fatigue.
  • Integrate alerts with AIOps platforms for anomaly detection, predictive alerting, and intelligent event correlation.
  • Establish golden-signal metrics (latency, errors, saturation, traffic) as standard alerting blueprints.
  • Implement observability practices with Dynatrace, SolarWinds, Prometheus, Grafana, Splunk, Kibana, and open-source collectors.

Qualifications:

  • 10 12 years of hands-on SRE experience.
  • Expert-level skills in Alert Site Instrumentation & Implementation, including:
  • Synthetic Monitoring setup for Front End GUI Apps and API endpoints ( API, browser scripts) to simulate user journeys.
  • Create availability and performance baselines across geographies.
  • Automate synthetic test creation for regression coverage.
  • Dynamic thresholding, adaptive alerting, and noise reduction strategies.
  • Alert integration with ITSM tools (ServiceNow, Jira, PagerDuty, OpsGenie).
  • Predictive and anomaly-based alerting with AIOps engines.

Dashboard & Visualization

  • Build real-time alert dashboards with Dynatrace.
  • Correlate synthetic test data with backend telemetry for root cause analysis.
  • Develop business-journey dashboards for transaction-level alerting.
  • Hands-on experience with observability stacks (Dynatrace, Datadog, AppDynamics, Prometheus, Grafana, Splunk, Kibana, ELK).
  • Cloud experience: AWS (Control Tower, RDS, SSO, Account Provisioning).
  • Strong containerization/orchestration experience: Docker, Kubernetes.
  • Proficiency in scripting/automation (Python, Ansible, Groovy-DSL, Java, YAML).
  • Knowledge of authentication platforms (Ping, ForgeRock, SiteMinder).

Job Tags

Similar Jobs

PwC

Deals - Business Recovery Services, Senior Associate Save for Later Remove job Job at PwC

 ...transactions and maximise value in their business deals.In deal recovery management at PwC...  ...you need to lead and deliver value at this level include but are not limited to:...  ...PwC does not intend to hire experienced or entry level job seekers who will need, now or in... 

Prairie Band, LLC

Labor and Delivery Registered Nurse Job at Prairie Band, LLC

Description: Make Every Birth a Story Worth Sharing Are you a compassionate and experienced Labor & Delivery RN ready to bring your expertise to a community where your care truly matters? Indian Health Services, Northern Navajo Medical Center in Shiprock, AZ, is seeking... 

Ladder

Apprentice Electrician with Penco Electric Job at Ladder

 ...distribution systems and associated electrical equipment Manage and lead small crews Qualifications: Must have 2 or more years experience Valid Drivers License Clean Motor Vehicle Report Basic computer literacy Capable of lifting and carrying a minimum... 

Royal Caribbean Group

Cruise Staff Job at Royal Caribbean Group

Cruise Staff hosts and participates in entertainment, recreational, and social programs for adults and families in the vessel. You will...  ...team if you have experience in a related role in an upscale cruise ship, resort, or recreational establishment. A college or university... 

Barclays

Head of Trade & Working Capital Product Management, Americas Job at Barclays

 ...provided via a team of specialists across Barclays Corporate Banking footprint. About the TWC Product Management team TWC Product...  ...related client channels across the UK, Europe, APAC & ME and Americas. This includes defining the client experience, digital and...