Java Backend Engineer with Reliability Engineering / Splunk (100% Remote)

Remote Full-time
We are looking for a highly skilled Java Backend Engineer with Reliability Engineering experience to help design, build, and maintain reliable, scalable, and observable backend services. The ideal candidate will have strong hands-on expertise in Java/Spring Boot, microservices, and API development, and a deep understanding of Splunk for observability, monitoring, alerting, and troubleshooting. This role focuses on improving application reliability, performance, and logging quality across distributed systems while collaborating with DevOps, SRE, and platform teams.

Key Responsibilities

Backend Engineering
• Design, develop, and maintain backend services using Java (8+), Spring Boot, and microservices architecture.
• Build scalable RESTful APIs and backend components with a strong emphasis on performance and security.
• Improve service reliability through better error handling, resiliency patterns, and design best practices.
• Contribute to architecture discussions, technical design, and code reviews.

Reliability Engineering & Observability
• Implement and maintain application observability using Splunk (dashboards, alerts, log analysis, correlation searches).
• Optimize log ingestion pipelines and ensure consistent logging standards (structured logs, correlation IDs, trace IDs).
• Monitor application health, performance metrics, latency, errors, and resource utilization.
• Troubleshoot production issues by analyzing logs, SPL queries, and monitoring data.
• Identify and resolve reliability bottlenecks and proactively improve system stability.

Systems Performance & Monitoring
• Develop actionable Splunk dashboards for service KPIs, throughput, latency, and error rates.
• Set up real-time alerts to detect anomalies, failures, or degradation in service behavior.
• Tune SPL queries to improve performance and reduce compute cost.
• Work with DevOps/SRE teams to strengthen monitoring, alerting, and incident response.

Required Skills & Qualifications
• Strong experience with Java (8 or above), Spring Boot, and REST API development.
• Solid understanding of microservices, multi-threading, and design patterns.
• Hands-on expertise with Splunk:
  • SPL queries
  • Dashboards (Classic or Dashboard Studio)
  • Alerts and reports
  • Field extractions & log parsing
• Deep knowledge of logging frameworks (Log4j2, SLF4J, Logback).
• Experience with JSON logging, structured logs, and correlation identifiers.
• Familiarity with CI/CD pipelines, Git, Maven/Gradle.
• Strong debugging and production troubleshooting skills using logs and monitoring tools.
• Experience with Kubernetes, Docker, or cloud-native applications.
• Knowledge of observability tools like ELK, OpenTelemetry, Grafana, Prometheus.
• Understanding of SRE concepts: SLIs, SLOs, SLAs, error budgets.
• Familiarity with message brokers (Kafka, RabbitMQ).
• Experience with Splunk configuration-as-code or automation via REST APIs.