Sunday, March 9, 2025

RCA LEARNINGS

RCA Learning refers to the insights and takeaways gained from conducting a Root Cause Analysis (RCA) on an incident. It focuses on understanding what went wrong, why it happened, and how similar issues can be prevented in the future.


Step 1: Update the RCA Tracking Excel Sheet

You can add a "RCA Learnings" column in your existing RCA tracking sheet. The updated format could look like this:

Incident ID Issue Title Root Cause Resolution RCA Learnings Action Items Status
INC-1234 Policy renewal failure Null pointer exception in API Added null checks in backend Ensure all APIs have null checks; Add test cases for edge scenarios Add test cases in regression suite In Progress
INC-1235 Premium calculation error Incorrect formula used in backend Corrected formula & updated documentation Validate formulas before deployment; Peer review required for calculations Add validation step in CI/CD Completed

Step 2: Copy RCA Learnings to Confluence

Once the Excel sheet is updated, you can periodically copy-paste the RCA learnings into a Confluence page. The Confluence page could be structured as:

📌 RCA Learnings Repository (Updated on DD/MM/YYYY)

This page consolidates key RCA learnings that must be considered during development.

Issue Title RCA Learnings Action for Dev Team
Policy renewal failure Ensure all APIs have null checks; Add test cases for edge scenarios Implement null checks in all APIs; Add edge-case test scenarios
Premium calculation error Validate formulas before deployment; Peer review required for calculations Mandatory peer review for formulas; Automated validation in CI/CD

🔹 Next Steps:

  • Dev team should review this document before development.
  • Any new learnings should be updated in the RCA Excel sheet and synced to Confluence.

List of possible route cause: 

  • Missing Configuration
  • Incorrect Configuration
  • Data Issue
  • Code Defect
  • Environment Issue
  • Third-Party Dependency
  • Network/Connectivity Issue
  • Permissions Issue
  • Unexpected System Behavior
  • Process Gap
  • User Error
  • Security/Access Control
  • Performance Bottleneck
  • Hardware/Infrastructure Issue
  • Automation Failure
  • Kong API Gateway Misconfiguration
  • Kong Routing Issue
  • Kong Plugin Failure
  • 😊

    No comments:

    Post a Comment