Oracle Cloud Infrastructure (OCI) continues to evolve its observability stack, and one of the most critical yet under-discussed changes is documented in MOS note KA1304.

In this post, I will try to break down the technical implications, architectural impact, and recommended actions for enterprises relying on OCI Stack Monitoring.


1. What is KA1304 Really About?

At its core, KA1304 is tied to the deprecation of OCI Stack Monitoring.

  • OCI Stack Monitoring provides:
    • Full-stack observability across applications and infrastructure
    • Metrics collection (performance, availability, errors, usage)
    • Alarm and anomaly detection
  • It supports hybrid environments (on-prem + cloud)

However, Oracle has announced that:

Stack Monitoring is being deprecated and will remain available only until January 23, 2027


2. Why Oracle is Deprecating Stack Monitoring

From an architectural standpoint, Stack Monitoring represents a monolithic observability model. Oracle is now shifting toward:

a) Modular Observability Services

Instead of a single “all-in-one” tool, Oracle promotes specialized services:

  • OCI Monitoring → metrics + alarms
  • OCI Log Analytics → log-based intelligence
  • Database Management → DB-centric insights
  • Oracle Enterprise Manager → full lifecycle management

These services are listed as recommended alternatives .

b) Cloud-Native Design Philosophy

Modern observability trends emphasize:

  • Decoupled telemetry pipelines
  • Service-specific optimization
  • ML-driven analytics

3. Technical Impact on Existing Deployments

If you are currently using Stack Monitoring, the impact falls into three categories:

3.1 Observability Coverage Risk

Stack Monitoring provides holistic topology-based monitoring. Replacing it requires:

  • Reconstructing service relationships
  • Rebuilding dashboards across multiple services

3.2 Automation & Integration Breakage

Custom integrations may depend on:

  • Stack Monitoring APIs
  • Metric pipelines
  • Alarm templates

These must be re-engineered using alternative OCI services.

3.3 Operational Fragmentation

Moving to multiple services introduces:

  • Tool sprawl
  • Increased configuration overhead
  • Cross-service correlation challenges

4. Migration Strategy (Recommended Approach)

A direct replacement is not feasible. Instead, adopt a layered migration model:

Step 1: Inventory Current Usage

Identify:

  • Monitored resources (DB, middleware, hosts)
  • Custom metrics and alerts
  • Dependencies on Stack Monitoring APIs

Step 2: Map Capabilities

Stack Monitoring FeatureReplacement
Infrastructure metricsOCI Monitoring
Log analysisOCI Log Analytics
DB performanceOCI Database Management
Enterprise-wide observabilityOracle Enterprise Manager

Step 3: Rebuild Observability Pipelines

  • Use OCI Monitoring for metrics ingestion
  • Use Log Analytics for correlation
  • Integrate alerts via OCI Notifications

Step 4: Validate Coverage

Ensure parity in:

  • Alert thresholds
  • SLA monitoring
  • Incident response workflows

5. Architectural Insight: What This Means Long-Term

This is a shift,

From:

  • Centralized monitoring platform
  • Predefined topology discovery
  • Unified UI-driven operations

To:

  • Distributed observability services
  • API-driven integrations
  • Composable monitoring architecture

This aligns OCI with industry patterns seen in:

  • Kubernetes-native observability
  • OpenTelemetry pipelines
  • Microservices-based monitoring

OCI Observability Migration Checklist

Having said the above, if you are using Stack Monitoring, a migration checklist like the below, might prove to be useful.

OMC / Stack Monitoring → OCI Monitoring + Log Analytics + DB Management + Enterprise Manager 24ai


1. Discovery & Baseline Assessment

Inventory Monitoring Coverage

  • List all monitored entities:
    • Databases (Oracle / non-Oracle)
    • Middleware (WebLogic, etc.)
    • Compute instances (VM, bare metal)
    • Applications (custom / packaged)
  • Identify environments:
    • OCI regions
    • On-prem / hybrid connectivity
  • Export current topology (if using Stack Monitoring)

Identify Data Sources

  • Metrics sources (CPU, memory, app metrics)
  • Log sources (app logs, audit logs, DB logs)
  • Events / alerts currently configured

Capture Current State

  • Dashboards in use
  • Alert thresholds and escalation rules
  • Integrations:
    • ITSM (ServiceNow, etc.)
    • Notification systems (email, Slack, PagerDuty)

2. Dependency & Risk Analysis

API & Automation Dependencies

  • Identify scripts using Stack Monitoring APIs
  • Identify Terraform / OCI CLI automation tied to monitoring
  • Check custom exporters or agents

Feature Gap Analysis

  • Topology visualization (Stack Monitoring feature)
  • Cross-tier correlation (app ↔ DB ↔ infra)
  • Custom KPIs not natively supported

Risk Categorization

  • Critical alerts that must not break
  • Compliance-related monitoring (audit logs, retention)
  • SLA/SLO tracking dependencies

3. Target Architecture Mapping

Map each capability to OCI services:

CapabilityTarget Service
Infra metrics & alarmsOCI Monitoring
Log ingestion & analyticsOCI Log Analytics
DB performance & tuningOCI Database Management
Enterprise monitoringOracle Enterprise Manager 24ai

Architecture Decisions

  • Define metrics pipeline design
  • Define log ingestion strategy
  • Decide on central vs federated dashboards
  • Define alert routing architecture

4. Environment Preparation

Enable Required Services

  • Activate OCI Monitoring
  • Enable Log Analytics namespace & groups
  • Enable Database Management for all DBs
  • Deploy / upgrade Enterprise Manager 24ai (if used)

IAM & Security

  • Create policies for:
    • Metrics ingestion
    • Log access
    • Alarm management
  • Validate least-privilege access

Agent Deployment

  • Install/upgrade OCI agents on compute instances
  • Configure log collectors
  • Validate connectivity (private endpoints if needed)

5. Metrics Migration

Recreate Metrics

  • Map existing metrics → OCI Monitoring metrics
  • Recreate custom metrics (if any)
  • Validate metric namespaces and dimensions

Rebuild Alarms

  • Recreate thresholds
  • Validate alarm triggers
  • Configure notifications (OCI Notifications)

Validation

  • Compare old vs new alert behavior
  • Run synthetic tests (CPU spike, DB load, etc.)

6. Log Migration

Log Onboarding

  • Identify all log sources
  • Configure ingestion pipelines
  • Assign log groups

Parsing & Enrichment

  • Define parsers for custom logs
  • Validate field extraction
  • Enable indexing for key attributes

Correlation

  • Build queries for:
    • Error detection
    • Performance anomalies
  • Validate search latency and accuracy

7. Database Monitoring Migration

Enable DB Management

  • Enable for all Oracle DB systems
  • Configure performance hub access

Feature Mapping

  • Replace:
    • AWR-based insights
    • SQL performance tracking
  • Validate:
    • Wait events
    • Top SQL queries

8. Dashboard & Visualization Rebuild

  • Recreate dashboards in OCI Monitoring
  • Build log dashboards in Log Analytics
  • Create cross-service views (manual correlation)
  • Validate executive vs operational dashboards

9. Integration Rewiring

  • Reconnect ITSM tools (ServiceNow, etc.)
  • Configure webhook integrations
  • Update notification channels
  • Validate incident workflows end-to-end

10. Testing & Validation

Functional Testing

  • Trigger test alerts
  • Validate log ingestion
  • Confirm DB metrics visibility

Performance Testing

  • Check latency of alerts
  • Validate log query performance

Coverage Validation

  • Ensure no blind spots vs old system
  • Validate all critical systems monitored

11. Cutover Strategy

Parallel Run

  • Run Stack Monitoring + new stack in parallel
  • Compare outputs for 2–4 weeks

Gradual Cutover

  • Migrate non-critical systems first
  • Move critical workloads after validation

Final Switch

  • Disable Stack Monitoring alerts
  • Keep fallback window (1–2 weeks)

12. Decommissioning

  • Remove Stack Monitoring configurations
  • Archive historical data if needed
  • Decommission agents if obsolete
  • Update documentation

13. Post-Migration Optimization

  • Tune alert thresholds (reduce noise)
  • Optimize log retention policies
  • Implement cost controls
  • Introduce automation (auto-remediation)

Conclusion

If your organization depends on Stack Monitoring, the time to act is now. The sooner you redesign your observability strategy around the new/ replacement options, the smoother your transition will be.


Discover more from IT-Noesis

Subscribe to get the latest posts sent to your email.

Leave a comment